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While writing this Foreword, I was reminded of a quote attributed to Mahatma 
Gandhi: “The expert knows more and more about less and less until he knows 
everything about nothing.” The quote illustrates the great dilemma that all of 
us face in modern times: but this 1s especially acute for those engaged in the 
pursuit of science. Compared to the times of Archimedes or Leonardo da 
Vinci or Antonie Philips van Leeuwenhoek, whose range of interests covered 
several disciplines (they looked at the world in its entirety), most of us have 
now become narrow specialists of one kind or another, knowing less and less 
about the wider world. Thus, edited monographs, proceedings of seminars 
and the like have become absolutely essential to keep us informed and 
engaged in research and teaching more meaningfully (such publications 
allow summarizing of recent researches at a more advanced level than is pos- 
sible in ordinary textbooks). 

Turning to plant sciences, the Annual Review of Plant Biology, started in 
the middle of the last century, continues to be an invaluable source of infor- 
mation on the broad advances of plant biology. Yet, it is necessary to have a 
more inclusive look at advances over a somewhat longer period and also have 
this information in a way more organized than the format of annual reviews 
allows. Thus, Prof. Bir Bahadur and his colleagues deserve our grateful 
thanks on undertaking an incredibly difficult task of summarizing advances 
on the very broad front of plant biology — the topics cover not only fundamen- 
tal aspects of plant biology but also plant biotechnology, which is now grow- 
ing almost as a separate discipline. I welcome their style of a historical 
approach (nearly every article follows this style). This approach is often 
neglected by specialists, but the fact is that this is the only way to genuine 
understanding and for a non-expert to easily discern major advances or mile- 
stones. This unity in overall planning and laying out the style has obviously 
been possible due to the fact that two of three co-editors are in fact former 
pupils of the senior editor (Prof. Rajam, the senior most of them, was, in a 
sense, a colleague while I was at Delhi University). Understandably, in the 
combined work on Volumes I and I, Prof. Bir Bahadur 1s author of nearly ten 
chapters and Prof. Rajam author of five chapters. Their two other colleagues 
Dr. Leela Sahijram and Prof. Krishnamurthy have also contributed several 
chapters. Nonetheless, the work has very valuable contributions also from 
several national and international contributors (in Volume 2, there are around 
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ten authors from outside India), which has immensely added to the value of 
this work. 

I think that on the whole, a very laudable contribution has been made. The 
editors have managed to include almost all topics which are significant in 
modern plant biology. In Volume 1, I was delighted to see several chapters 
close to my interest, such as those relating to polyploidy, photosynthesis, 
apomixis and flower development. But in Volume 2, there is special emphasis 
on genomics and plant biotechnology, and there are many other chapters of 
current interest. Space is not adequate to mention all the chapters or their top- 
ics, but to me, those on genetic markers, doubled haploids, plant genomes and 
genomics (there are several on these topics), epigenetic mechanisms, bioin- 
formatics and systems biology were of special interest. Also, I am very 
delighted that Volume 2 starts with an excellent chapter on Arabidopsis thali- 
ana. Inspired by a lecture on Langridge’s work by Prof. Arthur W. Galston, I 
undertook in 1960s to ‘tame’ a wild Indian strain of Arabidopsis by raising 
in vitro cultures. However, despite the fact that Arabidopsis is now the prin- 
cipal material for basic research in plant biology, there are many who have 
never seen a live Arabidopsis plant, and surely, the opening chapter of this 
volume will be valuable for all. 

Although ably aided by his pupils, Prof. Bahadur remains the chief archi- 
tect of this endeavour. And [ am struck with the expanse of his canvas and the 
breadth of his interest — it seems to me that in part, it is due to his early asso- 
ciation with Prof. J.B.S. Haldane, FR.S., whose own interest covered many 
disciplines, from mathematics, biochemistry and genetics to animal and plant 
biology. The topics he and his colleagues cover are of both fundamental and 
applied interest. I have to admit that many of us in universities are a bit distant 
from fields and sometimes unfamiliar with the full potential of fundamental 
discoveries for biotechnological applications. This work will help focus due 
attention of readers on both aspects of plant biology. 

When the chapters were first sent to me, I noticed many typographic mis- 
takes than are normally present in finished manuscripts — it is true that Eng- 
lish is not the mother tongue of many of us in India, but I hope these mistakes 
have been rectified. 

Once again, I wish to congratulate Prof. Bir Bahadur and his colleagues 
for a very unique monograph and insight in modern plant biology. 


Honorary Scientist of the Indian, National Satish C. Maheshwari 
Science Academy, Biotechnology Laboratories 

Centre for Converging Technologies 

University of Rajasthan, Jaipur, India 


Foreword 


The human population is increasing at an alarming rate and is expected to 
reach 11 billion by 2050. As there is a big gap between population growth 
and food production, food security for an ever-increasing population poses a 
major challenge for the present and future times. In fact, it will become nec- 
essary in the coming two decades or so to double food production with avail- 
able arable land; else, it may precipitate great famines in some parts of the 
world. This is not achievable with just conventional strategies like plant 
breeding. However, the projected increase in food production may be achieved 
if traditional breeding methods are coupled with biotechnological approaches 
as the latter can offer novel ways for increasing productivity and quality of 
crops as also for producing an array of useful compounds including pharma- 
ceuticals and biofuels. Indeed, during the past couple of decades, dramatic 
progress has been made in the field of plant genomics and biotechnology. 
Therefore, a need was felt for updating scientific developments in these areas. 

Plant Biology and Biotechnology — Volume 2 was planned to present state- 
of-the-art scientific information on various basic and applied aspects of plant 
genomics. This volume comprises 37 chapters spanning various aspects of 
plant genomics and biotechnology and provides comprehensive and updated 
information on a wide variety of topics including Arabidopsis as a wonderful 
model system for plant research, plant—fungus interactions, microalgae in 
biotechnological applications, genetic markers and marker-assisted breeding, 
doubled haploids in breeding, DNA fingerprinting for plant identification, 
nuclear and organellar genomes, functional genomics, proteomics, epig- 
enomics, bioinformatics, systems biology, applications of tissue culture in 
crop improvement and conservation of plant genetic resources, genetically 
modified crops for production of commercially important products and engi- 
neering abiotic and biotic stress tolerance, RNAi and microRNAs in crop 
improvement and environmental, marine, desert and rural biotechnologies. 
The book can serve as a good reference for plant molecular geneticists, plant 
biotechnologists, plant breeders, agricultural scientists and food scientists. 
Besides, it will also serve as a reference book for post-graduate students, 
researchers and teachers besides scientists working in agri-biotech 
companies. 

Contributors of these volumes were selected from a wide range of institu- 
tions for introducing a diversity of authors. At the same time, these authors 
were selected based on their vast expertise in specific areas of their choice to 
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match the diversity of topics. These authors have a deep understanding of 
their subject to enable them not only to write critical reviews by integrating 
information from classical to modern literature but also to endure an unend- 
ing series of editorial suggestions and revisions of their manuscripts. Need- 
less to say, this is as much their book as ours. 

We hope that these books will help our fellow teachers and a generation of 
students enter the fascinating world of plant genomics and biotechnology 
with confidence, as perceived and planned by us. 


Hyderabad, Telangana, India Bir Bahadur 
New Delhi, India Manchikatla Venkat Rajam 
Bangalore, Karnataka, India Leela Sahijram 


Bangalore, Karnataka, India K.V. Krishnamurthy 
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Abstract 

Arabidopsis thaliana, a small, flowering, self-pollinating weed, has been 
developed into an elegant model system. Concerted effort from the plant 
research community has led to development of extensive genomic 
resources, tools, and techniques. Advances in high-throughput (omics- 
based) approaches and their application in Arabidopsis research have pro- 
vided ample understanding of basic biological processes in plants. Further, 
bioinformatics platforms allow for integration of the multiple “omics” 
data, thus, enhancing our appreciation of biological interactions at an 
organismal level. Taken together, A. thaliana has emerged as an excellent 
reference source for functional and comparative genomic analysis. In this 
chapter, we summarize advances made in the field of Arabidopsis research 
and resources, tools, and technologies available to the plant scientific com- 
munity. In addition, we briefly discuss ways in which knowledge gained 
from this model system can be harnessed for effective deployment in crop 
improvement. 


Keywords 

Arabidopsis thaliana « Model organism ¢ Forward and reverse genetics ¢ 
Functional genomics * Community resources ¢ Crop plants « Plant 
biology 


1.1 Passage to Glory: From “Tiny 
Weed” to a“Model Plant” 


Mendel’s seminal work on Pisum sativum (pea) 
and, later, Zea mays (maize) brought the two 
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suitable for cytogenetic studies, played an instru- 
mental role in providing valuable insights into 
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various facets of plant biology. Horticultural 
plants like tomato (Solanum lycopersicum) and 
petunia (Petunia hybrida) were the other models 
being used by plant geneticists for studying bio- 
logical processes. Despite being extensively used 
in plant biology, these crops failed to develop 
into ideal model systems for studies on molecular 
genetics. A major drawback with most of these 
crop plants being their long generation time and 
complex genomes. Arabidopsis thaliana, a dicot 
flowering weed belonging to the Brassicaceae 
family, was not given much importance until 
Friedrich Laibach included it in his search to 
identify a plant which had fewer numbers of 
large chromosomes suitable for cytogenetic anal- 
ysis. But, due to the small-sized chromosomes, 
Arabidopsis was left out and was not mentioned 
in plant research for long (Meyerowitz 2001). 
Laibach refocused his attention on Arabidopsis 
in 1943 and proposed it as a genetic model owing 
to its short generation time, small size, large 
progeny, and self-pollinating lifestyle with pos- 
sibility of outcrossing (Fig. 1.1). Laibach, along 
with Albert Kranz, further contributed to 
Arabidopsis research by collecting a large number 


Attributes of A.thaliana as a 
Model system 


“Small size- requiring less growth 
space 


-Shorter generation time -increasing 
the pace of research 


-Large progeny for genetic analysis 


-Small genome size (125Mb)- 
completely sequenced 


-Small number of chromosomes (n=5) 
-Amenable to transformation 


- Spectrum of genetic and molecular 
resources 


R. Sivasubramanian et al. 


of natural accessions (750) from around the 
world. George Redei, another plant geneticist, 
extensively worked towards standardizing muta- 
genesis protocols for Arabidopsis and generated 
a collection of X-ray induced mutants. Langridge, 
in 1955, described the first auxotrophic mutant in 
higher plants. Thereafter, the use of Arabidopsis 
mutants to dissect physiological and biochemical 
pathways underlying various biological pro- 
cesses gained momentum. Maarten Koornneef’s 
group at Wageningen Agricultural University in 
Netherlands also started using Arabidopsis 
mutants in a major way and constructed its 
detailed genetic map, further facilitating research 
in Arabidopsis genetics (Koornneef and Meinke 
2010). Around the same time, Estelle and 
Somerville (1986) used Arabidopsis mutants to 
characterize important biochemical processes 
like photorespiration, further emphasizing use- 
fulness of this plant in genetic analysis. Pruitt 
and Meyerowitz (1986) demonstrated _ that 
Arabidopsis had a small genome relative to other 
crop models, thereby making mapping and gene 
cloning comparatively convenient. The next big 
step in Arabidopsis research was the discovery of 





Fig. 1.1 (a) Large number of seeds can be grown on 90 mm petri plate. (b) Rosette of 4-week-old Arabidopsis 
(c) a 5-week-old Arabidopsis plant with the inflorescence flowers 
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Fig 1.2 Milestones in Arabidopsis research 


a simple, convenient Agrobacterium tumefaciens- 
mediated transformation of germinating seeds 
which opened the floodgates for developing vari- 
ous tools for genetic analysis (Feldmann and 
Marks 1987). Clough and Bent (1998) further 
simplified plant transformation by devising the 
“floral dip’ method. All these advances, together, 
brought this weed into limelight as a model plant 
in the field of plant genetics (Fig. 1.2). 


1.2 The Arabidopsis Genome: 


“Catalyst” for Plant Research 


A relatively smaller genome size (approx. 
125 Mb) was the simple reason Arabidopsis was 
chosen as a subject for the first plant genome 
sequencing project. By contrast, the genome size 
of related Brassica napus (rapeseed mustard) and 
Brassica juncea (Indian mustard) is about ten 
times that of Arabidopsis. Similarly, the genome 
of important cereals like rice, maize, and wheat is 
much more complex and roughly about 3x, 45x, 
100x, respectively, compared to Arabidopsis. It 
was the first plant genome to be completely 
sequenced in the year 2000 under an international 
Arabidopsis Genome Initiative (AGI). Analysis 
of the genome using various gene-finding algo- 
rithms, along with supporting data from vast 
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experimental evidences like EST sequences, 
MPSS tags, cDNA clones, etc. predicts about 
33,000 gene models (TAIR 10). The genome 
analysis also revealed that it is enriched for genes 
with an average size of 5 Kb (Bevan et al. 1998). 
It was also observed that there is very little repeti- 
tive DNA compared to any other higher plant, 
which facilitates molecular studies and map- 
based cloning. Release of its genome sequence 
acted as a catalyst for commencement of various 
projects on functional genomics, leading to gen- 
eration of a vast stockpile of resources discussed 
hereunder (Fig. 1.2). 


Genetic Resources 
for Functional Genomics 


1.3 


Properties of a living organism are determined 
mainly by its genetic constitution and its interac- 
tion with the environment. With the ever- 
expanding wealth of genomic data produced by 
genome sequencing projects, the next essential 
step is to decipher the gene function. Multiple 
tools and techniques have been developed for 
Arabidopsis with a focus on dissecting and defin- 
ing its gene function and interactions in a given 
biological process (Table 1.1). 
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Forward Genetic Tools 
for Functional Analysis 


1.3.1 


Forward genetic approach is the classical 
phenotype-based approach for screening mutants 
in a biological pathway or process of interest. 
Large-scale forward genetic screens have pro- 
vided a basis for the discovery of a multitude of 
new genes and pathways fundamental to various 
aspects of plant biology. Gene disruption is the 
most robust and direct approach to address the 
biological function of a gene. Various libraries of 
mutants for forward genetics generated in A. 
thaliana are available as a public resource and are 
discussed below. 


1.3.1.1 EMS Mutagenesis 

Ethyl methane sulfonate (EMS), a known and 
commonly used chemical mutagen (alkylating 
agent), induces point mutations which vary from 


complete knockouts to hypomorphic mutations, 
thus allowing isolation of a series of allelic vari- 
ants of a given gene (Bowman et al. 1991). 
Isolation of weak alleles is advantageous espe- 
cially when characterizing genes involved in 
essential cellular functions. EMS treatment has 
been successfully used for generating a high 
frequency of irreversible, randomly distributed 
mutations across the Arabidopsis genome 
(Greene et al. 2003). To dissect any biological 
process, a saturated mutagenized population is 
screened for a desired phenotype, and the classi- 
cal positional cloning approach is used for identi- 
fying the causal mutation/gene (Fig. 1.3). This 
approach requires the mutant to be crossed to an 
Arabidopsis accession significantly polymorphic 
at the DNA level to generate a segregating F, 
mapping population. The mapping is done in a 
biphasic manner, where coarse mapping with a 
few, well-dispersed genome-wide markers is fol- 
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lowed by fine mapping with a large number of 
region-specific markers. A large number of 
molecular markers have been used for mapping 
in Arabidopsis, viz., simple sequence length 
polymorphism (SSLP), insertions and deletions 
(Indels), cleaved amplified polymorphic 
sequences (CAPS), and single nucleotide poly- 
morphisms (SNPs). Mapping resolution depends 
upon (i) the number of molecular markers 
employed and (11) the number of meiotic recom- 
binants analyzed. Mutations that interfere with 
pattern formation during embryogenesis (Souter 
and Lindsey 2000), branching pattern (Schmitz 
and Theres 1999), flower morphology (Komaki 
et al. 1988), flowering time (Putterill et al. 1995), 
response to hormones, and many cellular/physi- 
ological processes have been identified in simple, 
forward EMS mutant screens. With the advent of 
“next generation sequencing” (NGS), the tradi- 
tional positional-cloning approach can _ be 
replaced by direct identification of mutations by 
whole-genome sequencing. Multiple studies have 
successfully used NGS to identify directly the 
mutation of interest (Schneeberger et al. 2009; 
Austin et al. 2011; Schneeberger and Weigel 
2011; Uchida et al. 2011). 


1.3.1.2 Insertional Mutagenesis and Its 
Modification 

Since the classical positional cloning is an exten- 
sive and a long-drawn exercise for identifying a 
corresponding gene responsible for a phenotype 
of interest, insertional mutagenesis — an alternate 
tool for gene disruption — was developed. 
Integration/insertion of T-DNA/transposable ele- 
ment (TE) into the genic region causes disruption 
of the gene. Since the mutant genes are tagged 
with T-DNA inserts, the gene can be easily iden- 
tified by isolating the sequences flanking the 
insertion sites. Success of this approach depends 
upon (i) an easy and efficient transformation 
system and (11) the ability of T-DNA and trans- 
posable elements to integrate randomly into the 
host genome (Galbiati et al. 2000). Efficient and 
simplified Agrobacterium-mediated floral dip 
transformation method has helped in generating 
exceptionally large numbers of insertional 
mutants and a near-saturation mutagenesis of the 


Arabidopsis genome (Clough and Bent 1998). 
Four major T-DNA mutant collections available 
at TAIR collectively encompass 95 % of pre- 
dicted Arabidopsis genes: (a) SALK_ lines 
(Alonso et al. 2003), (b) GABI-Kat lines (Rosso 
et al. 2003), (c) Syngenta Arabidopsis Insertion 
Library (SAIL) (Sessions et al. 2002), and (d) 
INRA/Versailles lines (Samson et al. 2002). 
These stocks are in the public domain and are 
available from ABRC and NASC stock centers. 
These are popular resources for both forward and 
reverse genetic approaches for functional analy- 
sis. Similar to EMS mutants, insertion mutants 
too have been used for unraveling molecular 
mechanisms underlying various biological pro- 
cesses, ViZ., meiotic recombination in plants 
(Reddy et al. 2003; Kerzendorfer et al. 2006), 
embryo development (Stacey et al. 2002), organ 
development (Dievart et al. 2003), and systemic 
acquired resistance signaling (Maldonado et al. 
2002). 


1.3.1.2.1 Trap lines 

Traditionally, gene identification relied on dis- 
ruption of a gene function leading to a recogniz- 
able phenotype. But most of the genes in 
Arabidopsis and other crop plants are members 
of multigene families and can act redundantly, 
which makes it difficult to characterize them 
using the classical approach. In addition, some 
phenotypic characters are hard to be detected 
unless the mutated gene is studied in a certain 
mutant background which reveals its loss-of- 
function phenotype. Yet another class of genes 
not amenable to classical genetic studies is the 
ones that function at multiple developmental 
stages and whose loss of function may lead to 
lethality at early developmental _ stages. 
Modification of the insertional mutagenesis tool 
kit has led to the development of an alternate 
powerful strategy that permits gene identification 
based on their expression pattern, thus eliminat- 
ing the need for a mutant phenotype (Sundaresan 
et al. 1995; Springer 2000). The basic principle 
underlying this strategy is to randomly integrate 
into the genome a promoterless reporter con- 
struct (gene/promoter trap) or a reporter construct 
with a minimal promoter (enhancer trap) close to 


the end of the insertional element (T-DNA or 
TE). The expression of the reporter gene is acti- 
vated when an endogenous cis-acting promoter 
or transcriptional enhancer is present at the site of 
integration. Bacterial uidA encoding for 
B-glucuronidase (GUS) is a commonly used 
reporter since endogenous B-glucuronidase activ- 
ity in plants is absent (Jefferson et al. 1987). 
Alternatively, light emitting bacterial protein 
(lux) and luciferase (luc) enzyme from the fire fly 
have been used as reporters for nondestructive 
screens. Gene traps have been extensively used to 
unravel genes involved in various developmental 
processes like lateral root formation (Malamy 
and Benfey 1997), female gametophyte develop- 
ment (Springer et al. 1995), embryo development 
(Topping and Lindsey 1997), and floral organ 
development (Nakayama et al. 2005). Trap lines 
have also been used to identify stress responsive 
genes (Alvarado et al. 2004). Besides gene iden- 
tification, several organ-, tissue-, cell-, and stage- 
specific markers have been identified which are 
useful tools in developmental biology studies. 
Additionally, the promoter traplines provide a 
direct access to highly specific promoters. For 
example, to tackle the problem of drought stress 
in crop plants engineering stomatal activity is an 
attractive idea. Guard cells control the influx of 
CO, for photosynthesis and water loss during 
transpiration, and the signaling cascade involved 
in these responses are well dissected (Schroeder 
et al. 2001). Francia et al. (2008) screened gene 
trap and promoter trap lines to isolate stomata- 
specific genes and promoters for biotechnologi- 
cal applications. This approach has also been 
extended to other crops like rice to identify cell- 
type-/tissue-, stage-, and/or conditionally specific 
regulatory elements (Yang et al. 2004). 


1.3.1.3 Natural Variation 
and Association Mapping 

Extensive genotypic and phenotypic variations 
have been documented in natural accessions of A. 
thaliana. Natural variation is the basis for tradi- 
tional linkage mapping/quantitative trait loci 
(QTL) mapping aimed at identifying genes gov- 
erning a trait of interest. F, populations and 
recombinant inbred lines (RILs) have been used 


R. Sivasubramanian et al. 


as experimental populations for QTL mapping. 
RILs allow higher mapping resolution as com- 
pared to F, populations. Over 60 RIL populations 
have been developed and are available to the 
research community through stock centers. The 
wide range of intraspecific diversity (wild acces- 
sions) available in A. thaliana makes it well 
suited for association mapping (Fig. 1.3). 
Association mapping is based on linkage disequi- 
librium (LD) and offers very high resolution in 
comparison to traditional linkage mapping, since 
it takes advantage of historic recombination 
events accumulated over several generations. 
Linkage disequilibrium (LD) in Arabidopsis on 
an average extends over 5—10 kb, thus offering 
nearly single-gene resolution (Kim et al. 2007b). 
Array-based re-sequencing of 20 maximally 
diverse natural accessions of A. thaliana has led 
to development of a _ genotyping array 
(AtSNPtilel) containing probe sets for 2,48,584 
SNPs (Kim et al. 2007b). Given the small size of 
the genome (~125 Mb), this array provides, on 
average, 1 SNP for every 500 bp, sufficient 
enough for genome-wide association mapping. 
There have been several reports of genome-wide 
association studies (GWAS) in A. thaliana 
(Aranzana et al. 2005; Atwell et al. 2010; Brachi 
et al. 2010; Chan et al. 2011; Nemri et al. 2010). 
The SNP chip has been used for genotyping 
around 1,307 accessions, and the data 1s available 
to the public (Horton et al. 2012). Several soft- 
ware and web-based platforms have been 
developed for GWAS in plants, viz., TASSEL 
(Bradbury et al. 2007), GAPIT (Lipka et al. 
2012), GEMMA (Zhou and Stephens 2012), 
Matapax (Childs et al. 2012), and the GWA- 
portal (Seren et al. 2012). A major drawback of 
association mapping is the confounding caused 
due to population structure and the consequent 
increase in number of false positives. In addition, 
identification of epistatic loci continues to be a 
major challenge in GWAS. A new mapping 
design that combines advantages of classical 
QTL mapping and association mapping, known 
as nested association mapping (NAM), has been 
pioneered in maize wherein experimental popu- 
lations derived from crosses of several founder 
lines are used (McMullen et al. 2009; Yu et al. 
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2008). In Arabidopsis, two such mapping popu- 
lations have been developed, viz., AMPRIL 
(Arabidopsis Multiparent RIL) populations 
(Huang et al. 2011) and MAGIC (Multiple 
Advanced Generation Intercross) populations 
(Kover et al. 2009) (Fig. 1.3). The MAGIC popu- 
lation is derived from a heterogeneous stock of 
19 inter-mated accessions which have been com- 
pletely sequenced, and tools required for QTL/ 
association mapping in these populations are 
freely available (Table 1.1). Alternatively, asso- 
ciation mapping can be combined with QTL 
mapping in several independent RIL populations 
to retain statistical power and, yet, not compro- 
mise on resolution of mapping. Though the array- 
based re-sequencing effort led to identification of 
around 250K SNPs, it also revealed that the refer- 
ence accession Col-0 lacks a substantial portion 
of genes present in other accessions. A 1001 
genome project for A. thaliana was announced in 
2007 to sequence genomes of other accessions 
which would contain sequences not present in the 
reference genome (Weigel and Mott 2009) 
(Table 1.1). This multinational effort would not 
only shed light on local polymorphism patterns 
and chromosomal-scale differences but be 
directly useful in QTL and association mapping 
as well. Since many of the accessions sequenced 
are parents of RIL populations, availability of the 
genome sequence may identify polymorphisms 
responsible for various QTLs detected so far. 
Complete-genome sequences will not only help 
identify the causal allele directly in GWAS but 
also assist in predicting activity differences 
between causal alleles and tackling problems of 
allelic heterogeneity and rare variants. 


1.3.2 Reverse Genetic Tools 
for Functional Genomics 


The genome annotations for Arabidopsis have 
been refined overtime and there are more than 
30,000 genes predicted, but the role of majority 
of these genes in various biological processes is 
yet to be elucidated. This poses a major challenge 
and has made it essential to carry out systematic 
genome-wide functional analysis. As emphasized 


above, cloning the genes based on phenotype 
requires tremendous labor and time; therefore, 
complementary reverse genetic approaches are 
developed to directly investigate the gene function 
in specific pathways of interest. Some of the genetic 
resources commonly used in the Arabidopsis 
functional analysis are discussed below (Fig. 1.3). 


1.3.2.1 Sequence-Indexed Insertion 
Mutants 

As a part of the extensive effort to experimentally 
validate the function of all the predicted genes, 
high-throughput thermal asymmetric interlaced- 
PCR (TAIL-PCR) in combination with sequenc- 
ing has made it possible to index all the insertion 
mutants available. The gene indexed T-DNA 
mutant library is a valuable resource for several 
reverse genetic studies. The ultimate aim of the 
project was to identify at least two genetically 
stable loss-of-function mutations in all the pre- 
dicted Arabidopsis genes. Using simple PCR- 
based screening, the mutations can be confirmed 
and one can select for lines that are homozygous 
for the mutation. Recently, Bethke et al. (2014) 
attempted to dissect the role of multiple members 
of the Arabidopsis pectin methyl esterases 
(AtPME) (a 66-member gene family), in pattern- 
triggered immunity and immune responses to 
Pseudomonas, Botrytis, and Alternaria brassici- 
cola. They identified T-DNA mutants 1n multiple 
members of AtPME gene family and analyzed 
single and combinations of multiple mutations to 
address their role in plant defense. 


1.3.2.2 Targeting Induced Local 

Lesions in Genome (TILLING)/ 

EcoTILLING 
EMS mutagenesis can create a higher frequency 
of broad-spectrum mutations as compared to the 
insertion mutagenesis approach. The major draw- 
back of EMS is that the eventual cloning of the 
mutation is tedious. Colbert et al. (2001) devel- 
oped a PCR-based high-throughput strategy for 
identifying a SNP in the gene of interest from a 
mutagenized F, population. TILLING (Targeting 
Induced Local Lesion in Genome) combines the 
robustness of random EMS-induced mutagenesis 
with high-throughput PCR-based screening to 
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identify mutations in the gene of interest. The 
PCR products from the gene of interest are dena- 
tured and reannealed to form heteroduplex which 
is preferentially digested by the miusmatch- 
specific CEL1 endonuclease, and the cleaved 
products are analyzed on a denaturing polyacryl- 
amide gel. Initially, this reverse genetic strategy 
was used to identify numerous mutations in 
Arabidopsis mutagenized populations (Enns 
et al. 2005). The use of this technique was further 
extended for identifying naturally occurring 
genetic variations in the available accessions and 
has been named as EcoTILLING (Comai et al. 
2004). A high-throughput and cost-effective 
TILLING approach has been employed by the 
Arabidopsis TILLING project (ATP) to provide 
series of allelic point mutations for the general 
Arabidopsis community. This strategy can be 
extended to any crop plant for functional genom- 
ics and even crop improvement (Slade et al. 2005; 
Cooper et al. 2013). 


1.3.2.3 RNA Interference (RNAi) 
for Targeted Mutagenesis 

Although insertional mutagenesis is an effective 
method for generating loss-of-function mutants, 
the method falters when dealing with lethal 
genes or those genes which are functionally 
redundant. Targeted gene silencing via anti- 
sense, CO-Ssuppression, posttranscriptional gene 
silencing, and most recently RNA interference 
(RNAi) has emerged as a powerful alternative 
reverse genetic approach for attenuating the 
gene expression. In this approach, stable trans- 
formants expressing double-stranded RNA 
(under a constitutive or inducible promoter) 
against target genes are generated, and the effect 
of knockdown of the gene expression is ana- 
lyzed in terms of the phenotype. In comparison 
to T-DNA mutagenesis, RNAi lines typically 
show a wide spectrum of gene expression, from 
no reduction to complete shutdown (Waterhouse 
and Helliwell 2003). To date, experimental 
proof of function for only 10 % of the predicted 
genes is available. The AGRIKOLA project 
(Arabidopsis Genomic RNAi Knockout Line 
Analysis) aims to create targeted gene knock- 
down lines via RNAi for all the predicted genes 
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in Arabidopsis (Hilson et al. 2004). In this 
project, about 150-600 bp long gene-specific 
tags (GSTs) have been designed for approx. 
25,000 genes which were used to construct 
dsRNA-expressing vectors. These RNAi con- 
structs have been transformed into wild-type 
Arabidopsis plants to generate a library of 
knockdown transformants. These knockdown 
lines will be an invaluable source for determin- 
ing the function of individual Arabidopsis genes 
and, by extrapolation, function of orthologous 
genes in other crop plants as well. 


1.3.2.4 Gain-of-Function Systems 
for Functional Analysis 

Activation tagging is another addition in the arse- 
nal of gene identification tools available to the 
plant scientific community. This is a popular 
gain-of-function approach where the overexpres- 
sion of the gene results in a novel phenotype. 
Gene activation tagging systems have been estab- 
lished in Arabidopsis using either T-DNA vectors 
or transposon-based vectors carrying multimers 
of the 35S CaMV enhancers (Walden et al. 1994; 
Weigel et al. 2000; Nakazawa et al. 2003). 
Agrobacterium-mediated genome-wide random 
integration of the activation construct results in 
upregulation of the gene present in the vicinity of 
the integration site. Ectopic upregulation of the 
gene can result in an observable phenotype. Since 
the gene is tagged, identification by inverse PCR 
or TAIL-PCR can be carried out rapidly. Activation 
tagging approach has been instrumental in deci- 
phering the function of a number of genes includ- 
ing ADR1 in defense response (Grant et al. 2003; 
Aboul-Soud et al. 2009), BAK 1 in brassinosteroid 
signaling (Li et al. 2002), and FT in floral transition 
(Kardailsky et al. 1999). 

FOX hunting system (full-length overexpres- 
sion of cDNA for gene hunting) is an alternative 
approach to activation tagging where the ease of 
transformation of Arabidopsis and the availabil- 
ity of the full-length cDNA sequences have been 
exploited. As opposed to activation tagging, 
genes responsible for the overexpression pheno- 
type can be easily identified. Using a normalized 
full-length cDNA library, Ichikawa et al. 2006 
have developed 30,000 independent Arabidopsis 
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FOX lines which are available from RIKEN. As a 
further extension of this, full-length rice cDNA 
clones have been transformed into Arabidopsis 
with an aim of screening for rice functional genes 
in a high-throughput manner (Sakurai et al. 
2010). 


1.4 Omics: A High-Throughput 
Tool for Deciphering Gene 
Function 

Functional genomics is a genome-wide 


approach that attempts to use the ever expanding 
wealth of data produced through various high- 
throughput analysis for defining the gene func- 
tion and its interactions in a given biological 
process. 


1.4.1 Transcriptomic Resources 


for A. thaliana 


Transcriptomics, a comprehensive study of the 
whole-genome expression, is an informative 
approach towards functional gene analysis. The 
spatial and temporal expression of a gene to a 
certain extent is reflective of its activity within 
the cell. This section discusses the various publi- 
cally available resources for gene expression 
analysis and their application in functional 
genomics. 


1.4.1.1 Expression Profiling Provides 
Insights into Gene Function 

To get a glimpse of the transcriptional activity 
within the cell, a large-scale expressed sequence 
tag (EST) sequencing project was undertaken in 
Arabidopsis. The data generated in the EST 
sequencing was useful in gene discovery as it 
helped in annotating the expressed regions in the 
genome besides providing information about the 
gene expression. Serial analysis of gene expres- 
sion (SAGE) and its various modifications like 
micro SAGE and mini SAGE were commonly 
used by various research groups to identify large 
number of differentially expressed transcripts 
present in different tissues/conditions. More 
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recently, the NGS-based massively parallel 
signature sequencing (MPSS) has gained the 
edge. In this approach, short sequence tags are 
generated for a cDNA library by sequencing 
approx. 20-25 bp from the 3’ side of 
cDNA. Besides discovering novel transcripts, 
MPSS also provides a robust method for assess- 
ing the transcript abundance. Additionally, the 
MPSS platform has been used to identify a large 
number of small RNAs. 

The hybridization-based approaches for 
acquiring large-scale gene expression profiles 
have also been established for Arabidopsis and 
are being continuously improved. The traditional 
microarrays used for analyzing the transcrip- 
tional profiles were biased towards the known 
and predicted genes. With the whole genome 
sequence available, it became possible to develop 
tiling arrays (TA) and whole genome arrays 
(WGA). These arrays cover the entire genome 
with probes either at regular interval (TA) or 
probes that are overlapping along the entire 
length of the genome (WGA). These arrays have 
not only been used for estimating the transcript 
levels but also play a significant role in identify- 
ing novel transcripts, various alternate transcripts, 
and polymorphisms (Mockler et al. 2005). Using 
the WGA, Zeller et al. (2009) studied the stress- 
induced changes in the A. thaliana transcripts in 
response to various abiotic stresses like salt, 
osmotic, cold, and heat. They identified several 
novel stress-induced genes which were missed in 
the earlier classical microarray experiments. 
Thousands of microarray experiments that have 
been conducted in different laboratories with A. 
thaliana now form a part of large quantitative 
data on gene expression in different tissue and in 
response to different treatments and experimental 
conditions. Similarly, a comprehensive expres- 
sion atlas for A. thaliana has been developed 
based on WGA and is available at A. thaliana 
Tiling Array Express (At-TAX) (Laubinger et al. 
2008). The utility of the tiling array has been fur- 
ther extended by combining this platform with 
immunoprecipitation methods for detecting chro- 
mosomal locations at which protein-DNA inter- 
action occurs across the genome (Wang and Perry 
2013). Further, Zhang et al. (2006) generated an 


12 


extensive Arabidopsis 
coupling tiling arrays with methylcytosine 
immunoprecipitation methods. Most of the 
microarray data has been deposited in the public 
databases such as NASC and TAIR, and this data 
can be accessed for analysis either directly from 
these sites or through the various available links/ 
tools like Genevestigator (Zimmermann et al. 
2004) and MAPMAN (Thimm et al. 2004) 
(Table 1.1). 


methylome data by 


1.4.1.2 Co-expression Analysis: Guilt by 
Association 

Since an extensive set of microarray expression 
data across multiple experiments is available, the 
attention has now shifted to exploring the corre- 
lated expression of the entire genome with fine 
focus on defining specific pathways in a biologi- 
cal process. These correlation studies are a pow- 
erful tool to identify new genes which could be 
functionally related. Several co-expression anal- 
ysis interfaces like ATTED-II and CressExpress 
have been developed that use the comprehensive 
collection of the publicly available transcriptome 
data sets to identify co-regulated genes 
(Nakashima et al. 2009). The PLAnt co- 
Expression database (PLANEX) is a genome- 
wide database which includes publicly available 
GeneChip data obtained from the Gene 
Expression Omnibus (GEO). A comparative 
transcriptomic analysis using meta-analysis 
approach for drought (abiotic) and bacterial 
(biotic) stress response in rice and Arabidopsis 
identified several genes, common to both the 
stresses (Shaik and Ramakrishna 2013). This 
identification of master regulatory genes that act 
in biotic and abiotic stress response would be the 
potential candidates for manipulating stress toler- 
ance in crop plants as well. 

Though co-expression analysis proves to be a 
powerful tool to identify new genes which could 
be functionally related, one needs to keep in mind 
that the co-expression analysis is a reflection of 
regulation at the mRNA level only. The analysis 
would gain weightage if any known protein—pro- 
tein interaction information can be integrated 
into the co-expression analysis. 
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1.4.1.3 Small RNA Database and Tools 
for Functional Genomics 

The regulation of gene expression occurs at 
multiple levels including mRNA stability. It has 
become evident that small RNAs are one of the 
major players in regulating gene expression 
during growth, development, and _ stress 
responses in plants. The cost-effective next 
generation sequencing has made it possible to 
discover hundreds of small RNA _ from 
Arabidopsis and other plants. This extensive 
data is available through the web interface —the 
Arabidopsis Small RNA Project (ASRP) which 
also integrates the community-wide resources 
related to small RNA and various bioinformatic 
tools for mi- and siRNA identification 
(Backman et al. 2008). The Arabidopsis model 
system has been useful in dissecting the small 
RNA component of genetic and epigenetic reg- 
ulation in plant development, growth, and dis- 
ease resistance. Palatnik et al. (2003) showed 
the involvement of microRNA in controlling 
leaf morphogenesis. Auxin signaling responses 
were also shown to be regulated via small 
RNA. Several auxin response factors (ARFs) 
were predicted to be targets for miRNA. ARF10 
and ARF17 contain potential sites for miR160 
(Jover-Gil et al. 2005). Overexpression of 
miR160 resistant version of ARF17 led to 
higher accumulation of ARF17. These changes 
in expression correlate with the pleotropic mor- 
phological abnormalities and reduced fertility 
observed in the transgenic suggesting the regu- 
lation of ARFI7 by miR160 (Mallory et al. 
2005). Similarly, another miR393 also plays a 
significant role in integrating the environmental 
cues to auxin signaling pathway (Windels and 
Vazquez 2011). 

Additionally, using the basic principle of 
small RNA-directed gene silencing, virus 
induced gene silencing (VIGS), hairpin-based 
RNA interference (RNAi), and artificial 
microRNA (amiRNA) tools have been developed 
to regulate targeted gene expression. The use of 
these strategies has been further extended for 
selectively regulating gene expression in crop 
plants as well. 
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1.4.1.4 Tools for Regulatory Sequence 
Analysis 

The control of gene expression is pivotal to all 
cellular processes, and one of the major chal- 
lenges in biology is to unravel the mechanisms 
that regulate gene expression. The gene function 
is directly linked to its spatial and temporal 
expression which is regulated by a network of 
transcription factors, the key regulatory proteins. 
The cues for gene regulation are hard wired into 
the promoter region which is formed by cis-regu- 
latory elements. These cis-regulatory elements 
are recognized by specific transcription factors. 
Therefore, in order to understand the gene expres- 
sion and thereby the gene function, basic infor- 
mation on the transcription factors and their 
binding sites is important. A. thaliana encodes 
more than 1,500 transcription factors which are 
classified into 40-50 families based on the 
sequence similarity (Riechmann et al. 2000). 
Arabidopsis Gene Regulatory Information Server 
(AGRIS) is the interface that hosts AtcisDB, 
AtTFDB, AtRegNet, and ReIN_ databases 
(Table 1.1), which provide a catalogue of cis ad 
trans factors involved in gene regulation. 


1.4.2 Epigenomic Resources 


Epigenetics is the study of changes in the regula- 
tion of gene expression that do not involve a 
change in the DNA sequence. Epigenetic changes/ 
modifications include methylation of the DNA, 
chemical modification of the histones, and alter- 
native histone variants. These modifications are 
known to play important roles during develop- 
ment and in responses to different environmental 
cues. An integrated epigenome map of Arabidopsis 
was published in 2008, which describes interac- 
tions between the methylome, transcriptome and 
the small RNA transcriptome, and their effect on 
gene regulation (Lister et al. 2008). Further, the 
epigenetic variation between the different acces- 
sions of Arabidopsis and its dependence on the 
genetic variation between the accessions has been 
reported (Schmitz et al. 2013). DNA methylations 
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and their effect on the transcriptional activity 
have been studied in response to biotic stress 
(Dowen et al. 2012). The Epigenomics of Plants 
International Consortium (EPIC) provides a list 
of epigenetic resources in the form of databases 
and tools, which are available to the research 
community (Table 1.1). Also, The EPIC-CoGe 
Epigenomics Webbrowser provides a_ user- 
friendly interface to scan and search for epigen- 
etic marks in the genome (Table 1.1). 


1.4.3. Proteomics Resources 


Transcriptional analysis in Arabidopsis using 
the whole genome arrays and tiling arrays sug- 
gested that the transcriptional capacity of the 
Arabidopsis genome is far more than predicted 
by genome annotation. Many non-annotated 
intergenic regions were transcribed and a lot of 
antisense transcripts were identified. These find- 
ings prompted the scientists to look at the global 
translational products and their posttranslational 
modifications which are known to play a key 
role in many cellular processes like cell signal- 
ing, regulation of gene expression, protein 
degradation, and protein-protein interactions. 
“Proteomics” is the study and characterization 
of the complete set of proteins present at a given 
time in the cell and organelle using gel- and 
mass spectrometry-based high-throughput tech- 
niques. Proteome profile of Arabidopsis has 
led to the identification of several organ-specific 
biomarkers for different developmental stages, 
organs, and undifferentiated cell cultures 
(Baerenfaller et al. 2008). A proteome map of 
different root cell types using Arabidopsis root 
marker lines and fluorescence activated cell 
sorting (FACS) was recently released (Petricka 
et al. 2012). Proteome profiling of Arabidopsis 
under diverse abiotic stresses such as cold, 
drought, salinity, hypoxia, etc. (Kosova et al. 
2011; Ghosh and Xu 2014) determined the role 
of several families of transcriptional factors 
(TFs) and protein phosphorylation/dephosphor- 
ylation events in mediating stress response. For 
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instance, analysis of Arabidopsis nuclear 
proteome and its variation under cold stress 
identified several differentially expressed tran- 
scriptional factors (Bae et al. 2003). A compre- 
hensive and_ well-annotated database of 
transcription factors like Stress Responsive 
Transcription Factor Database (STIFDB) 
(Shameer et al. 2009) and Arabidopsis Stress 
Responsive Gene Database (ASRGDB) 
(Borkotoky et al. 2013) provide a useful resource 
to study gene regulatory pathways. MASC-P is 
a web-based interface which contains a compre- 
hensive set of databases containing proteome 
profile of not only Arabidopsis but also other 
plant species as well. 

Proteomics has proved to play a significant 
role in characterizing virulence and pathogenic- 
ity factors produced by pathogens. Exoproteome 
of phytopathogenic fungus Sclerotinia sclerotio- 
rum identified several cell wall degrading 
enzyme, in addition to the uniquely identified 
a-arabinofuranosidase, formerly not been 
detected by EST studies, thus highlighting the 
importance of proteomic studies (Yajima and 
Kav 2006; Mehta et al. 2008). With respect to 
plant responses, proteomic studies on 
Arabidopsis—A. brassicicola pathosystem identi- 
fied the role of different pathogenesis-related 
proteins PR4, glycosyl hydrolases, and 
glutathione-S-transferases in mediating defense 
response (Mukherjee et al. 2009). Proteome 
analysis of Arabidopsis plants challenged with 
Pseudomonas syringae pv. tomato (Pst) identi- 
fied several phosphorylated  glutathione-S- 
transferases and peroxiredoxins involved in 
redox signaling (Jones et al. 2006). Cecconi 
et al. (2009), using the same pathosystem, 
addressed the role of nitric oxide (NO) in hyper- 
sensitive response. They identified several pro- 
teins undergoing tyrosine nitration during an 
incompatible interaction. Most of these proteins 
were enzymes associated with important cellular 
processes like photosynthesis, glycolysis, and 
nitrate assimilation, indicating their importance 
in mediating plant hypersensitive response. A 
high-throughput proteomic study thus presents a 
new way to gain deeper insights into organismal 
biology. 
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1.4.4 Metabolomics Resources 


Plants respond to changes in the environmental 
conditions by altering their metabolism. 
Transcriptome and proteome analyses provide 
useful insights into the metabolic network that 
contributes to a biochemical reaction, but these 
techniques are not all-encompassing methodolo- 
gies. Metabolomics hence can be used as an addi- 
tional tool to characterize the degree of impact of 
environmental perturbations. The basic require- 
ment for metabolomics study is to quantitatively 
and qualitatively analyze all the metabolites in a 
given sample in a truly unbiased manner. To date, 
“metabolite profiling’ has mainly been _per- 
formed using NMR, Fourier transform infrared 
(FT-IR) spectroscopy, or MS techniques. Besides 
relative quantification, metabolomics also help in 
de novo identification of unknown metabolites 
whose presence has been demonstrated. The met- 
abolic response of plants to various biotic 
(Balmer et al. 2013) or abiotic (Kaplan et al. 
2004) stresses has recently received attention. 
Metabolite profiling of Arabidopsis plants 
subjected to high salinity revealed the role of 
polypropanoid pathway and glycine betaine bio- 
synthesis in mediating salt stress response (Kim 
et al. 2007a). Similarly, molecular basis of freez- 
ing tolerance in different A. thaliana accessions 
has also been addressed using metabolomic 
approach. Enhanced freezing tolerance in acces- 
sions originating from Scandinavia to the Cape 
Verde Islands has been found to be associated 
with downregulation of photosynthesis and 
hormonal responses and induction of flavonoid 
metabolism (Hannah et al. 2006). Metabolomics 
has also emerged as a functional genomics tool 
for elucidating the functions of many unknown 
genes. For instance, genes associated with gluco- 
sinolate, flavonoid, and sterol biosynthesis have 
successfully been functionally annotated using 
the metabolomic approach (Hirai et al. 2004; 
Morikawa et al. 2006; Schauer and Fernie 2006; 
Tohge et al. 2005). Till date, approximately 1,800 
metabolites have been detected in Arabidopsis. 
Many publicly available databases like 
AtMetExpress and Plant Metabolic Network 
(PMN) have been developed to provide the 


1. Arabidopsis thaliana: A Model for Plant Research 


scientific community with access to the metabo- 
lomics data along with tools to allow for its inter- 
active analysis. 


1.4.5 Systems Biology:“Integrating 
the Omics” 


The omics approaches have allowed us to sneak 
a glance at each step of translation of the genetic 
information to the trait, and in this process, a 
large quantity of high-quality data has been gen- 
erated. Systems biology integrates the omics 
data and allows us to study the interactions 
among various biological components such as 
genes, metabolites, proteins, and regulatory ele- 
ments. The study of interactions involves majorly 
four types of networks: (1) gene—metabolite net- 
works, (11) transcriptional regulatory networks, 
(111) gene regulatory networks, and (iv) protein— 
protein interaction networks. Gene—metabolite 
networks have been used to study the responses 
of the plant during sulfur and nitrogen starvation 
(Hirai et al. 2004) and during biotic stress 
(Goossens et al. 2003). Also, gene expression 
data has been used in the context of metabolic 
pathways to dissect out the interaction of sugar 
and circadian rhythm signaling during the diur- 
nal cycles (Blasing et al. 2005; Gibon et al. 
2006). Diverse biological processes within a cell 
are mediated by multiple protein complexes 
consisting of individual proteins organized by 
their protein—protein interactions (PPI). Several 
large-scale protein interaction studies have 
been carried out in Arabidopsis (Braun P et al. 
2011; Mukhtar et al. 2011), and multiple 
databases with PPI data have been established. 
Apart from PPI, genetic interactions have also 
been mapped for many genes in Arabidopsis, 
and data from both genetic and protein—protein 
interactions have been consolidated to define an 
interactome of Arabidopsis. Some of the data- 
bases that contain curated PPI and genetic 
interaction data include Biological General 
Repository for Interaction Datasets (BioGRID) 
(Stark et al. 2006, 2011), IntAct Molecular 
Interaction database (Aranda et al. 2010), 
Arabidopsis thaliana Protein  Interactome 
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Database (AtPID) (Cui et al. 2008), Biomolecular 
Interaction Network Database (BIND) (Bader 
et al. 2003), the Molecular [NTeraction database 
(MINT) (Ceol et al. 2010), and Search Tool for 
the Retrieval of Interacting Genes/Proteins 
(STRING) (Jensen et al. 2009; Szklarczyk et al. 
2011) (Table 1.1). Knowledge from these inter- 
actions has been used extensively to study and 
understand various complex cellular signaling 
pathways. In Arabidopsis, this data has been 
used to define the interfaces of various pathways 
involved in retrograde signaling (Glasser et al. 
2014), to study how WRKY transcription factors 
interact with themselves and other proteins to 
regulate various biological processes (Chi et al. 
2013), and to understand how the JAZ proteins 
in the jasmonate signaling cascade interact with 
other transcription factors and proteins from 
other hormone signaling pathways to regulate 
male fertility, trichome development, and other 
biological processes (Pauwels and Goossens 
2011). Transcriptional regulatory networks 
describe the interactions between transcription 
factors and their downstream targets/genes. 
Transcriptional regulatory networks have been 
used to dissect out the regulatory network of glu- 
cosinolate biosynthesis and identify major tran- 
scription factors regulating glucosinolate 
biosynthesis (Hirai et al. 2007). Transcriptional 
regulatory networks have also aided in elucidat- 
ing the regulatory mechanisms controlling dehy- 
dration, cold stress response, and systemic 
acquired resistance (SAR) (Yamaguchi- 
Shinozaki and Shinozaki 2006). Gene regulatory 
networks define gene—gene interactions during a 
biological process and reflect the dynamics of a 
biological system. Gene regulatory networks 
have been mostly described for developmental 
processes that are well studied in Arabidopsis 
such as flower development (Espinosa-Soto 
et al. 2004) and guard cell size (Li et al. 2006). 
Several large-scale studies have been carried out 
involving various omics technologies such as 
genomics, proteomics, transcriptomics, interac- 
tomics, and metabolomics. Datasets generated 
from such studies need to be integrated to derive 
models for describing biological interactions at 
the organismal level. 
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1.5 Translating Weed Power: 


From Arabidopsis to Crops 


Fourteen years after the release of the whole- 
genome sequence, Arabidopsis research is more 
vibrant and vigorous than ever. There has been an 
enormous development of new high-throughput 
tools, techniques, and a parallel surge of advance- 
ment on the bioinformatics front enabling the 
researchers to handle and analyze the available 
data. The studies on Arabidopsis have generated 
a wealth of new fundamental insights in all areas 
of biology. Large genetic and genomic resources 
and extensive set of expression, proteome, and 
metabolome data covering a range of environ- 
mental conditions and various developmental 
stages have made A. thaliana an excellent refer- 
ence source for functional and comparative 
genomic analysis in other crop plants. The trans- 
lation of this knowledge from Arabidopsis to 
crop plants has been multipronged. One approach 
has been to look for orthologues in other crop 
species to identify genes regulating agronomi- 
cally important traits. The other approach is to 
the ectopically overexpress the Arabidopsis 
genes in crop species for improving the traits. 
Additionally, the Arabidopsis system has proved 
to be a useful and time-saving testing ground for 
components discovered in crops. 


1.5.1 Arabidopsis: A Reference 


for Brassica Genomics 


The genus Brassica, within Brassicaceae family, 
is agronomically the most important genus since 
it is a rich source for oil seeds, vegetable, and 
fodder crops. Six different Brassica species are 
extensively cultivated all over the world, out of 
which three are diploids (B. rapa, B. nigra, B. 
oleracea) and three are amphidiploids (B. juncea, 
B. napus, B. carinata). The Brassica and 
Arabidopsis lineages have diverged approxi- 
mately 14—20 million years ago. Previous studies 
have shown that the present diploid species in the 
Brassica genus have descended from a common 
hexaploid ancestor (Lagercrantz and Lydiate 
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1996). Comparative mapping — between 
Arabidopsis and B. nigra and B. oleracea indi- 
cate that the Brassica diploid genomes have 
evolved through extensive duplication followed 
by frequent fragmentation and rearrangements 
(Lagercrantz 1998; Town et al. 2006). 
Comparative mapping in the amphidiploid B. 
napus with Arabidopsis revealed that there are 21 
syntenic blocks shared between them which can 
be duplicated and rearranged to represent 90 % of 
the Brassica napus genome (Parkin et al. 2005). 
The Arabidopsis genome can be represented as 
gene blocks (from A to X) based on its synteny 
with the B. rapa and B. nigra genomes (Schranz 
et al. 2006). These gene blocks can be rearranged 
to model genome structures of other crucifer spe- 
cies where genome information 1s limited. 

Flowering time and its control are of impor- 
tance in agriculture as well as to the understand- 
ing of plant development. In Arabidopsis, the 
pathway that promotes flowering in response to 
photoperiod changes is well studied and charac- 
terized (Mouradov et al. 2002). Some of the major 
players involved in the control of flowering time 
are CO (daylength response), FLC (repressor of 
flowering and vernalization responsive), FRI (ver- 
nalization responsive), FT, and SOC1 (flowering 
time). Homologues of CO were found to be col- 
located with loci that regulate flowering time in B. 
nigra (Lagercrantz et al. 1996). Similarly, QTLs 
responsible for vernalization-responsive flower- 
ing time in B. napus (Osborn et al. 1997; Long 
et al. 2007) B. rapa (Schranz et al. 2002; Lou 
et al. 2007) and B. oleracea (Okazaki et al. 2007) 
co-localized with homologues of the FLC gene of 
Arabidopsis. Identification of QTLs for other eco- 
nomically important traits such as oil composi- 
tion, glucosinolate content, and disease resistance 
has also been expedited by the information from 
Arabidopsis. QTL studies for oil components 
such as oleic acid and erucic acid have identified 
orthologues of Arabidopsis FAD2, FAD3, and 
FAE1 gene, respectively, as the genes controlling 
variation for the traits (Fourmann et al. 1998; Hu 
et al. 2006). Similarly, genes for clubroot resis- 
tance were identified in B. rapa through compara- 
tive mapping with Arabidopsis (Suwabe et al. 
2006). 
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1.5.2. Insights into Crop 


Developmental Biology 


Various aspects of plant development and physi- 
ology have been deciphered using Arabidopsis, 
and the insights gained have improved our under- 
standing of similar biological processes in the 
crop counterparts. Fruit ripening in horticultural 
crops is one such important trait as inefficient rip- 
ening or overripening results in unpalatable hor- 
ticultural products. The phytohormone ethylene 
is known to play an important role in fruit ripen- 
ing. A lot of effort is spent in controlling the eth- 
ylene effects to save the products especially 
under postharvest storage conditions. The basic 
knowledge regarding the mode of perception and 
action of ethylene has been elucidated in depth 
mainly from the work done in Arabidopsis. A 
classical genetic screen of EMS-mutagenized 
population for a triple-response phenotype led 
to isolation of mutants which were either insens1- 
tive to ethylene or showed enhanced ethylene 
responses even in the absence of ethylene 
(Bleecker et al. 1988). The ensemble of advanced 
molecular genetic tools developed in Arabidopsis 
facilitated the rapid identification of several 
components of the complex ethylene signaling 
pathway like the ethylene receptors ETR1, ETR2, 
ESRI, and EIN4 (Stepanova and Ecker 2000). 
ETRI1 belongs to a family of transmembrane 
proteins. Orthologues of ETR1 have been found 
in other plant species. A dominant negative 
mutation of the Arabidopsis ethylene receptor 
etrl-1 when heterologously expressed in tobacco, 
tomato, or petunia resulted in ethylene insensi- 
tive transgenics (Wilkinson et al. 1997). Similarly, 
Constitutive Triple Response! (CTR1) identified 
in Arabidopsis as a negative regulator of the eth- 
ylene response pathway has homologues in other 
crop plants, and some of them can complement 
the weak Atctrl-18 mutants thereby suggesting 
functional similarity as well (Adams-Phillips 
et al. 2004). These observations indicate that the 
mechanism of ethylene perception and response 
defined in the model plant is largely conserved in 
other crop species as well. Another integral com- 
ponent of the ethylene signaling pathway EIN2 
when mutated results in a complete block of the 
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ethylene responses (Bisson and Groth 2011). 
Based on the sequence similarity, two rice ortho- 
logues of EIN2 were identified and reported to be 
positive regulators of ethylene signaling similar 
to the Arabidopsis counterpart (Jun et al. 2004). 
Since the ethylene signaling pathway is highly 
conserved in crop plants, manipulating the ethyl- 
ene production and response at the molecular 
level has been explored as an efficient strategy to 
prolong the shelf life of fruits and ornamentals. 
Floral longevity of petunia and carnation flowers 
has been extended by either overexpressing the 
etrl-1 dominant mutant under fbp1, a floral spe- 
cific promoter, or by suppression of ein2 
(Wilkinson et al. 1997; reviewed in Stearns and 
Glick 2003). 

The root system plays a diverse and crucial 
role in the overall growth of the plant. The root 
system not only provides anchorage and mechan- 
ical strength but also determines the amount of 
water and nutrient uptake from the soil. In addi- 
tion, it is the major site of interaction with symbi- 
otic microorganisms. In light of the diverse role 
the underground system has in the overall plant 
growth, breeders have focused their attention on 
this system for improving crop productivity. A. 
thaliana has been instrumental in providing 
insights into several aspects of root development 
such as description of cellular structure of the 
root (Dolan et al. 1993). Fitz Gerald et al. (2006), 
using Arabidopsis, identified a QTL involved in 
regulating size and plasticity of the root system. 
Comparative analysis of genome-wide transcrip- 
tional changes associated with the process of 
lateral root development in the model plants 
Arabidopsis and maize has helped identify a core 
set of genes putatively conserved between both 
species. This helps researchers extrapolate 
knowledge gained from Arabidopsis to crop spe- 
cies for improvement of root architecture. 


1.5.3. Engineering Stress Tolerance 


The ever-increasing demand for agricultural 
products due to population growth, coupled with 
the impending climate change, has imposed 
immense pressure on agricultural production. 
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Minimizing yield losses from various biotic and 
abiotic factors through development of stress- 
tolerant crops is, therefore, essential for sustain- 
able food production. 


1.5.3.1 Abiotic Stress 

Abiotic stress responses well studied in 
Arabidopsis include salt, cold, and drought 
stress tolerance. Membrane transport proteins 
(transporters) are involved in transport of miner- 
als, sucrose, and water and are implicated in 
various pathways mediating abiotic stress toler- 
ance. The vacuolar ion transporter, ALNHX1 in 
Arabidopsis, has been shown to confer tolerance 
to salt stress in B. napus (Zhang et al. 2001) and 
tomato (Zhang and Blumwald 2001). Also, 
AtNHX1 orthologues have been isolated from 
various other crop species such as rice (Fukuda 
et al. 1999), buckwheat (Cheng et al. 2007), and 
sugar beet (Xia et al. 2002). Drought tolerance, 
as a trait, has received much attention due to its 
direct implication in agriculture, and multiple 
pathways involved in drought-stress tolerance 
have been elucidated in Arabidopsis. Certain 
families of transcription factors such as the AP2/ 
ERF family, DREB (dehydration responsive ele- 
ment binding), and NAC family have been 
largely implicated in mediating drought toler- 
ance. DREB/CBF proteins identified in 
Arabidopsis have been ectopically expressed in 
wheat, rice, and tomato to confer drought-stress 
tolerance (Wang et al. 2003; Pellegrineschi et al. 
2004; Oh et al. 2005). Arabidopsis, being easily 
amenable to genetic transformation, has also 
been used for testing the function of genes iso- 
lated from various crop species. Constitutive 
overexpression of CBF orthologues from rice 
and maize has resulted in drought tolerance in 
Arabidopsis (Dubouzet et al. 2003; Qin et al. 
2004). Also, PI4K = (phosphatidylinositol 
4-kinase) gene from wheat has been ectopically 
overexpressed in Arabidopsis to confer drought 
tolerance (Liu et al. 2013). 


1.5.3.2 Biotic Stress 

Crops suffer heavy losses due to diseases caused 
by a range of phytopathogens. Improving disease 
resistance has been an important mandate in most 
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of the crop breeding programs. For devising new 
crop protection strategies, it is fundamental to 
understand the biology of the plant responses to 
pathogen invasion. Arabidopsis has been estab- 
lished as a model for molecular investigations 
of resistance to different crop pathogens like 
Albugosp., Hyaloperonosporasp., Colletotrichum 
sp., Plasmodiophora brassicae, plant viruses, 
Ralstonia sp., etc. Arabidopsis research has not 
only contributed immensely to delineating vari- 
ous defense responses to biotrophic and necrotro- 
phic pathogens but also yielded insights into 
conserved plant responses across different classes 
of pathogens. “R” genes identified in Arabidopsis 
can be used to engineer resistance into crop 
plants. The Arabidopsis—Albugo candida patho- 
system has led to identification and molecular 
characterization of white rust resistance 4 
(WRR4), a TIR-NB-LRR gene. WRR4 confers 
broad-spectrum resistance against Albugo sp. and 
has the potential for use in Brassica crops. 
Recently, transgenic B. juncea and B. napus 
expressing AtWRR4 from Arabidopsis have been 
reported to show improved resistance against 
multiple races of A. candida (Borhan et al. 2010). 
Various research groups are using the natural 
genetic variation existing in Arabidopsis acces- 
sions for identifying novel resistance genes via 
association mapping approaches. Bouwmeester 
et al. (2011), in a similar approach, identified 
Arabidopsis Lectin Receptor Kinase 1.9 
(AtLecRK 1.9) which is capable of recognizing 
Phytophthora infestans, a notorious pathogen of 
potato. Interfamily transfer of the gene AtLecRK 
1.9 to potato and tobacco could confer resistance 
against P. infestans and, hence, can be used in 
breeding strategies to develop durable late-blight 
resistant Solanaceous crops (Bouwmeester et al. 
2014). Similarly, NPR1 (non-expressor of PR1), 
a master regulator of the SA pathway, plays a 
critical role in systemic acquired resistance 
(SAR). Overexpression of AtNPR1 has been 
shown to confer resistance against various patho- 
gens in several crop plants including rice (Chern 
et al. 2001; Quilis et al. 2008), tomato (Lin et al. 
2004), wheat (Makandar et al. 2006), carrot 
(Wally et al. 2009), and cotton (Parkhi et al. 
2010). 
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It is equally important to understand the 
mechanism used by the pathogen to successfully 
colonize and reproduce on its host plant, as this 
could also provide leads for developing novel 
resistance strategies. Arabidopsis has also been 
used for investigating genomics and effectromics 
of different oomycetes like A. candida, 
Hyaloperonospora arabidopsidis, and P. infes- 
tans, to name a few. 


1.6 Moving Beyond the Model 


System 


Knowledge gained from studies in the dicot 
model Arabidopsis has facilitated understanding 
of various aspects of plant growth, development, 
and responses to environmental cues. Genes 
involved in core cellular processes like cell cycle, 
transcription, and translation tend to be con- 
served throughout eukaryotic evolution (Koonin 
et al. 2004). Although homologues of genes regu- 
lating key biological pathways have been identi- 
fied in various crop species, there is a growing 
awareness that regulation of these pathways dif- 
fers considerably between species, thus render- 
ing translation of knowledge from the model 
plant into a crop inefficient. A prime example of 
this comes from the well-studied flowering-time 
pathway, wherein homologues of flowering-time 
genes such as FT and CO have been found in 
many crops such as rice and wheat, but transcrip- 
tional and posttranscriptional regulation of these 
genes varies considerably in response to external 
and internal cues (Ballerini and Kramer 2011). 
Also, it has been shown that responses of 
Arabidopsis and B. napus to Leptosphaeria mac- 
ulans infection differ significantly (Sasek et al. 
2012; Larkan et al. 2013). A recent commentary 
discusses inadequacies of Arabidopsis as a model 
system for understanding complex networks of 
responses that have evolved through years of 
host—pathogen coevolution (Rouxel and Balesdent 
2013). Therefore, the model system may be used 
to define a set of genes involved in a particular 
biological process, but regulation and interac- 
tions need to be resolved in a context-specific 
manner. There is also a need to develop other 
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model systems more closely related to the target 
crops that are amenable to cultivation under labo- 
ratory conditions. One such emerging model sys- 
tem is Brachypodium distachyon, of the grass 
subfamily Pooideae, which is closely related to 
many cereal crops including wheat, barley, rice, 
sorghum, and maize. The emergence of afford- 
able high-throughput sequencing technologies 
has culminated in many crop genomes being 
sequenced. This, in turn, has led to development 
of crop-specific resources, thus paving the way 
for unraveling gene-trait correlations directly in 
the crop species itself. Though there have been 
many developments recently in the field of crop 
genomics, we have a long way to go before this 
knowledge can be used directly for crop improve- 
ment. Till then, model systems would continue to 
be workhorses in plant biology research. 
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Abstract 

Microalgae are used as food, feed, and fodder and also used to produce a 
wide range of metabolites such as, proteins, carbohydrates, lipids, carotenoids, 
vitamins, fatty acids, sterols, etc. They are able to enhance the nutritional 
content of conventional food and feed preparations and hence positively 
affect humans and animal health including aquaculture animals. They also 
provide a key tool for phycoremediation of toxic metals and nanometal 
production. The use of microalgae in nanotechnology is a promising field 
of research with a green approach. The use of genetically modified algae 
for better production of different biotechnological compounds of interests 
is popular nowadays. Microalgal biomass production for sustainable 
biofuel production together with other high-value compounds in a cost- 
effective way is the major challenge of algal biotechnologists. Microalgal 
biotechnology is similar to conventional agriculture but has received quite 
a lot of attention over the last decades, because they can reach substan- 
tially higher productivities than traditional crops and can use the waste- 
lands and the large marine ecosystem. As history has shown, research 
studies on microalgae have been numerous and varied, but they have not 
always resulted in commercial applications. The aim of this review is to 
summarize the commercial applications of microalgae. 


Keywords 
Microalgae ¢ Nutrition « Metabolites * Phycoremediation * Nanometals « 
Nanotechnology ¢ Biofuel 


N. Khatoon ¢ R. Pal (D4) 

Department of Botany, University of Calcutta, 

35, B.C. Road, Kolkata, West Bengal 700 019, India 
e-mail: rpalcu@rediffmail.com 


Bir Bahadur et al. (eds.), Plant Biology and Biotechnology: Volume II; Plant Genomics 
and Biotechnology, DOI 10.1007/978-8 1-322-2283-5_2, © Springer India 2015 


27 


28 


2.1 Introduction 

Algae including cyanobacteria are the most 
primitive plants that appeared on the earth’s 
surface at least 3500 Ma ago. They are 
chlorophyll-bearing photosynthetic organisms 
which built up the oxygen in the atmosphere. 
The algae comprise one of the most diverse plant 
groups and constitute a species range of 40,000- 
10,000,000, with majority being the microalgae 
(Hawksworth and Mound 1991; Metting 1996). 
Traditionally, “algae” include the prokaryotic 
cyanobacteria and other eukaryotic members of 
Chlorophyta, Rhodophyta, Heterokontophyta, etc. 
Therefore, in general, algae include photosyn- 
thetic prokaryotic and eukaryotic organisms as 
well as the heterotrophic members. On the basis 
of molecular phylogeny, many of these groups 
are widely scattered on the “tree of life.” As a 
result, the term “algae” refers to a polyphyletic, 
artificial assemblage of organisms. But in recent 
terminology “algae” excludes cyanobacteria. 
They are described as “lower” plants that never 
have true stems, roots, and leaves, and they are 
normally capable of photosynthesis. Plant 
bodies are thalloid, and there are members with 
coccoid, capsoid, amoeboid, palmelloid, colonial, 
plasmodial, filamentous, and parenchymatous 
(tissue like) organizational levels. The size of 
algae ranges from tiny single-celled species to 
gigantic multicellular organisms (few micron to 
few meters). Algae of different sizes and shapes 
not only occupy all aquatic ecosystems but also 
occur in almost all other habitats. Algae are one 
of the earth’s most important natural resources 
that contribute to approximately 50 % of global 
photosynthetic activity. The first use of microalgae 
by humans dates back 2,000 years to the Chinese, 
who used Nostoc to survive during famine. 
However, microalgal biotechnology only really 
began to develop in the middle of the last century, 
and establishment of Spirulina production plant 
in Mexico can be considered as benchmark of 
microalgal biotechnology (Borowitzka 1999), 
Particularly during the past two or three decades, 
algal biotechnology grew steadily into an impor- 
tant global industry with a diversified field of 
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applications, and more and more new entrepreneurs 
began to realize the potentiality of microalgae in 
biotechnological exploitation. Nowadays, about 
10’ t of microalgae are harvested each year by 
different industries for various purposes. Today’s 
commercial algal biotechnology is still a non- 
transgenic industry that basically produces 
food, feed, and feed additives, cosmetics, and 
pigments. Presently active researches are going 
on for the development of genetically modified 
algae and for biotechnological exploitation. 
Due to the marine and aquatic applications, 
algal biotechnology is sometimes also called 
blue biotechnology. 

Commercial large-scale cultivation of microal- 
gae started in the early 1960s in Japan with the 
culture of Chlorella by the company Nihon 
Chlorella (Taipei, Taiwan) (Borowitzka 1999; 
Iwamoto 2004). It was followed in the early 1970s 
by the establishment of an Arthrospira culturing 
and harvesting facility in Lake Texcoco by Sosa 
Texcoco S.A. (Mexico City, Mexico) (Borowitzka 
1999). The first use of algae in aquaculture 
appeared in the 1970s (Pulz and Scheibenbogen 
1998). By 1980, there were 46 large-scale facto- 
ries in Asia producing more than 1,000 kg of 
microalgae (mainly Chlorella) per month. The 
commercial production of Dunaliella salina, as a 
source of f-carotene, became the third major 
microalgal industry when production facilities 
were established by Western Biotechnology (Hutt 
Lagoon, Australia) and Betatene in 1986 
(Whyalla, Australia, now Cognis Nutrition and 
Health). These were soon followed by other 
commercial plants in Israel and the USA. Together 
with these, the large-scale production of cyano- 
bacteria (blue-green algae) began in India at about 
the same time. More recently, several plants pro- 
ducing Haematococcus pluvialis as a source of 
astaxanthin have been established in the USA and 
India (Biotechnological and Environmental 
Applications of Microalgae [BEAM]). Thus, in a 
short period of about 30 years, the algal biotech- 
nology industry has grown and diversified sig- 
nificantly. Nowadays, the microalgal biomass 
market produces about 5,000 t of dry matter/year 
and generates a turnover of approximately US$ 
1.25 x 10°/year. 
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The aim of this review is to focus on the latest 
status of multidisciplinary research in the area of 
algal biotechnology mainly related to industrial 
applications of algae. 


2.2  #Microalgae in Human 


and Animal Nutrition 


The microalgal market as food is dominated by 
Chlorella and Spirulina (Becker 2004; Pulz and 
Gross 2004), mainly because of their high protein 
content and nutritive value. The biomass of these 
algae is marketed as tablets, capsules, and liquids 
(Table 2.1, Fig. 2.1). Microalgae can act as a 
nutritional supplement or represent a source of 
natural food colorants (Apt et al. 1996; Borowitzka 
1999; Soletto et al. 2005). The commercial 


applications are dominated by four strains: 
Arthrospira, Chlorella, Dunaliella salina, and 
Aphanizomenon flos-aquae. Arthrospira 1s used 
in human nutrition because of its high protein 
content and its excellent nutritive value (Radmer 
1996; Rangel-Yagui et al. 2004; Soletto et al. 
2005; Desmorieux and Decaen 2005). In addi- 
tion, this cyanobacterium has various possible 
health-promoting effects, like alleviation of 
hyperlipidemia, suppression of hypertension, 
protection against renal failure, growth promo- 
tion of intestinal Lactobacillus, suppression of 
elevated serum glucose level, etc. (Vilchez et al. 
1997; Yamaguchi 1997; Liang et al. 2004). 

The world’s largest producer of Arthrospira — 
the Hainan Simai Enterprising — is located in the 
Hainan province of China. This company has an 
annual production of 200 t of algal powder, which 


Table 2.1 Major microalgae commercialized for human nutrition 


Microalga Major producers 
Spirulina (Arthrospira) Hainan Simai Pharmacy 
Co. (China) 
Earthrise Nutritionals 


(California, USA) 


Cyanotech Corp. (Hawaii, USA) 


Myanmar Spirulina factory 


(Myanmar) 
Chlorella 
Co. (Taiwan) 


Kl6tze (Germany) 
Dunaliella salina 
(Australia) 
Blue Green Foods (USA) 
Vision (USA) 


Aphanizomenon flos-aquae 


Taiwan Chlorella Manufacturing 


Cognis Nutrition and Health 


Products Production (t/year) 


Powders, extracts 3,000 
Tablets, powders, extracts 


Tablets, powders, beverages, 


extracts 

Tablets, chips, pasta, and 
liquid extract 

Tablets, powders, nectar, 2,000 
noodles 
Powders 
Powders 1,200 
B-carotene 
Capsules, crystals 500 


Powder, capsules, crystals 


Adapted from Pulz and Gross (2004), Viskari and Colyer (2003), Spolaore et al. (2006), Hallmann (2007) 





Fig.2.1 Algal food used in human and animal nutrition 
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accounts for 25 % of the total national output 
and almost 10 % of the world output. Their 
Arthrospira-based products (tablets and powder) 
are distributed in over 20 countries around the 
world. Many other companies sell a wide variety 
of nutraceuticals made from this microalga. 
For example, the Myanmar Spirulina Factory 
(Yangon, Myanmar) sells tablets, chips, pasta, 
and liquid extract, and Cyanotech (a plant in 
Kona, Hawaii, USA) produces products ranging 
from pure powder to packaged bottles under the 
name Spirulina Pacifica. 

For human consumption, Cognis Nutrition 
and Health, the world’s largest producer of this 
strain, offers Dunaliella powder as an ingredient 
of dietary supplements and functional foods. 
According to many research studies, used alone 
or in combination with other nutraceuticals and 
natural food products, A. flos-aquae promotes 
overall good health (Jensen et al. 2001; Pugh and 
Pasco 2001; Benedetti et al. 2004). 

Many evaluations have shown the suitability 
of algal biomass as a feed supplement (Becker 
2004). Mainly, the microalgae Spirulina and, to 
some extent, Chlorella are used in this domain 
for many types of animals: cats, dogs, aquarium 
fish, ornamental birds, horses, poultry, cows, and 
breeding bulls (Spolaore et al. 2006). All these 
members are able to enhance the nutritional 
content of conventional feed preparations, and 
hence, they positively affect the physiology of 
these animals. Even very small amount of micro- 
algal biomass can positively affect the physiol- 
ogy of animals by improving the immune 
response, resulting in growth promotion, disease 
resistance, antiviral and antibacterial action, 
improved gut function, probiotic colonization 





Fig.2.2 Algal feed for aquaculture 
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stimulation, as well as by improved feed conversion, 
reproductive performance, and weight control 
(Harel and Clayton 2004). 

The external appearance of the animals may 
also be improved, resulting in healthy skin and a 
lustrous coat, for both farming animals (poultry, 
cows, breeding bulls) and pets (cats, dogs, 
rabbits, ornamental fishes, and birds) (Certik and 
Shimizu 1999). Since feed corresponds to the 
most important exogenous factors influencing 
animal health and also the major expense in 
animal production, the use of alternative 
high-quality protein supplements replacing con- 
ventional protein sources is encouraged. In fact, 
30 % of the current world algal production is sold 
for animal feed applications (Becker 2004). 


2.3 Aquaculture Feed 

From an economic perspective, the importance of 
using algae in aquaculture has grown over the 
years as it can be utilized as an unconventional 
source of protein alternative to the much used fish 
meal. Since 1940s, microalgae became more and 
more important as live feed in aquaculture 
(shellfish or fish farming). Microalgal feeds 
are currently used mainly for the culture of larvae 
and juvenile shell and finfish as well as for raising 
the zooplankton required for feeding of juvenile 
animals (Fig. 2.2) (Benemann 1992; Chen 2003). 
They are required for larval nutrition during a 
brief period, either for direct consumption in the 
case of mollusks and penaeid shrimp or indirectly 
as food for the live preys, mainly rotifers, cope- 
pods, and Artemia nauplii, which in turn are used 
for crustaceans and fish larvae feeding (Brown 
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Fig.2.3 Improved growth and color after feeding on algal feed (a) rohu and goldfish and (b) prawn 


et al. 1997; Duerr et al. 1998; Muller-Feuga 
2000). In 1999, the production of microalgae for 
aquaculture reached up to 1,000 t (62 % for mol- 
lusks, 21 % for shrimps, and 16 % for fish) for a 
global world aquaculture production of 43 x 106 t 
of plants and animals (Muller-Feuga 2000). The 
most frequently used microalgae in aquaculture 
are Chlorella, Tetraselmis, Isochrysis, Pavlova, 
Phaeodactylum, Chaetoceros, Nanno-chloropsis, 
Skeletonema, and Thalassiosira (Yamaguchi 
1997; Borowitzka 1997; Apt and Behrens 1999; 
Muller-Feuga 2000). Microalgae contain essen- 
tial nutrients which determine the quality, sur- 
vival, growth, and resistance to disease of 
cultured species. To support a better balanced 
nutrition for animal growth, it is often advised to 
use mixed microalgae cultures, in order to have a 
good protein profile, adequate vitamin content, 
and high polyunsaturated fatty acids, mainly 
EPA, AA, and DHA, recognized as essential for 
survival and growth during the early stages of life 
of many marine animals (Volkman et al. 1989). 
One of the beneficial effects attributed to adding 
algae is an increase in ingestion rates of food by 
marine fish larvae which enhance growth and 
survival as well as the quality of the fry (Naas 
et al. 1992). Aquatic species, such as salmonids 
(salmon and trout), shrimp, lobster, sea bream, 
and koi carp, under intensive rearing conditions 
need a supplementation of carotenoids pigments 


in their diet to attain their characteristic muscle 
color. In addition to pigmenting effects, carot- 
enoids, namely, astaxanthin and canthaxanthin, 
exert benefits on animal health and welfare, pro- 
mote larval development, and provide growth 
and performance. A positive metabolic role of 
carotenoids in the nutrition of larval fish and 
survival of young fry was also discussed by 
Reitain et al. (1997), Planas and Cunha (1999), 
and Lazo et al. (2000). 

The present group reported the use of noncon- 
ventional algal feed with improved results in 
growth, carcass composition, and pigmentation 
of goldfish (Fig. 2.3) (Khatoon et al. 2010a), rohu 
(Khatoon et al. 2010b), and prawn (Khatoon 
et al. 2009). 


Chemicals 
and Pharmaceuticals 


2.4 


A large number of studies have demonstrated the 
benefits and potential of algae as a source for bio- 
logically active (Table 2.2; Fig. 2.4) compounds 
(Hirayasu et al. 2005; Bansemir et al. 2006) and 
their antioxidant activity (Yuan et al. 2005). 
There are at least 30,000 known species of 
microalgae among which only a handful is cur- 
rently of commercial significance (Table 2.3). 
Algae provide a largely untapped reservoir of 
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Table 2.2 Biomass composition of microalgae expressed on a dry matter basis (Um and Kim 2009; Sydney et al. 2010) 


Strain Protein 
Anabaena cylindrica 43-56 
Botryococcus braunii 40 
Chlamydomonas reinhardtii 48 
Chlorella pyrenoidosa 57 
Chlorella vulgaris 41-58 
Dunaliella bioculata 49 
Dunaliella salina 57 
Dunaliella tertiolecta 29 
Euglena gracilis 39-61 
Porphyridium cruentum 28-39 
Prymnesium parvum 28-45 
Scenedesmus dimorphus 8-18 
Scenedesmus obliquus 50-56 
Scenedesmus quadricauda 47 
Spirogyra sp. 6—20 
Spirulina maxima 60-71 
Spirulina Figurensis 42-63 
Synechococcus sp. 63 
Tetraselmis maculata 52 


Carbohydrates Lipid 
25-30 4-7 

2 33 

17 21 

26 2 
12-17 10-22 
4 8 

32 6 

14 11 
14-18 14-20 
40-57 9-14 
25-33 22-39 
21-52 16-40 
10-17 12-14 
— 1.9 
33-64 11-21 
13-16 6-7 
8—14 4-1] 
15 11 

15 3 





Fig.2.4 Algal compounds in the form of cosmetics and medicines 


novel and valuable compounds. Current exploita- 
tion mainly aims to utilize fatty acids, pigments, 
vitamins, and other bioactive compounds. In the 
1980s, the use of microalgae as a source of 
common and fine chemicals was the beginning of 
a new trend (de la Noue and de Pauw 1988). 
Higher plants and animals lack the requisite 
enzymes to synthesize polyunsaturated fatty 
acids (PUFAs) of more than 18 carbons (Lee 
2001; Sancho et al. 1999). Thus, they have to get 
them from their food. Currently, algal docosahexae- 
noic acid (22:6 w3, 6, 9, 12, 15, 18) produced 
from the dinoflagellate member, Cryptheco- 
dinium cohnii, is the only commercially available 
w3-polyunsaturated fatty acid producer (Walker 
et al. 2005), but others like y-linolenic acid from 


Spirulina, arachidonic acid from Porphyridium, 
and eicosapentaenoic acid from Nannochloropsis, 
Phaeodactylum, or Nitzschia have already 
demonstrated as potential for industrial production 
of their high-value compounds (Spolaore et al. 
2006). Other fatty acids or lipids are isolated 
from Phaeodactylum tricornutum as a food addi- 
tive; from Odontella aurita for pharmaceuticals, 
cosmetics, and baby food; and from [sochrysis 
galbana for animal nutrition (Pulz and Gross 
2004). 

Polysaccharides are isolated from Chlorella 
spp. for dietary supplements (Walker et al. 2005) 
and from Porphyridium cruentum for pharma- 
ceuticals, cosmetics, and nutrition (Pulz and 
Gross 2004). Extracts from the cyanobacterium 
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Table 2.3. Commercial companies selling algae or algal 
products (Hallmann 2007) 


Company Products 


Algatech, Israel Astaxanthin, microalgae- 
derived products 

Algisa, Compania 
Industrial de Alginatos 


Alginates 


S.A., Chile 

Bluebio Bio- Biomass (Chlorella, 
Pharmaceutical Co., Ltd., Spirulina) 

China 

Cognis Nutrition and B-carotene 

Health, Australia 

Dainippon Ink and Pigments 
Chemicals, Japan 

Exsymol S.A.M., Cosmetics 

Monaco 

Far East Bio-Tec Co., Biomass (Chlorella, 
Ltd., Taiwan Spirulina), microalgae 


extracts, health care products, 
cosmetics 
Martek Biosciences 
Corporation, USA 


Fatty acids 


Mera Pharmaceuticals, Astaxanthin 
USA 
Penglai Dengzhou Alginates, mannitol, iodine 


Seaweed Co., Ltd., China 
Phycotransgenics, USA ‘Transgenic microalgae 


(Chlamydomonas) 


Qingdao Richstar Food additives, 

Seaweed [Industrial Co., | pharmaceutical chemicals 
Ltd, China 

Spectra Stable Isotopes, Stable isotope biochemicals 
USA 

Subitec GmbH, Germany _ Fatty acids 

Nature Beta B-carotene 


Technologies, Israel 
Necton S.A., Portugal 


Codif Recherche et 
Nature, France 


Biomass (microalgae) 
Cosmetics 


Lyngbya mayjuscula are used as immune modulators 
in pharmaceuticals and nutrition management 
(Pulz and Gross 2004). Components of algae are 
frequently used in cosmetics as_ thickening 
agents, water-binding agents, and antioxidants. 
Cosmetics companies claim benefits on the 
skin or health in general from contents like 
carrageenan, other algal polysaccharides, algal 
proteins or lipids, vitamin A, vitamin B1, iron, 
phosphorus, sodium, copper, magnesium, calcium, 
or other elements; some companies promise that 


algal extracts inhibit oxidative degeneration of 
collagen and hyaluronic acid and that they have 
antiaging properties. From a scientific point of 
view, many of the promised effects have to be 
judged as not scientifically proven and unsub- 
stantiated. Typical species that are used for cos- 
metics are Spirulina platensis, Nannochloropsis 
oculata, Chlorella vulgaris, and Dunaliella 
salina. 

From a commercial point of view, the carot- 
enoids and the phycobiliproteins seem to be the 
most important. In addition to their role in 
coloration, carotenoids act as provitamin A and 
as biological antioxidants, protecting cells and 
tissues from the damaging effects of free radicals 
and singlet oxygen. Therefore, carotenoids are 
utilized in pharmaceuticals, health food, and 
dietary supplements, cosmetics, and as a feed 
additive. The carotenoids including B-carotene 
are produced in large amounts from the halophilic 
green microalga Dunaliella salina. To a lesser 
extent, carotenoids are isolated from Dunaliella 
bardawil (Walker et al. 2005). The most promi- 
nent xanthophyll is astaxanthin, which is 
extracted in large scale from the green microalga 
Haematococcus pluvialis (Pulz and Gross 2004). 
Further utilized xanthophylls are lutein, canthax- 
anthin, and zeaxanthin. Phycobiliproteins consist 
of proteins with covalently bound phycobilins, 
which are tetrapyrrole structures with pyrrole 
rings that are laid out linearly. For efficient 
photosynthesis, phycobiliproteins capture light 
energy at certain wavelengths and pass it on to 
the chlorophylls. The phycobiliproteins-phycoer- 
ythrin and phycocyanin are isolated from 
Spirulina and Porphyridium and are utilized for 
health food, pharmaceuticals, and cosmetics 
(Becker 2004; Pulz and Gross 2004; Spolaore 
et al. 2006). Phycobiliproteins are not only used 
as pigments but have also been shown to have 
health-promoting properties. They are also used 
in research laboratories as labels for biomole- 
cules (Spolaore et al. 2006). The first and most 
important application of phycocyanin is as food 
pigment, replacing current synthetic pigments 
(Becker 2004). Dainippon Ink & Chemicals 
(Sakura) has developed a product called “Lina 
blue” which is used in chewing gum, ice sherberts, 
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popsicles, candies, soft drinks, dairy products, 
and wasabi. They also sell another form of this 
pigment for natural cosmetics like lipstick and 
eyeliners (Yamaguchi 1997; Viskari and Colyer 
2003). Their global market was estimated at more 
than US$50 million in 1997. 


2.5  Biofertilizers 

Biofertilizer contains microorganisms which pro- 
mote the adequate supply of nutrients to the host 
plants, ensure their proper development of growth 
and regulation in their physiology, and protect 
plants from soil-borne diseases to a certain 
degree. The need for the use of biofertilizer has 
arisen, primarily for two reasons: first, because 
increase in the use of fertilizers leads to increased 
crop productivity, and second, because increased 
usage of chemical fertilizer leads to damage in 
soil texture and raises other environmental prob- 
lems. Therefore, the use of biofertilizer is both 
economical and environment friendly. Among 
the microalgae, cyanobacteria have drawn much 
attention as prospective and rich sources of 
biologically active constituents that can be used 
as biofertilizers (Fish and Codd 1994; Schlegel 
et al. 1999). In early times, several authors like 
De (1939) and Gupta and Shukla (1967) studied 
the algal influence on growth, yield, and protein 
content of rice plants and showed that presoaking 
rice seeds with BGA cultures or extracts enhances 
germination, promotes the growth of roots and 
shoots, and increases the weight and protein 
content of the grain. Beneficial effects of cyano- 
bacterial inoculation were reported, not only 
for rice but for other crops as well such as 
wheat, soybean, oat, tomato, radish, cotton, 
Sugarcane, maize, chili, bean, muskmelon, and 
lettuce (Venkataraman 1972; Rodgers et al. 1979; 
Arif et al. 1995; Thajuddin and Subramanian 
2005; Saadatnia and Riahi 2009; Maqubela et al. 
2008; Karthikeyan et al. 2007). Efficient nitrogen- 
fixing blue green algal strains are utilized for 
rice production. Several reasons have been pro- 
posed for beneficial effects of cyanobacteria on 


N. Khatoon and R. Pal 


the growth of different plants. The capacity for 
enhanced biosynthesis of growth-promoting 
substances such as auxins, amino acids, sugars, 
and vitamins (vitamin B12, folic acid, nicotinic 
acid, and pantothenic acid) by algalization was 
reported by Misra and Kaushik (1989b) that can 
enhance the growth of plant. Cyanobacteria play 
an important role in fixing atmospheric nitrogen 
due to the presence of heterocysts and buildup of 
soil fertility, consequently increasing rice growth 
and yield as a natural biofertilizer (Song et al. 
2005). Additionally, cyanobacteria excrete com- 
plex organic carbon compounds that bind to the 
soil particles and improve soil aggregation, hence 
improve soil structure, soil permeability, and 
water-holding capacity of soil (Kaushik 2007). 
Cyanobacteria are capable of abating various 
kinds of pollutants and have advantages as poten- 
tial biodegrading organisms (Subramanian and 
Uma 1996). Blue-green algal inoculation with 
composite cultures was found to be more effective 
than single culture inoculation. Both free-living 
andas well as symbiotic cyanobacteria (blue-green 
algae) have been harnessed in rice cultivation in 
India. A composite culture of BGA having 
heterocystous Nostoc, Anabaena, Aulosira, etc. 
is given as primary inoculum in trays, polythene- 
lined pots, and later mass multiplied in the field 
for application as soil-based flakes to the rice- 
growing field at the rate of 10 kg/ha. At many 
sites where algal inoculation was used for three 
to four consecutive cropping seasons, the inocu- 
lated algae establish well and the effect persisted 
over subsequent rice crop. The blue-green algal 
inoculum may be produced by several methods, 
viz., in tubs, galvanized trays, and small pits and 
also in field conditions. 

However, the large-scale production 1s advisable 
under field condition which is easily adopted by 
farmers. Nowadays, liquid fertilizers are in use. 
There are many advantages of liquid biofertilizer 
over the conventional carrier-based biofertilizers. 
As they have greater potentials to fight with 
native population; cost saving on carrier mate- 
rial, pulverization, neutralization, sterilization, 
packing, and transport; easy and quick quality 
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Fig.2.5 Algal-based biofertilizers 


control protocols; better survival on seeds and 
soil; and very much easy to use by the farmer. 
The increasing demand for the biofertilizers and 
the awareness among farmers and planters in the 
use of biofertilizers have paved way for the fertil- 
izer manufactures and new entrepreneurs to get 
into biofertilizer production. A number of biofer- 
tilizer production units have been started recently 
particularly in the southern states of our country 
(Fig. 2.5). 

The application of BGA biofertilizer in rice 
popularly known as “algalization” helps in creat- 
ing an environment-friendly agroecosystem that 
ensures economic viability in paddy cultivation 
while saving energy-intensive inputs. The tech- 
nology can be easily adopted by farmers for 
multiplication at their own level. 


2.6 Biofuel 

Algae, particularly green unicellular microalgae, 
have been proposed for a long time as a potential 
renewable fuel source (Benemann and Oswald 
1996; Oswald and Golueke 1960; Brennan and 
Owende 2010). Microalgae have long been rec- 
ognized as potentially good sources for bio- 
fuel production because of their high oil content 
and rapid biomass production (Table 2.4). In 
recent years, the use of microalgae as an alterna- 
tive biodiesel feedstock has gained renewed 
interest from researchers, entrepreneurs, and the 
general public. As with plant-derived feedstocks 
(Table 2.5), algal feedstocks can be utilized 
directly or processed into liquid fuels and gas by 


Table 2.4 Oil content of microalgae (Chisti 2007) 
Oil content (% dry 


Microalga weight) 
Botryococcus braunii 25-75 
Chlorella sp. 28-32 
Crypthecodinium cohnii 20 
Cylindrotheca sp. 16-37 
Nitzschia sp. 45-47 
Phaeodactylum tricornutum 20-30 
Schizochytrium sp. 50-77 
Tetraselmis suecica 15-23 


a variety of biochemical conversion or thermo- 
chemical conversion processes (Amin 2009; 
Brennan et al. 2010). Dried algal biomass may be 
used to generate energy by direct combustion 
(Kadam 2002), but this is probably the least 
attractive use for algal biomass. In addition, 
hydrogen can be produced from algae by bio- 
photolysis (Melis 2002). One of the attractions of 
microalgae as a biofuel feedstock is that they can 
be effectively grown in conditions which require 
minimal freshwater input unlike many plant-based 
biofuel crops and utilize land which is otherwise 
nonproductive to plant crops, thus making the 
process potentially sustainable with regard to 
preserving freshwater resources. There has there- 
fore been significant interest in the growth of 
microalgae for biofuels under saline conditions 
(Rodolfi et al. 2009; de la Noue et al. 1992). 

The rise in world oil prices led to a sharp 
increase in biofuel production around the world. 
High oil species of microalgae cultured in 
growth-optimized conditions of photobioreactors 
have the potential to yield 5,000—15,000 gal of 
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Table 2.5 Comparison of microalgae with other biodiesel feedstocks (Teresa et al. 2010) 


Seed oil content 


(% oil by wt in Oil yield (l oil/hna Land use (m* year/ Biodiesel productivity 
Plant source biomass) year) kg biodiesel) (kg biodiesel/ha year) 
Corn/maize (Zea mays L.) 44 172 66 152 
Hemp (Cannabis sativa L.) 33 363 31 321 
Soybean (Glycine max L.) 18 636 18 562 
Jatropha (Jatropha curcas L.) 28 741 15 656 
Camelina (Camelina sativa L.) 42 915 12 809 
Canola/rapeseed (Brassica 41 974 12 862 
napus L.) 
Sunflower (Helianthus annuus 40 1,070 1] 946 
L.) 
Castor (Ricinus communis) 48 1,307 9 1,156 
Palm oil (Elaeis guineensis) 36 5,366 2 4,747 
Microalgae (low oil content) 30 58,700 0.2 51,927 
Microalgae (medium oil 50 97,800 0.1 86,515 
content) 
Microalgae (high oil content) 70 136,900 0.1 121,104 





Fig.2.6 Algal biofuel 


microalgal oil per acre per year. The most signifi- 
cant distinguishing characteristic of algal oil is its 
yield and hence its biodiesel yield. According to 
some estimates, the yield (per acre) of oil from 
algae is over 200 times the yield from the best- 
performing plant/vegetable oils (Becker 1994). 
They can complete an entire growing cycle every 
few days. Microalgae are very efficient solar 
energy converters, and they can produce a great 
variety of metabolites (Chaumont 2005). A major 
current problem for the commercial viability of 
biodiesel production from microalgae is the low 
selling price of biodiesel. Biodiesel from micro- 
algal oil is similar in properties to the standard 
biodiesel (Huang et al. 2010). The algae that are 
used in biodiesel production are usually aquatic 
unicellular green algae. These types of algae are 
a photosynthetic eukaryote characterized by high 


growth rates and high population densities. Under 
good conditions, green algae can double its 
biomass in less than 24 h. Additionally, green 
algae can have huge lipid contents, frequently 
over 50 %. This high yield, high density biomass 
is ideal for intensive agriculture and may be 
an excellent source for biodiesel production 
(Fig. 2.6). 

Algal oil contains saturated and monounsatu- 
rated fatty acids. The fatty acids were determined 
in the algal oil in the following proportions: 36 % 
oleic (18:1), 15 % palmitic (16:0), 11 % stearic 
(18:0), 8.4 % iso-linoleic (17:0), and 7.4 % 
linoleic (18:2). The high proportion of saturated 
and monounsaturated fatty acids in this alga is 
considered optimal from a fuel quality standpoint, 
in that fuel polymerization during combustion 
would be substantially less than what would 
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occur with polyunsaturated fatty acid-derived 
fuel (Chaumont 2005; Gao et al. 2010). After oil 
extraction from algae, the remaining biomass 
fraction can be used as a high-protein feed for 
livestock (Schneider 2006). This gives further 
value to the process and reduces waste. Algae 
biomass can play an important role in solving the 
problem between the production of food and that 
of biofuels in the near future (Chisti 2007). 


2.7. Bioremediation 

Organic pollutants and heavy metals are consid- 
ered to be a serious environmental problem for 
human health. Lead, cadmium, and mercury are 
the most frequent water- and soil-polluting heavy 
metals. Industrial processes, including plastic 
manufacturing, electroplating, Ni-Cd battery 
production, mining, and smelting industries, con- 
tinuously release substantial amounts of heavy 
metals into the environment. The contamination 
of soils and aquatic systems by toxic metals and 
organic pollutants has recently increased due to 
anthropogenic activity. Phycoremediation is the 
use of microalgae for the removal or biotransfor- 
mation of pollutants, including nutrients and 
xenobiotics from wastewater and CO2 from 
waste air with concomitant biomass propagation 
(Olguin et al. 2003, 2004; Mulbry et al. 2008; 
Moreno-Garrido 2008). Phytoremediation 
technologies are becoming recognized as cost- 
effective methods for remediating sites contami- 
nated with toxic metals at a fraction of the cost of 
conventional technologies, such as soil replace- 
ment, solidification, and washing strategies 
(Chaney et al. 1997; Flathman and Lanza 1998; 
Munoz and Guieysse 2008). Recent studies have 
shown that microalgae can indeed support the 
aerobic degradation of various hazardous con- 
taminants (Munoz et al. 2008; Safonova et al. 
2004). Since few decades the potential of micro- 
algae in metal biosorption process has been 
studied extensively due to their ubiquitous occur- 
rence in nature (Gekeler et al. 1998). Many algal 
genera are found to have capabilities to accumu- 
late heavy metals, thereby reducing their toxic 
effects (Huang et al. 1990; Kuyucak and Volesky 


1990; Schiewer and Volesky 2000; Satiroglu 
et al. 2002; Arica et al. 2004; Selatnia et al. 2004). 
There has been a growing interest in using algae 
for biomonitoring eutrophication, organic pollut- 
ants, and inorganic pollutants. There are numer- 
ous processes of treating water, industrial 
effluents, and solid wastes using microalgae aer- 
obically as well as anaerobically. Many algae 
synthesize phytochelatins and metallothioneins 
that can form complexes with heavy metals and 
translocate them into vacuoles (Oswald and 
Golueke 1960). The growth of microalgae is 
indicative of water pollution since they respond 
typically to many ions and toxins. Blue-green 
algae are ideally suited to play a dual role of 
treating wastewater in the process of effective 
utilization of different constituents essential for 
growth leading to enhanced biomass production. 
Secondly, the role of microalgae is the accumula- 
tion and conversion of wastewater nutrients to 
biomass and lipids. Microalgae are efficient in 
the removal of nutrients from wastewater. 
Unicellular algae have shown great efficiency in 
the uptake of nutrients and have been found to 
show dominance in oxidation ponds (Pittman 
et al. 2010). Chlorella, Ankistrodesmus, and 
Scenedesmus species have been already success- 
fully used for the treatment of olive oil, mill 
wastewaters, and paper industry wastewaters 
(Abeliovich 1986; Narro 1987; Tilzer 1983). The 
algae have many features that make them ideal 
candidates for the selective removal and concen- 
tration of heavy metals, which include high 
tolerance to heavy metals, ability to grow both 
autotrophically and heterotrophically, large sur- 
face area/volume ratios, phototaxy, phytochelatin 
expression, and potential for genetic manipula- 
tion (Cai et al. 1995). Mei et al. (2006) suggested 
that Platymonas subcordiformis, a marine green 
microalgae, had a very strontium uptake capacity, 
although high concentrations of strontium cause 
oxidative damage, as evidenced by the increase 
in lipid peroxidation in the algal cell samples and 
the decrease in growth rate and chlorophyll 
contents. Caulerpa racemosa vat. cylindracea as a 
low-cost biomaterial could be used for the 
removal of boron species from aqueous solution 
(Bursali et al. 2009). Dunaliella salina, a green 


38 


microalgae, have high tendency for zinc accumu- 
lation followed by copper and cobalt, the lowest 
tendency was for cadmium, and this may be due 
to the importance of zinc as hydrogen transfer- 
ring in photosynthesis (Liu et al. 2002). The pres- 
ent group also reported several algae like 
Oscillatoria, Anabaena, Rhizoclonium, and 
Chara that accumulated lead, cadmium, and 
chromium from wastewater (Chakraborty et al. 
2011). Arsenic-resistant genera recorded from 
the contaminated area were Oscillatoria prin- 
ceps, Oscillatoria limosa, Anabaena sp., and 
Phormidium laminosum (Bhattacharya 2011). 

The biomass resulting from the treatment of 
wastewaters can be applied for different aims, 
including the use as additives for animal feed, 
the extraction of added value products like carot- 
enoids or other biomolecules, or the production 
of biofuel. The accumulation of heavy metals by 
algae provides an advantage for phytoremedia- 
tion over other methods which are more costly 
and not environmental friendly. Therefore, there 
is a need to improve the possibilities of accumu- 
lation of heavy metals in algae. 


2.8 Phyconanotechnology 
Nanotechnology and biotechnology are the two 
most fascinating technologies of the twenty-first 
century, and unification of these two results in the 
development of nanobiotechnology which deals 
with the synthesis of nanostructures using 
biological organisms. Reliable and eco-friendly 
synthesis of metallic nanoparticles is an impor- 
tant goal of nanotechnology. Nanomaterials are 
gaining so much of interest due to their unique 
optical, chemical, photoelectrochemical, and 
electronic properties which are absent in the bulk 
material because of the quantum-size confine- 
ment imposed by nano-size (Kumar et al. 2003). 
Nanoparticles are synthesized by a number of 
chemical and physical procedures. Several manu- 
facturing techniques are in use that employs 
atomistic, molecular, chemical, and particulate 
processing in a vacuum or a liquid medium 
(Daniel and Astruc 2004). But most of these 
techniques are costly, as well as inefficient in 
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materials and energy use. Therefore, second- 
generation nanotechnology is focused toward 
clean technologies that minimize possible 
environmental and human health risks associated 
with manufacture and fabrication; there is an 
ever-growing demand for the development of 
clean, nontoxic, and environmentally benign 
synthesis procedures. Green synthesis of nanopar- 
ticles is a significant process for the production of 
precious metals such as gold, silver, platinum, 
and palladium due to their various applications in 
material sciences. A number of microorganisms 
and higher plants have already been found to be 
competent to serve as eco-friendly nanofactories 
for the synthesis of gold nanoparticles (GNP) and 
silver nanoparticles (SNP). Cyanobacteria and 
microalgae having the reducing power are already 
designated as potential bioreagents for nontoxic 
nanoparticle synthesis. They are effective in 
nanoparticle synthesis, as they grow rapidly in 
eco-friendly atmosphere producing large amount 
of biomass in a very short time and can accumulate 
high quantity of metals followed by subsequent 
reduction. Their production costs are also 
negligible (Nair and Pradeep 2002; Lin et al. 
2005; Lengke et al. 2006a, b). The properties of 
nanogold change with varying sizes and shapes 
of the particles and are suitable for applying in 
diverse fields, starting from decorative coating 
paints to different fields of science like electronics, 
catalysis, pollution control, cancer therapy, drug 
delivery, biomedical assay, biosensor, bioimaging, 
etc. (Xu et al. 2004; Huang et al. 2006; Hauck 
et al. 2008). The “green” route for nanoparticle 
(NP) synthesis is therefore of great interest for 
economic prospects and applications. The green 
chemistry synthetic route can be employed for 
both GNP and SNP syntheses. Moreover, nano- 
products, synthesized from algae, are expected to 
be biocompatible, therefore minimize environ- 
ment and human health risks. The use of SNPs 
and GNPs in drug delivery systems might be the 
future thrust in the field of medicine. Further 
research is needed to screen the most suitable 
algal taxa for large-scale controlled synthesis of 
gold and silver nanoparticles. 

The synthesis of nanogold was first reported in 
1995 in Chlorella vulgaris (Ting et al. 1995) and 
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Fig.2.7 Algae showing (a) control filament, (b) gold nanoparticle synthesis, and (c) extracellular cold nanoparticles in 
the media. SEM of (d) control, (e) gold-treated algae, and (f) X-ray spectroscopy showing nanoparticles synthesis 


afterwards, Lengke et al. (2006a, b) reported it in 
Plectonema boryanum. In the same year, the pres- 
ent group reported cyanobacteria- and algae-based 
systemic process of gold and silver nanoparticle 
synthesis and published a series of papers on bio- 
synthesis of gold nanoparticle by cyanobacteria 
(such as Lyngbya majuscula and Spirulina sub- 
salsa), green algae (Rhizoclonium hieroglyphicum 
and R. riparium), Spirogyra and diatoms (Nitzschia 
obtusa and Navicula minima) (Chakraborty et al. 
2006, 2009; Nayak et al. 2006a; Roychoudhury 
and Pal 2014). Screening for different algae as 
bioreagent of GNP production and biosynthesis 
of gold nanorods by Nostoc ellipsosporum was 
done by Parial et al. (2012); Parial and Pal (2014) 
and of silver nanoparticles by Roychoudhury et al. 
(2014) (Fig. 2.7). 


2.9 Genetic Engineering 


and GM Algae 


Algal genome research is needed to get a new 
level of efficiency in biotechnological applica- 
tions. Algal species are targets of genetic engi- 
neering to improve productivity, broaden 


environmental tolerance limits, or increase pest 
or pathogen resistance. DNA sequences are 
introduced into the algal cells with the goal of 
modifying biochemical pathways. Among the 
biotechnologically important biomolecules, 
some are with high value but required in small 
quantity like medicinal compounds, but some are 
low-value product and are required in large 
amount like algal biodiesel. Therefore, the aim of 
the algal biotechnologists is always to reduce the 
cost of the required products. Sometimes the cost 
of microalgal biomass production becomes so 
high that ultimately the production cost of the 
required compound becomes exorbitant therefore 
unusable. For this reason, biotechnologists tried 
to manipulate genetically many commercially 
viable microalgae to get suitable combinations 
of genes in the required strain for successful 
low-cost production or to get the required com- 
pounds — called as molecular farming. But till 
date, most of the results related to GM algae pro- 
duction are laboratory confined. The propagation 
of GM algae is in question as algae are the 
lower group of plants which frequently propa- 
gate through vegetative propagation like binary 
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fission resulting in the uncertain distribution of 
the altered genetic combination. 

There are several microalgal genome projects 
(Table 2.6), of which the most advanced projects 
are those for the red alga Cyanidioschyzon 
merolae, the diatom Thalassiosira pseudonana, 
and the three green algae Chlamydomonas rein- 
hardtii, Volvox carteri, and Ostreococcus tauri. 
Cyanobacteria are preferable source materials in 
plant genetic engineering studies due to genetic 
homology of chloroplasts in eukaryotic plants, 
prokaryotic genome organization, short genera- 
tion time, and fast growth rate. Most studies have 
therefore been made on commonly called as 
‘green yeast’. Chlamydomonas reinhardtii. Apart 
from Chlamydomonas, cyanobacterial strains of 
Synechocystis sp., Synechococcus sp., and 
Anabaena _ variabilis. Synechocystis sp. and 
Synechococcus sp. are naturally transformable 
with exogenous DNA. The green alga 
Haematococcus pluvialis naturally accumulates 
large amounts of astaxanthin when exposed to 
unfavorable growth conditions. The gene coding 
for the enzyme that converts -carotene into 
astaxanthin has been identified from H. pluvialis 
and cloned into Synechococcus. After the trans- 
formation, both zeaxanthin and the attractive 
compound astaxanthin were produced (Sahu 
et al. 2012). Tremendous success has been 
achieved in establishing functional association 
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between plant seedling/calli/protoplast cultures 
and N,-fixing cyanobacterial filaments through 
the application of basic tissue culture technique. 
Nostoc, Anabaena, and Plectonema are among 
the few N.-fixing cyanobacterial species success- 
fully practiced in these studies. The use of trans- 
genic cyanobacterium to control insect is a new 
technology. Various combinations of genes were 
introduced in Anabaena which showed toxicity 
against mosquito larvae. Further studies of 
genetic engineered algae and bioremediation 
have been performed on the green algae on 
Chlamydomonas reinhardtii. In the last few 
years, successful genetic transformations of ~25 
algal species have been demonstrated; most of 
these were achieved by nuclear transformation. 
Ten species of green algae have been trans- 
formed; stable transformation have been shown 
for seven of them, one of which was the unicellular 
model organism Chlamydomonas reinhardtii 
(Debuchy et al. 1989; Kindle et al. 1989); and 
transient transformation was demonstrated in 
the other three. All of these green algae are 
unicellular species except for Volvox carteri, for 
which stable transformation has been shown 
(Schiedlmeier et al. 1994). Many species of 
cyanobacteria, e.g., Spirulina, Anabaena, or 
Synechocystis, can be transformed by electropor- 
ation or conjugation (Koksharova and Wolk 
2002). Algae have also demonstrated suitability 


Table 2.6 Biotechnologically utilized transgenic algal species 


Algal genera Utilization 


Chlamydomonas reinhardtii 


Biomass from transgenics for animal health and feed, 


Reference 
Walker et al. (2005) 


bioremediation, environmental monitoring, production of 


recombinant proteins 


Dunaliella salina 


B-carotene and other carotenoids for health food, dietary 


Pulz and Gross (2004) 


supplements, cosmetics, and feed 


Haematococcus pluvialis 
additives 


Chlorella vulgaris 


Astaxanthin for health food, pharmaceuticals, and feed 


Biomass for health food, dietary supplements, and feed 


Pulz and Gross (2004) 


Pulz and Gross (2004) 


surrogates; extracts for cosmetics 


Phaeodactylum tricornutum 
Odontella aurita 
Nannochloropsis oculata Extracts for cosmetics 


Spirulina Figurensis 


Lyngbya majuscula 


Lipids and fatty acids for nutrition 
Fatty acids for pharmaceuticals, cosmetics, and baby food 


Phycocyanin, phycoerythrin, and biomass for health food, 
pharmaceuticals, feed, and cosmetics 


Immune modulators for pharmaceuticals and nutrition 


Pulz and Gross (2004) 
Pulz and Gross (2004) 
Spolaore et al. (2006) 


Becker (2004), Pulz 
and Gross (2004) 


Pulz and Gross (2004) 
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for synthesizing vaccines. In this regard, stable 
expression of the hepatitis B surface antigen gene 
has been shown in Dunaliella salina (Sayre et al. 
2001; Geng et al. 2003; Sun et al. 2003). A fur- 
ther project aims at the application of antigen 
producing algae in the fish industry. It is intended 
to use an alga-produced antigen to vaccinate fish 
against the infectious hematopoietic necrosis 
virus (IHNV) which causes an infectious disease 
that kills 30 % of the US trout population each 
year; vaccination is realized simply by feeding 
the fish with the algae (Banicki 2004). 

For further biotechnological exploitation of 
algae, several researchers are screening extracts 
from a multiplicity of algal species in order to 
find effective organic components like secondary 
metabolites (Kopecky et al. 2000; Lubian et al. 
2000), antifungal or antibacterial biomolecules 
(Piccardi et al. 2000), algal toxins (Piccardi et al. 
2000), or active pharmaceutical ingredients as 
drug candidates (Skulberg 2000). 


2.10 Cultivation 


Algae have an important role to play in the current 
world economy with an approximate turnover of 
US$5 billion per year. The biotechnological basis 
for the most efficient production of microalgal 
biomass is a key issue for the future impact of 
these organisms. The large-scale cultivation of 
microalgae and the practical use of its biomass as 
a source of certain constituents were probably 
first considered seriously in Germany during 
World War II. This initial research was taken up 
by a group of scientists at the Carnegie Institution 
of Washington, who summarized their experiences 
in the classic report Algal culture: from labora- 
tory to pilot plant (Burlew 1953). In the course of 
time, the continuous cultivation of algae under 
partially or fully controlled conditions has become 
an important development, with various economic 
possibilities. Techniques for the cultivation of 
algae on a large scale and processes for their uti- 
lization have been successfully developed in sev- 
eral countries (Table 2.7), and an attempt in this 
direction is worthwhile in some of the developing 
countries (Benemann and Oswald _ 1996; 
Venkataraman and Becker 1985). 


Table 2.7 Present state of microalgal production (Pulz 
and Gross 2004; Ratledge 2004; Hejazi 2004) 


Producer 
Alga Annual production country 
Arthrospira 3,000 t dry China, India, 
weight USA, Myanmar, 
Japan 
Chlorella 2,000 t dry weight Taiwan, 


Germany, Japan 


Dunaliella salina 1,200t dry weight Australia, Israel, 


USA, China 
Aphanizomenon 500tdry weight USA 
flos-aquae 
Haematococcus 300tdry weight USA, India, 
pluvialis Israel 
Crypthecodinium 240 t DHA oil USA 


cohnit 


Most algal species are obligate phototrophs 
and thus require light for their growth. The 
requirement for light, coupled with the high 
extinction coefficient of chlorophyll in these 
organisms, has necessitated the design and devel- 
opment of novel systems for large-scale growth. 
A few algal species are capable of heterotrophic 
growth, and for these organisms, conventional 
fermentation technology can be used for large- 
scale cultivation (Fig. 2.8). 


2.11 Phototrophic Systems 

Commercial growth of photosynthetic algae has 
been achieved in different ways: (1) open cultiva- 
tion using natural sunlight (Oswald and Golueke 
1960), (2) closed cultivation using natural sun- 
light, and (3) closed cultivation using artificial 
illumination. Each system has advantages and 
disadvantages, and the choice of system depends 
on the degree of parameter control needed to pro- 
duce the desired product and on the value of the 
product. A common limitation to all these sys- 
tems is the need to supply light to the culture, 
making it advantageous to maximize the sur- 
face-to-volume ratio of the culture. Many con- 
figurations of open cultures using natural 
sunlight have been proposed and constructed 
(Oswald 1988; Chaumont 1993; Pushparaj et al. 
1997). These systems are generally large, open 
ponds or raceways, and the principle advantage 
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Fig.2.8 Algal cultivation in (a) photobioreactor, (b) open tank, and (c) open raceway pond 


of these configurations is that the light energy is 
free (Pulz and Scheibenbogen 1998). However, 
this advantage is more than offset by several sig- 
nificant disadvantages. In open systems, it is 
very difficult to prevent contamination of the 
algal culture by other organism (i1.e., algae and 
other microorganisms). This problem has been 
addressed by culturing algae that require or toler- 
ate unique growth conditions that would exclude 
contaminating organisms. Open cultures attain 
cell densities leading to the need to process large 
quantities of water to harvest the algae. Outdoor 
phototrophic growth systems are also subject to 
daily and seasonal variations in light intensity 
and temperature, making it difficult to control or 
reproduce specific culture conditions. 
Nevertheless, for specific algal products, this 
technology has proven very successful, producing 
many thousands of tons of dried biomass per year 
(Lee 1997). This is especially true for Spirulina, 
which is extensively cultured in the USA, Mexico, 
Thailand, and China (Metting 1996; Li and Qi 
1997; Vonshak 1997). Several different closed 
systems using natural sunlight have been described 
(Richmond et al. 1993; Qiang and Richmond 
1994; Molina Grima et al. 1995; Spektorova et al. 
1997). In these systems, the algae are enclosed in 
a transparent material (either glass or plastic) and 
the vessels placed outdoors for illumination. The 
closure of the vessels minimizes contamination 
by other algal species. Closed, outdoor systems 
are still subject to variations in light intensity and 
temperature that make cultivation reproducibility 
problematic. In addition, a major problem with 
closed systems is the removal of oxygen from 
the culture and the provision of adequate tem- 
perature control. Although both of these issues 


can be resolved, the cost of doing so can more 
than offset the cost advantage of using natural 
sunlight. As with the outdoor systems, numerous 
designs have been constructed for the indoor, 
closed culture of algae using electric lights for 
illumination (Ratchford and Fallowfield 1992; 
Wohlgeschaffen et al. 1992; Iqbal et al. 1993; 
Lee and Palsson 1994). These vessels are often 
referred to as photobioreactors, and in principle, 
they are similar to conventional fermentor, the 
major difference being that they are driven by 
light rather than by an organic carbon source. 
These vessels provide the ability to control and 
optimize culture parameters, and, coupled with 
the closure that they provide, photobioreactors 
are suitable for culturing many different types of 
algae (Ratchford and Fallowfield 1992). 


2.12 Heterotrophic Systems 


The most significant advance in closed culture 
systems is the adaptation of fermentation tech- 
nology that allows for the heterotrophic growth 
of microalgae and eliminates the problem of light 
limitation (Barclay et al. 1994; Kyle 1996; Chen 
1997). A significant number of microalgae are 
capable of heterotrophic growth and potentially 
suitable for growth in fermentors (Droop 1974; 
Gladue and Maxey 1994). The basic principle of 
fermentor growth is to provide highly controlled 
optimal growth conditions to maximize pro- 
ductivity. In these, the biomass levels are at least 
tenfold higher than those achieved by photosyn- 
thesis-based culture systems (Radmer and Parker 
1994). The high biomass levels also greatly 
decrease the volume of water that must be pro- 
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cessed during harvesting. The effectiveness of 
large-scale cultures and the production of high 
biomass levels can make the cost of fermentative 
growth an order of magnitude less expensive than 
photobioreactors (Radmer and Parker 1994). 
Larger-scale production of the dinoflagellate 
Crypthecodinium by fermentative growth for the 
production of the polyunsaturated fatty acid 
DHA has been under way for several years (Kyle 
1996). Production of Crypthecodinium begins 
with a certified seed stock that was cryopreserved 
under liquid nitrogen conditions to maintain 
genetic stability. Crypthecodinium has been 
reported to produce approximately 30 % of their 
dry weight as the total fatty acid (Kyle et al. 
1992), with DHA making up close to 50 % of the 
total fatty acid (Behrens and Kyle 1996). 
Chlorella is also extensively grown in large quan- 
tities by fermentation techniques. Production 
levels in Japan are estimated to exceed 500 T per 
year, accounting for 50 % of the country’s total 
production (Lee 1997). Plans have also been 
announced in Korea to begin the production of 
heterotrophically grown Chlorella at a level 
exceeding 1,000 T per year (Lee 1997). Milking 
of microalgae is the most advanced technique for 
biotechnologically important compounds (Hejazi 
and Wijffels 2004). 

These systems have to be evaluated in their 
various configuration concepts regarding their 
potential productivity and economic feasibility. 
The most important and most obvious differences 
in microalgal production systems are the exposure 
of the microalgal culture to the environment. 


2.13. Conclusions 


More basic research needs to be performed before 
algal biotechnology would reach a capacity to 
compete with other systems. But since many 
physiological, morphological, biochemical, or 
molecular characteristics of algae are quite differ- 
ent from higher plants or animals, algae can meet 
several requirements that other systems cannot 
sufficiently accomplish. This is one reason why 
algal systems gain more and more influence in the 
production of substances of economic, industrial, 


and pharmaceutical importance. However, micro- 
algae are still not a well-studied group from a bio- 
technological point of view. The _ genetic 
improvement of algal strains is also a present 
challenge. The use of transgenic microalgae for 
commercial applications has not yet been reported 
but holds a significant promise. Among the algal 
species that are believed to exist, only a few thou- 
sand strains are kept in collections, a few hundred 
are investigated for chemical content, and just a 
handful are cultivated in industrial quantities (1.e., 
in tons per year). The combination of the excep- 
tional nutritional value of microalgae with color- 
ing and therapeutical properties, associated with 
an increase demand of natural products, makes 
microalgae worth exploring for utilization in the 
future in feed, food, cosmetic, and pharmaceutical 
industries, with recognized advantages compar- 
ing with the traditional ingredients. 
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Abstract 

Fungi have been recognized to be a major cause of disease in immuno- 
compromised hosts: moreover, the loss of food and fodder crops through 
fungi has been unmatched since the last decade. Fungi colonize the plant 
cell and organs by modulating the host defense response. A number of dif- 
ferent methods have been recently used to understand host—fungus inter- 
actions. With the advent of HiSeq approaches, more fungal genomes and 
transcriptomes are now sequenced, and their bioinformatics analyses have 
enriched and assisted our knowledge of the interplay between plant and 
fungi. The present chapter reviews the current biotechnological and bioin- 


formatics approaches for the study of plant—fungus interactions. 


Keywords 
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Fungal infection 


3.1 Introduction 

Fungi have coevolved with other eukaryotes 
including plants and animals. They can be either 
saprophytic or parasitic. The plant-parasitic fungi 
are strictly host specific, and the infection in 
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plants is mediated by the enzymes used in the 
breakdown of cell walls and toxins that inhibit or 
reduce the activity of host cells. Fungal infection 
in plants is also caused by disturbance in the hor- 
monal equilibrium of the plant cell, causing dis- 
ruptions to the growth and differentiation of the 
cells and tissues (Ludwig-Miiller 2000). The 
mode of infection in fungi can be necrotrophic, 
biotrophic, and hemibiotrophic. Necrotrophic 
infection leads to dysfunction of the tissue and 
eventually death of the plant. Biotrophs take 
nutrients and growth-regulating substances from 
the plants, but do not kill it, while hemibiotrophs 
initiate infection with a period of biotrophy, fol- 
lowed by a necrotrophic phase, and they possess 
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properties of both groups (Meinhardt et al. 2014). 
Significant yield losses due to fungal attacks 
occur in most of the agricultural and horticultural 
species. After insects, fungal diseases are rated as 
the second most important factor contributing to 
yield losses in our major cereal, pulse, and oil- 
seed crops (Grover and Gowthaman 2003). 
Penetration of fungi into plant cells occurs 
through diverse invasion strategies. Infection 
structures such as “Spitzenk6rper’’, an organizing 
center for hyphal growth, are produced, which 
enable the fungus to penetrate different types of 
plant cell walls (Mendgen et al. 1996). These 
fungi penetrate and degrade the cell wall and pro- 
duce toxins for their effective infection (Mendes- 
Giannini et al. 2005). Often N-terminal type II 
signal peptides containing fungal AVR proteins 
facilitate the fungal virulence by suppressing 
pathogen-associated molecular pattern-triggered 
immunity and induce effector-triggered immu- 
nity in plants containing cognate resistance pro- 
teins (Luderer et al. 2002). 
The different modes of infection include: 


3.1.1 Infection Structures 

Spores or hyphae of fungi penetrate through 
wounds of the epidermis or open stomata. For 
example, fungi imperfecti secrete a terpenoid 
(fusicoccin) that increases the influx of potas- 
sium into the cells of the stomata and induce 
thereby a permanent opening of the stomata and 
cause a high loss of water which may lead to 
death. Thus fusicoccin is also known as a wilting 
toxin (Arntzen et al. 1973). 


3.1.2 Cell Wall Degradation 


There is a range of polysaccharide-degrading 
enzymes and glycosidases such as alpha- 
galactosidase, beta-galactosidase, beta- 
xylosidase, and alpha-arabinosidase produced by 
phytopathogenic fungi. These enzymes are able 
to degrade walls of plant tissues (Albersheim and 
Valent 1974). 
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3.1.3. Toxin Production 


2-Hydroxycyclopropyl-alpha-D-galactopyranoside 
(helminthosporoside) has been identified in fun- 
gus-infected leaves of sugarcane with the help of 
nuclear resonance analysis and mass _ spectros- 
copy (Strobel and Hess 1974). Further detailed 
analysis revealed that it was a mixture of three 
isomer sesquiterpene—alkaloids that contain diga- 
lactosyl residues at both ends of the molecule. 
Helminthosporoside resembles alpha-galactosi- 
dases which are present in plant tissues and are 
taken up by the plant cells actively. These toxins 
attach at the plasma membrane, to the same bind- 
ing site as these sugars, but in contrast to the sug- 
ars are not transported into the cell. Instead, 
binding to the toxin increases the activity of the 
potassium/magnesium pump with the uptake of 
potassium. This causes an increase in the uptake 
of water, a bigger osmotic pressure, damage of the 
membrane, and finally the disintegration of the 
cell (Strobel and Hess 1974). Further, the study of 
signal transduction pathways in virulent fungi 1s 
particularly important in view of their putative 
role in the regulation of pathogenicity (Mendes- 
Giannini et al. 2005). 


3.2 Biotechnological 
Approaches for Plant- 


Fungus Interactions 


The study of fungal—host interplay is essential for 
developing alternative protection and resistance 
strategies. With the emergence of plenty of func- 
tional data from genomics, transcriptomics, and 
metabolomics, the study of plant—fungus interac- 
tions has been promoted. Functional genetic stud- 
ies are divided into forward or reverse genetics 
screens. In the forward genetic study, genes are 
mutated at random. After identification of a 
mutant phenotype, the mutations must be mapped, 
a process which is usually time-consuming and 
nontrivial. Conversely, reverse genetic approaches 
involve the disruption of a gene of interest, so as 
to determine its function and/or involvement in a 
pathway (Silva et al. 2004). 
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3.2.1 Reverse Genetics Approaches 

Reverse genetics approaches such as targeted 
gene disruption/replacement (knockout), gene 
silencing (knockdown), insertional mutagenesis 
(transposon and T-DNA mutagenesis), or target- 
ing induced local lesions in genomes (TILLING) 
are now used to analyze the functional informa- 
tion from the large number of genome projects 
(Gonzalez-Fernandez et al. 2010). With the 
advent of “reverse genetics,’ the functions of 
genes are discovered which contribute in the 
molecular mechanisms of fungal development, 
nutrition, and interaction with host plant 
(Bhadauria et al. 2009). Targeted gene replace- 
ment leads to the study of the phenotypes of 
mutants in which a genomic locus has been 
altered by insertion (gene disruption) or replace- 
ment (gene replacement) with heterologous DNA 
(Reski 1998). Homologous recombination 
between a target gene and the introduced DNA 
carrying its mutant allele results in targeted gene 
knockout. This approach was first pioneered for 
the budding yeast Saccharomyces cerevisiae to 
decipher gene function (Winzeler et al. 1999). 
Gene knockout can be performed by the simple 
disruption of the gene of interest by a resistance 
cassette, by alteration of the expression of the 
candidate gene rather than its deletion, by over- 
expression or mis-expression of a gene, and by 
gene replacement, which combines the second 
and third approaches (Bhadauria et al. 2009). A 
crucial feature that makes this technology feasi- 
ble is the ability to introduce linear DNA into a 
large number of cells simultaneously (Melissa 
and Jeff 2002). However, targeted gene disrup- 
tion is again time-consuming. A faster, simpler, 
and cheaper alternative of attenuating gene func- 
tion in a sequence-specific manner has emerged 
in the form of antisense RNA technology, ribo- 
zymes, and, more recently, RNA interference 
(RNAi) (Silva et al. 2004). In this technique 
double-stranded RNA (dsRNA) triggers the deg- 
radation of a homologous mRNA, thereby dimin- 
ishing or abolishing gene _ expression. 
RNA-mediated gene silencing methods that 
block the expression of genes at the posttran- 
scriptional level have been identified in a few 
economically important phytopathogens such as 


Magnaporthe oryzae (Kadotani et al. 2003) and 
Phytophthora infestans (West et al. 1999; 
Latijnhouwers et al. 2004; Whisson et al. 2005). 
Generally, RNAi causes only a partial reduction 
in gene expression and not a complete loss. This 
is amajor disadvantage of this method (Bhadauria 
et al. 2009). Insertional mutagenesis is performed 
by chromosomal integration of transforming 
DNA. The presence of a selectable marker in the 
transforming DNA is used to establish linkage 
between the insertion and the observed pheno- 
type and to recover DNA representing the 
mutated allele for cloning and subsequent analy- 
ses (Brown and Holden 1998). Targeting induced 
local lesions in genomes (TILLING) or chemical 
mutagenesis 1s a new reverse genetic strategy for 
plants that combines the efficiency of ethyl meth- 
anesulfonate (EMS)-induced mutagenesis with 
the ability of denaturing high-performance liquid 
chromatography (DHPLC) to detect base pair 
changes (G/C to A/T transition) by heteroduplex 
analysis (McCallum et al. 2000; Chen et al. 
2014). It has been evolved as an influential tool 
for functional genomics of phytopathogens. 
Lamour and colleagues (2006) employed 
TILLING to isolate gene-specific mutants in 
Phytophthora spp. Single-cell uninucleated 
spores like zoospores of Phytophthora spp. are 
best for TILLING mutagenesis (Bhadauria et al. 
2009). This technique has also helped to improve 
waxy quality trait in the polyploid wheat plant 
(Slade et al. 2005). 

All the reverse genetics tools described above 
have their own merits and drawbacks, and any 
one of them may be more effective for a particu- 
lar phytopathogen and case study while less suit- 
able for others. 


3.3. Gene Expression Analysis 


3.3.1 Real-Time Polymerase Chain 
Reaction (qPCR) and Reverse 
Transcription Polymerase 
Chain Reaction (RT-PCR) 


The advantage of real-time PCR is that the pro- 
cess of amplification is monitored in real time by 
using the fluorescence technique (Wilhelm and 
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Pingoud 2003). This technique has been used for 
rapid quantitative assessment of the expression of 
genes involved in powdery mildew disease devel- 
opment using the powdery mildew pathogen 
Golovinomyces orontii on a set of hypersuscepti- 
ble and resistant host plant Arabidopsis thaliana. 
The technique was found to be rapid, reproduc- 
ible, and useful for quantitative estimation of 
gene expression in other fungi as well (Webling 
and Panstruga 2012). The semiquantitative 
RT-PCR is currently the common technique for 
gene expression analysis, although an accurate 
normalization is essential to control for experi- 
mental error (Huggett et al. 2005). In this relative 
analysis method, expression of the gene of inter- 
est 1s compared with expression of a reference 
gene (an internal control). The technique has 
been used for different fungal plant studies such 
as coffee rust fungus Hemileia vastatrix (Vieira 
et al. 2011) and Verticillium dahliae in potato 
(Pasche and Mallik 2013). 


3.3.2 Direct Fluorescent In Situ 
RT-PCR 


Fungal interaction with host tissue and the under- 
standing of the subsequent events have been stud- 
ied by using direct fluorescent in situ 
RT-PCR. This technology combines PCR and in 
situ hybridization which is employed for mRNA 
localization in cell organelles or tissue sections. 
Digoxigenin-labeled nucleotides (digoxigenin- 
11-dUTP) are incorporated in the PCR product 
after reverse transcription and subsequently 
detected with an anti-digoxigenin antibody con- 
jugated with alkaline phosphatase. Further modi- 
fications can be done using fluorescent probes 
and fluorescence immunocytochemical labeling 
(Lossi et al. 2011). This technology has been 
used to study the symbiotic relationship of arbus- 
cular mycorrhizal fungi (Seddas-Dozolme et al. 
2010), as well as for the identification of an infec- 
tion gene in hemiubiotrophic plant pathogen 
Colletotrichum lindemuthianum (Tollot et al. 
2009). Further applications include identification 
of Blumeria graminis spores and mycelia on bar- 
ley leaves (Bindsley et al. 2002) and others. The 
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disadvantage of this method is that it is a time- 
consuming technique due to the use of hybridiza- 
tion step and technically demanding procedures 
such as light microscopy (Capote et al. 2012). 


3.3.3 TRAC Analysis 


There are a number of techniques for the analysis 
of gene expression; TRAC (transcript analysis 
with aid of affinity capture) analysis is a new and 
advanced technique for expression analysis of 
genes in strains producing foreign proteins. It is 
an efficient technique for the analysis of large 
numbers of samples (Nowrousian 2014). In this 
technique, the cells are directly lysed, and gene 
expression 1s measured without using RNA puri- 
fication or RI-PCR (Rautio et al. 2008). A set of 
fluorophore-labeled probes are used for the rec- 
ognition of the gene of interest. One of the most 
often used probes is biotin oligo-dT, which is 
used to capture targets from their polyA tails. By 
the use of TRAC method, 100—200 genes of the 
filamentous fungus Trichoderma reesei and the 
yeast Saccharomyces cerevisiae have been iden- 
tified with supposed importance in different bio- 
logical pathways (Rautio et al. 2007). These 
evaluated genes show potential in many physio- 
logical effects such as growth, protein production 
rate, and availability of nutrients and oxygen 
(Rintala et al. 2008). 


3.3.4 Suppression Subtractive 
Hybridization 


Suppression subtractive hybridization (SSH) 
technique is used for distinguishing two closely 
related DNA samples. They can be used both for 
cDNA subtraction and genomic DNA subtraction 
(Rebrikov et al. 2004). In this technique only one 
round of subtractive hybridization is needed 
where the subtraction step excludes the common 
sequences between target and driver population. 
This increases the probability of obtaining low- 
abundance differentially expressed cDNA and 
simplified analysis of subtracted library 
(Diatchenko et al. 1999). This has been used to 
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identify the upregulated genes in the establish- 
ment of mycorrhiza between Medicago truncat- 
ula and Glomus mosseae (Weidmann et al. 2004) 
in comparison with an infected plant and the 
pathogen in axenic culture. Other examples ana- 
lyzed are genes associated with phytoplasma in 
Chinese jujube (Liu et al. 2014) and plants 
infected with a wild-type isolate compared to 
plants infected with mutants (Horwitz and Lev 
2010). SSH can also identify fungal genes against 
a background of the uninfected host, for example, 
Alternaria brassicicola on Arabidopsis (Cramer 
and Lawrence 2004), genes of the white rot fun- 
gus Heterobasidion parviporum expressed dur- 
ing colonization of Norway spruce stems 
(Yakovlev et al. 2008), and in symbiosis for 
arbuscular mycorrhizae (Brechenmacher et al. 
2004). It is even possible to isolate differentially 
expressed mycoparasite genes in antagonistic 
fungal—fungal interactions (Carpenter et al. 2005; 
Morissette et al. 2008). 


3.3.5 Laser Microdissection 


Infection structures are formed in fungi to extract 
nutrition from plants. Laser microdissection 
(LM) of plant tissues infected with a fluorescent 
protein-tagged fungus is a useful method for 
obtaining samples highly enriched in fungal 
RNA for downstream analysis such as hybridiza- 
tion to a microarray, RT-PCR, and others (Tang 
et al. 2012). The technique can be used to isolate 
both fungal and host plant cells after pathogen 
infection. The samples are carefully fixed to 
maintain the integrity of the cell and later on dis- 
sected by laser. RNA isolation is further used for 
gene expression analysis (Fosu-Nyarko et al. 
2010). By using this technique, host—fungus 
interaction studies have been done in many cases, 
such as in Arabidopsis, where the epidermal cells 
of leaves infected with powdery mildew (Erysiphe 
cichoracearum) were isolated for gene expres- 
sion studies (Inada and Wildermuth 2005), and 
fungal haustoria could be separated from infected 
host plant cells (van Driel et al. 2007). This tech- 
nique 1s utilized for the study of biochemistry and 


structure of hyphal cells and fungal interaction 
with host tissues (Fosu-Nyarko et al. 2010). 


3.3.6 Biochemical Methods 


Biochemical methods are powerful tools for 
investigating mechanisms of gene expression. 
These techniques consist of Western blot analy- 
sis, protein purification using epitope-tagged 
fusion protein, protein immunoprecipitation (IP), 
and chromatin immunoprecipitation (ChIP) 
assays. Immunoblot analysis, also called Western 
blot, is a commonly used method to detect spe- 
cific proteins in tissue homogenates or protein 
extracts. Protein purification is also an approach 
routinely used to study the protein activity and to 
identify protein complexes. Immunoprecipitation 
(IP) is a method to specifically concentrate the 
protein of interest by using a specific antibody, 
which is widely used to study protein—protein 
interaction (PPI). ChIP assay is used to investi- 
gate the association of a protein of interest with 
specific DNA regions (Collas 2010). These meth- 
ods can be modified under different growth con- 
ditions and in different filamentous fung1. 


3.3.7. Proteome Profiling and Mass 
Spectrometry Tools 


Study of proteome changes in response to fungal 
invasion is crucial to understanding the molecu- 
lar mechanisms underlying plant—pathogen inter- 
action and pathogenesis. Several approaches are 
involved for identifying large numbers of pro- 
teins expressed in cells and also for globally 
detecting the differences in levels of proteins in 
different cell states. These techniques involve: 


3.3.7.1 Electrophoresis 

One-dimensional (1D) SDS polyacrylamide gel 
electrophoresis (PAGE) is a widely used tool for 
the separation of total protein extracts as well as 
protein fractions; two-dimensional electrophore- 
sis (2DE) and two-dimensional fluorescence dif- 
ference gel electrophoresis (2D-DIGE) are 
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protein separation techniques widely used in fun- 
gal proteomics. 


3.3.7.2 Mass Spectrometry (MS) Tools 

The major invention in MS tool is the time-of- 
flight (TOF) MS which is a relatively nondestruc- 
tive method used to convert proteins into volatile 
ions. A typical MS consists of an ion source, a 
mass analyzer, and a detector. Matrix-assisted 
laser desorption ionization (MALDIJ) and electro- 
spray ionization (ESI) have made it possible to 
volatilize and ionize large biomolecules, such as 
peptides and proteins (Yates 1998; Godovac- 
Zimmermann and Brown 2001; Mann et al. 
2001). The mass analyzers currently used in pro- 
teomics research are ion trap, TOF quadrupole, 
and Fourier transform ion cyclotron (FI-MS). 
MALDI is usually coupled to TOF analyzers that 
measure the mass of intact peptides (Chalupova 
et al. 2014). A new matrix-assisted laser desorp- 
tion/ionization TOF/TOF MS with the novel 
laser-induced fragmentation technique (MALDI 
LIFT-TOF/TOF MS) provides high sensitivity 
for peptide mass fingerprints (PMP). 


3.3.7.3 Multidimensional Protein 
Identification Technology 
(MudPIT) 
In this method, multidimensional liquid chroma- 
tography is coupled with tandem mass spectrom- 
etry (MS) and database searching by the 
SEQUEST algorithm (Washburn 2004). MudPIT 
was first applied to the fungal proteome of the 
Saccharomyces cerevisiae strain BJ5460 grown 
to mid-log phase and yielded the largest pro- 
teome analysis to date. A total of 1,484 proteins 
were detected and identified (Washburn et al. 
2001). 


3.3.8 Next-Generation Sequencing 


Next-generation sequencing (NGS) technologies 
are continuing to rise with the fall in its costs. 
This enables researchers to conduct whole- 
transcriptome sequencing (RNAseq) studies of 
interactions between plants and pathogenic fungi 
(Wang et al. 2009). The application of NGS in 
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plant-fungus interaction research shortens the 
overall time of development of molecular genetic 
information necessary for functional and transla- 
tional studies. Zhuang et al. (2012) reported 
novel transcriptome sequence information from 
the pea—S. sclerotiorum interaction obtained by 
454 pyrosequencing. [lumina’s next-generation 
sequencing technology sequencing by synthesis 
(SBS) is a powerful tool to rapidly sequence 
genomes and transcriptomes and is not all that 
expensive (Venu et al. 2011). 


3.3.9 Genetic Engineering: 
An Approach for Plant Disease 
Resistance 


Genetic engineering is an artificial technique 
used for transferring genes from one organism to 
other. This technique is used to manipulate the 
genetic makeup of a cell to produce new charac- 
teristics in an organism (Zhu et al. 2013). Plants 
produce different secondary metabolites, for the 
prevention of growth of fungi. Transgenic plants 
are developed either by increasing or decreasing 
the expression of genes that are naturally present 
in an organism or by transferring genes between 
individuals of the same or different species. 
Pathogenesis-related (PR) proteins expressed by 
the host plant are induced by pathogen attach- 
ment or other stresses (Antoniw and Pierpoint 
1978; Van Loon et al. 1994); these are not only 
accumulated locally in the infected leaf but are 
also induced systemically, associated with the 
development of systemic acquired resistance 
(SAR) against further infection by fungi, bacte- 
ria, and viruses. PR protein genes appear to be a 
good potential source as candidate genes for 
engineering fungal resistance in plants. Induction 
of PR proteins has been found in many plant 
species belonging to various families. The genes 
which encode for chitinases and glucanases (PR 
proteins) are the most extensively used for devel- 
oping fungal resistance in plants by their overex- 
pression. Another important hydrolytic enzyme 
is beta-1,3-glucanase which inhibits fungal 
growth in plants by catalyzing the degradation of 
beta-1,3-glucan abundantly present in plant cell 
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wall of many filamentous fungi (Wessels and 
Sietsma 1981). The detailed information on the 
engineering fungal resistance in plants by expres- 
sion of PR protein and other antifungal genes has 
not been given here as it is covered in another 
chapter in this book. 


3.3.10 Signal Transduction Pathway 
Activation 


Systemic acquired resistance (SAR) is an induc- 
ible system that becomes activated by pathogen 
exposure and leads to a broad range of resistance 
against diverse pathogens. After being exposed to 
a pathogen, the plant cell activator molecules 
interact with signal molecules such as H,O,, sali- 
cylic acid (SA), jasmonic acid (JA), and ethylene 
and trigger defense responses. H,O, has been 
shown to directly inhibit pathogen growth and to 
induce PR proteins, SA, and ethylene, as well as 
phytoalexins (Mehdy 1994). Overexpression of 
H,O, in transgenics of cotton, tobacco, and potato 
reduced disease development caused by a num- 
ber of different fungi, but higher level of overex- 
pression was phytotoxic (Murray et al. 1999). 
Salicylic acid (SA) is also one of the signal mol- 
ecules involved in defense response. 
Overexpression of SA in transgenic tobacco 
plants enhances PR protein production and pro- 
vides resistance to fungal pathogens (Verberne 
et al. 2000). R genes (resistance genes) have 
evolved in plants whose products allow recogni- 
tion of specific pathogen effectors, either through 
direct binding or by recognition of the effector’s 
alteration of a host protein (Jones and Dangl 
2006). R gene products control a broad set of dis- 
ease resistance responses whose induction is 
often sufficient to stop further pathogen growth 
and spread. 

Fungal infection in plants has been associated 
with reduced photosynthetic rate, nutrient mobi- 
lization at the site of infection, and formation of 
patches called green islands (Walters and 
McRoberts 2006). Cytokinins have the ability to 
mobilize nutrients toward sites of application and 
delay senescence which has led to the specula- 


tion of involvement of cytokinin in green island 
formation. Spores of some fungi such as rust and 
powdery mildew are known to contain high lev- 
els of cytokinins (Kiraly et al. 1967), and it has 
been suggested that the nutrient mobilization and 
green island formation are due to cytokinin pro- 
duced by the fungus. It has also been proposed 
that cytokinin-induced increases in invertase 
activity occurring early on in the pathogen—host 
interaction lead to carbohydrate accumulation 
and the downregulation of photosynthetic metab- 
olism. These events indicate that cytokinins play 
a crucial role in allowing the biotrophic pathogen 
to establish and grow in the host (Walters and 
McRoberts 2006). However, cytokinins have 
been shown to increase immunity in plants 
against bacteria (Naseem et al. 2014). Systems 
biology studies have been performed to under- 
stand the interplay of auxin and cytokinin 
(Naseem et al. 2012; Naseem and Dandekar 
2012) in bacteria and host plants as well as the 
effect of cytokinin on jasmonate-—salicylate 
antagonism in Arabidopsis immunity (Naseem 
et al. 2013). 


3.3.11 RNA Silencing 


RNA silencing is used as a reverse tool for gene 
targeting in fungi. Homology-based gene silenc- 
ing induced by transgenes (co-suppression), anti- 
sense, or dsRNA has been demonstrated in many 
plant-pathogenic fungi. This includes 
Cladosporium fulvum (Hamada and Spanu 1998), 
Magnaporthe oryzae (Kadotani et al. 2003), 
Venturia inaequalis (Fitzgerald et al. 2004), 
Neurospora crassa (Goldoni et al. 2004), 
Aspergillus nidulans (Hammond and Keller 2005), 
and Fusarium graminearum (Nakayashiki 2005); 
whether it is suitable for large-scale mutagenesis 
in fungal pathogens remains to be tested. 
Nakayashiki (2005) were successful in silencing 
70-90 % of transformants mpg/ and polyketide 
synthase gene. The mpg/ gene is a hydrophobin 
gene that is essential for pathogenicity, as it acts as 
a cellular relay for adhesion and a trigger for the 
development of appressorium (Talbot et al. 1996). 
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3.4 Bioinformatics and Systems 


Biology Approaches 


Biological problems often deal with different 
numbers of factors which are interrelated and 
lead to the whole behavior or function. This orga- 
nized complexity has recently introduced sys- 
tems biology as a new age science. Systems 
biology studies in plant science have increased 
our knowledge about circadian rhythms, multi- 
genic traits, stress responses, and plant defenses. 
Plants continually face biotic and abiotic stress 
and respond with sophisticated biochemical and 
structural defenses. Various external stimuli are 
differentially perceived by plants and lead to dis- 
tinct cascades of molecular responses and signal- 
ing networks at the cellular level. A detailed 
biochemical understanding of pathogen action 
and plant responses would enable development 
of disease control strategies, a challenging goal 
of plant pathology. “Omics” research approaches 
have produced copious data such as genomics, 
proteomics, transcriptomics, interactomics, and 
metabolomics which can be exploited in order to 
better understand biological interactions. 


Brief Introduction to Network 
Reconstruction and Modeling 


3.4.1 


The interactions of genes, proteins, metabolites, 
or other components are illustrated by networks. 
Network reconstruction and analysis is one of the 
most common approaches for describing biologi- 
cal systems. Both component integration and 
interactions are key features of systems biology. 
Networks can be either static or dynamic, and of 
different types, including gene-to-metabolite net- 
works, protein-protein interaction networks, 
transcriptional regulatory networks, and gene 
regulatory networks (Yuan et al. 2008). In meta- 
bolic networks, each reaction is a network edge 
connecting nodes that represent reaction sub- 
strates and products. The nodes connected by an 
edge are also associated with values indicating 
the quantity or concentration of a reaction par- 
ticipant (Pritchard and Birch 2011). The values at 
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the nodes vary over time, representing the ebb 
and flow of material that is created or destroyed. 
The network edges are said to carry a flux, the 
rate at which material “flows” from one node to 
another, and flux is also represented by a value. 
There are two approaches for network recon- 
struction: top-down approach and bottom-up 
approach (Shahzad and Loor 2012). Top-down 
approach provides a broad overview of the sys- 
tem where the regions can be gradually filled 
with detailed information of factors affecting the 
dynamics of the network. A bottom-up approach 
is a detailed model where the output of the mod- 
ule affects processes described by other modules 
at a higher level of organization. A bridge or bot- 
tleneck is a pathway of communication, the 
breaking of which can split the network into non- 
communicating parts. A bottleneck is a potential 
point of fragility in a network. However, biologi- 
cal networks are error tolerant and hence follow a 
“power law” with the loss of a single edge rarely 
leading to a global disconnection in the network. 
Hubs are the nodes which are directly connected 
to many other nodes. In protein-protein interac- 
tion networks, the number of network hubs varies 
by organism but is always small. In biological 
cells we expect a few highly connected “hubs” to 
play critical roles in signal transduction, regulat- 
ing transcription, and mediating metabolic flux 
across the network. Such central protein nodes 
with many interactions to other proteins (so- 
called hub nodes with a large degree) are thus 
candidates for effector targets. 


3.4.2 Systems Biology of Plants 


Advances in plant systems biology studies are 
expected to support our current requirements of 
crop adaptation for food, feed, biofuels, and 
industrial and pharmaceutical production. Plants 
have sessile lifestyle and phototrophic nutrition, 
and in order to accommodate this behavior, they 
have a unique genetic and metabolic repertoire 
(Morsy et al. 2008). However network modeling 
and experimental validation in plants can be 
executed in a fast and reiterative way owing to 
their relatively short generation times and their 
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capability to produce enough offspring for 
genetic analyses. 


3.4.3 Different Plant-Fungus 
Studies 


Human pathogenic fungi such as Aspergillus 
fumigatus, Candida albicans, Cryptococcus neo- 
formans, and Encephalitozoon cuniculi have 
been studied for some time now; plant—fungus 
interactions have not been studied much. 
However the complete sequencing of the 
Magnaporthe oryzae, a causal agent of blast dis- 
ease in rice, has shown that multi-locus genes are 
concordant with host preference leading to the 
segregation of a new species of Magnaporthe 
oryzae from Magnaporthe grisea (Gudimella 
et al. 2010). This example indicates horizontal 
transfer of gene clusters in human pathogenic 
fungi in accordance with host preference. The 
interolog mapping approach has been used to 
transfer interaction information from animal 
model organisms such as_ Drosophila, 
Caenorhabditis elegans, and human to the model 
plant Arabidopsis. In Arabidopsis, approximately 
10,000 orthologs have been detected in at least 
one reference species (i.e., in yeast, Drosophila, 
or humans) (Morsy et al. 2008). Structure model- 
ing and docking approaches have also been used 
to develop a peptide which can competitively 
inhibit the crucial glyoxylate pathway in fungi 
(Srivastava et al. 2010). 


3.4.4 Computational Techniques 
that Can Aid Studies on Host- 
Fungus Interactions 


Different computational techniques which can 
help in the study of host—fungus interaction are 
enlisted below with a brief description: 


3.4.4.1 Analogies from Mammal- 
Pathogen Interactions 

Of the 1.5—5.1 million fungal species, an esti- 

mated 270,000 species are associated with plants, 

and 325 are known to infect humans (Blackwell 


2011; Hawksworth and Rossman 1997; Robert 
and Casadevall 2009; Woolhouse and Gaunt 
2007; Gauthier and Keller 2013). A small subset 
of plant pathogens such as E. rostratum can cross 
kingdoms and infect humans. These crossover 
pathogens include fungi from Ascomycota and 
Mucoromycotina phyla. The human host infected 
by crossover fungi is generally immunocompro- 
mised. Gauthier and Keller (2013) enlist the 
fungi which can cause disease in both plants and 
human. Fungi have different methods of infecting 
plants and animals; however, they produce the 
same mycotoxins and secondary metabolites, 
assimilate similar nutrients such as iron, and have 
similar MAPK, cAMP-PKA, and G-protein sig- 
naling pathways. Targeted mutagenesis allows 
characterization of fungal pathogenesis accord- 
ing to the loss of function. The mutagenesis relies 
on either random or targeted insertion of a select- 
able marker. The targeted genes are identified by 
comparative genome analysis between pathogen 
and non-pathogen and/or by large-scale tran- 
scriptome analyses. PHI-base (Pathogen—Host 
Interaction Database) catalogs the phenotypes 
resulting from mutations in defined genes of both 
plant and animal pathogens (Sexton and Howlett 
2006). The genomic resources which can be 
exploited for orthologous studies of plant hosts 
are rice and Arabidopsis thaliana as well as non- 
mammalian hosts such as Drosophila melano- 
gaster and Caenorhabditis elegans. 


3.4.4.2 Predicted Interactomes 

Two types of interactome have widely been rec- 
ognized: genetic and physical. A genetic interac- 
tome is a network of genes characterized on the 
basis of genetic interactions serving to elucidate 
gene function within physiological processes 
(Yuan et al. 2008). Genetic interactions can be 
predicted using a systematic genetic analysis 
(SGA)-based method. In this approach a query 
mutation is crossed to an entire mutation array set 
to identify any genetic interaction, yielding func- 
tional information of the query gene and the gene 
it interacts with. The SGA-based genetic interac- 
tome has also been applied successfully in the 
nematode Caenorhabditis elegans, the fungi 
Cryptococcus neoformans, the virus Neisseria 
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meningitidis, and many other physiological pro- 
cesses and protein families. SGA in plants is 
however complicated because of polyploidy. As 
compared with genetic interactions, physical 
interactions among proteins are relatively easier 
to characterize in plants. Different protein—pro- 
tein interaction methodologies have been enlisted 
by Morsy et al. (2008). The computational tech- 
nique which has been used to define the protein 
interactions is “interologs.” Interologs are the 
predicted interactive protein pairs, based on 
orthologous proteins that are known to interact in 
the reference species such as yeast and human. 
The interactome of Arabidopsis was predicted by 
Geisler-Lee et al. (2007) using confidence scor- 
ing based on the number of observations and 
experimental approaches that detected the inter- 
action, together with subcellular location and co- 
expression data, to indicate the reliability of the 
interaction. Using the interolog method, 
11,674 PPIs among 3,017 M. grisea proteins 
were also inferred from the experimental PPI 
data in different organisms (He et al. 2008). The 
potential of interactomes in understanding the 
systemic regulation of biological processes is 
unquestionable. However, the techniques to build 
the interactome should be used carefully. 


3.4.4.3 Metabolic Modeling 

Fungal infection of plants leads to changes in the 
secondary metabolism to boost defense pro- 
grams. It also affects primary metabolism affect- 
ing growth in plants. Charting the metabolic 
changes can lead to better understanding of fun- 
gal infections in plants. A set of linear pathways 
often cannot capture the full complexity and 
behavior of a metabolic network (Schuster et al. 
2000). However once the genome of the intended 
microbe has been annotated, the entire metabolic 
map representing the stoichiometry of all the 
metabolic reactions taking place in the cell can be 
constructed. Metabolic networks have been 
restructured for different microbes such as 
Mycoplasma pneumoniae (Schuster et al. 2002), 
Listeria monocytogenes (Schauer et al. 2010), 
Staphylococcus aureus (Cecil et al. 2011; Liang 
et al. 2011), Staphylococcus epidermidis (Cecil 
et al. 2011), and E. coli (Jain and Srivastava 
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2009); however, the metabolic models for fungi 
are still in their infancy. With the increasing num- 
ber of sequencing projects and increasing 
amounts of metabolic data, the metabolic model 
for plant—fungus interactions will soon increase. 


3.4.4.4 EST Mining and Functional 
Expression Assays 

Microbial effectors are small secreted proteins 
that are capable of manipulating the microbial 
biochemistry by triggering a defense response. 
They can act out of the host cell or can enter the 
living host cell by specialized pathogen-derived 
structures such as type III secretion systems in 
bacteria; haustoria in fungi, oomycetes, and para- 
sitic plants; and the nematode stylet. PexFinder, 
an algorithm for automated identification of 
extracellular proteins from EST data sets, was 
developed and applied to 2147 ESTs from the 
oomycete plant pathogen Phytophthora infes- 
tans. The program identified 261 ESTs (12.2 %) 
corresponding to a set of 142 nonredundant Pex 
(Phytophthora extracellular protein) cDNAs, of 
which 78 (55 %) Pex cDNAs had no significant 
matches in public databases (Torto et al. 2003). 
This appears to be a promising approach to iden- 
tify the microbial effectors in other fungal 
pathogens. 


3.4.5 Fungal Pathogen Genomics 


The advances in genomics and sequencing tech- 
niques have allowed the analysis of gene expres- 
sion changes at the whole-genome level. Different 
sequencing projects are ongoing. A_ fungal 
genome initiative (FGI) was initiated in 2000 to 
promote the sequencing of medically and indus- 
trially important fungi (http://www.broad.mit. 
edu/annotation/fgi). The Genome Online 
Database (GOLD) contains a number of draft and 
complete sequence project for plant fungi (Pagani 
et al. 2012). With the abundance of sequencing 
and genomics data, data analysis will become the 
rate-limiting step. Most gene prediction pro- 
grams have been based on humans and do not 
automatically work for fungi. For the correct 
structural prediction of pathogenic genes, it will 
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be more realistic to train the gene-finding soft- 
ware EST infection libraries (Soderlund 2009). 
In closely related plant-pathogenic fungi, com- 
parative analysis can be used to improve de novo 
gene prediction and identify genes involved in 
host range determination, infection-related mor- 
phogenesis, and virulence (Xu et al. 2006). For 
pathogen-host studies, genes that are not found 
in abundance in the non-pathogens but are found 
in pathogens are natural candidates for pathoge- 
nicity. Functional analysis of genes is possible by 
testing the effect of mutated genes on pathoge- 
nicity. The Plant-Associated Microbe Gene 
Ontology (PAMGO) Consortium has collabo- 
rated with the Gene Ontology (GO) Consortium 
to define GO terms to describe the biological pro- 
cesses common to plant and animal microbes 
(Xu et al. 2006). 


3.4.5.1 Tools, Techniques, 
and Resources for Plant- 
Pathogen Studies 
While the number of symbiotic and pathogenic 
fungi with genome-based research is limited, the 
resources available and subsequently genes 
sequenced are increasing. The ESTS, genome 


sequence data, microarray data, in silico muta- 
tion models, and protein interaction databases are 
available to use for the organism of interest. 
Although all these resources have their limita- 
tions and should be used in combination with 
in vitro and in vivo experiments, their availability 
is capable of saving all kinds of expenses. The 
work of the Human Proteome Organization 
Proteomics Standards Initiative (HUPO-PSI; 
http://www.psidev.info/) has greatly improved 
the ability to combine or compare interaction 
data from diverse sources. In addition, HUPO-— 
PSI has published guidelines outlining the mini- 
mum information required for reporting a 
molecular interaction experiment (MIMIx) 
(Mitchell et al. 2003). Some useful weblinks are 
listed in Table 3.1. 


3.5 Future Challenges in Plant- 


Fungus Studies 


Network modeling approaches are often based on 
assumptions and generalizations and hence are 
not capable of completely representing the actual 
biological scenario; however, the predictions 


Table 3.1 Repositories useful for plant—fungus interaction studies 


Function 
EST database with EST data 


Name 
Phytopathogenic Fungi and 


Weblink 
http://cogeme.ex.ac.uk/ 


Reference 

Soanes and Talbot 
(2006), Soanes et al. 
(2002) 


Oomycete EST Database from eighteen plant-pathogenic 
fungi, two species of 
oomycetes, and three species of 
saprophytic fungi 

IntAct Interaction data from literature 


curation and user submission 


Molecular Interaction 
Database (MINT) 


Experimentally verified 
protein—protein interaction 


mined from scientific literature 


Arabidopsis thaliana 
Protein Interactome 
Database (AtPID) 


Information pertaining to 
protein—protein interaction 


thaliana proteome 


Database of Interacting 
Protein (DIP) 


Biomolecular Interaction 
Network Database (BIND) 


Experimentally determined 


complex and pathway 
information 


networks, domain architecture, 
ortholog information, and GO 
annotation in the Arabidopsis 


interactions between proteins 
Biomolecular interaction and 


http://www.ebi.ac.uk/intact/ Orchard et al. (2014) 


http://mint.bio.uniroma2.it/ 
mint/Welcome.do 


Licata et al. (2012) 


http://www.megabionet.org/ Li et al. (2011) 


atpid/webfile/ 


http://dip.doe-mbi.ucla.edu/ 
dip/Main.cgi 
http://baderlab.org/ 
BINDTranslation 


Salwinski et al. (2004) 


Bader et al. (2003) 


(continued) 
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Table 3.1 (continued) 


Name 


Biological General 
Repository for Interaction 
Datasets (BioGRID) 


Pathogen—Host Interaction 
Database (PHI-base) 


Comprehensive 
Phytopathogen Genomics 
Resource (CPGR) 


Genomes Online Database 
(GOLD) 


Plant Expression Database 
(PlexDB) 


Fungal Secretome 
KnowledgeBase 
(FunSecKB) 


Function 


Interaction data through 
comprehensive curation 


Catalogs experimentally 
verified pathogenicity, 


virulence, and effector genes 


from fungal, oomycete, and 
bacterial pathogens, which 


infect animal, plant, fungal, and 


insect hosts 


Comprehensive plant pathogen 


genomics and annotation 
resource 


Information regarding genome 
and metagenome sequencing 


projects 
Unified gene expression 


resource for plants and plant 


pathogens 

Resource of secreted fungal 
proteins, 1.e., secretomes, 
identified from all available 


Weblink 
http://thebiogrid.org/ 


http://www.phi-base.org/ 


http://cpgr.plantbiology. 
msu.edu/ 


http://genomesonline.org/ 
cgi-bin/GOLD/index.cgi 


http://www.plexdb.org/ 


http://proteomics.ysu.edu/ 
secretomes/fungi.php 


M. Srivastava et al. 


Reference 
Stark et al. (2006) 


Winnenburg et al. 
(2008) 


Hamilton et al. (2011) 


Pagani et al. (2012) 


Dash et al. (2012) 


Lum and Min (2011) 


fungal protein data in the NCBI 


RefSeq database 


ought to improve as more interactome data 
become available about plants and thus will serve 
as powerful tools for generating hypotheses for 
future experimental investigations. 


Acknowledgment The financial support from German 
Research Foundation (grant TR124/B1) is gratefully 
acknowledged. We thank Jennifer Braysher for stylistic 
and language corrections. 


References 


Albersheim P, Valent BS (1974) Plant pathogens secrete 
proteins which inhibit enzymes of the host capable of 
attacking the pathogen. Plant Physiol 53:684—687 

Antoniw JF, Pierpoint WS (1978) Purification of a tobacco 
leaf protein associated with resistance to virus infec- 
tion. Biochem Soc Trans 6:248—250 

Arntzen CJ, Haugh MF, Bobrick S (1973) Induction of 
stomatal closure by Helminthosporium maydis patho- 
toxin. Plant Physiol 52:569-574 

Bader GD, Betel D, Hogue CW (2003) BIND: the biomo- 
lecular interaction network database. Nucl Acids Res 
31(1):248-250 

Bhadauria V, Banniza S, Wei Y, Peng YL (2009) Reverse 
genetics for functional genomics of phytopathogenic 


fungi and oomycetes. 
2009:380719 

Bindsley L, Oliver RP, Johansen B (2002) Jn situ PCR for 
detection and identification of fungal species. Mycol 
Res 106(3):277-279 

Blackwell M (2011) The fungi: 1, 2,3 ... 5.1 million spe- 
cies. Am J Bot 98:426-438 

Brechenmacher L, Weidmann S, van Tuinen D et al (2004) 
Expression profiling of up-regulated plant and fungal 
genes in early and late stages of Medicago truncatula- 
Glomus mosseae interactions. Mycorrhiza 14:253—262 

Brown JS, Holden DW (1998) Insertional mutagenesis 
of pathogenic fungi. Curr Opion Microbiol 
1(4):390-394 

Capote N, Pastrana AM, Aguado A, Sanchez-Torres P 
(2012) Molecular tools for detection of plant patho- 
genic fungi and fungicide resistance. In: Cumagun CJ 
(ed) Plant pathology. InTech, Rijeka (Croatia), pp 
150-202, Chap. 7, ISBN:978-953-51-0489-6 

Carpenter MA, Stewart A, Ridgway HJ (2005) 
Identification of novel Trichoderma hamatum genes 
expressed during mycoparasitism using subtractive 
hybridisation. FEMS Microbiol Lett 251:105—112 

Cecil A, Rikanovic C, Ohlsen K, Liang C, Bernhardt J, 
Oelschlaeger TA et al (2011) Modeling antibiotic and 
cytotoxic effects of the dimeric isoquinoline IQ-143 
on metabolism and its regulation in Staphylococcus 
aureus, Staphylococcus epidermidis and human cells. 
Genome Biol 12(3):R24 


Comp Funct Genomics 


3 Application of Biotechnology and Bioinformatics Tools in Plant—-Fungus Interactions 61 


Chalupova J, Raus M, Sedlafova M, Sebela M (2014) 
Identification of fungal microorganisms by MALDI- 
TOF mass _ spectrometry. Biotechnol Adv 
32(1):230-241 

Chen L, Hao L, Parry MA, Phillips AL, Hu YG (2014) 
Progress in TILLING as a tool for functional genom- 
ics and improvement of crops. J Integr Plant Biol. 
doi:10.1111/jipb.12192 

Collas P (2010) The current state of chromatin immuno- 
precipitation. Mol Biotech 45(1):87—100 

Cramer RA, Lawrence CB (2004) Identification of 
Alternaria brassicicola genes expressed in plants dur- 
ing pathogenesis of Arabidopsis thaliana. Fungal 
Genet Biol 41:115-128 

Dash S, Van Hemert J, Hong L, Wise RP, Dickerson JA 
(2012) PLEXdb: gene expression resources for plants 
and plant pathogens. Nucleic Acids Res 40(Database 
issue):D1194—D 1201 

Diatchenko L, Lukyanov S, Lau YF, Siebert PD (1999) 
Suppression subtractive hybridization: a_ versatile 
method for identifying differentially expressed genes. 
Methods Enzymol 303:349-380 

Fitzgerald A, Van Kha JA, Plummer KM (2004) 
Simultaneous silencing of multiple genes in the apple 
scab fungus Venturia inaequalis, by expression of 
RNA with chimeric inverted repeats. Fungal Genet 
Biol 41:963-971 

Fosu-Nyarko J, Jones Michael GK, Wang Z (2010) 
Application of laser microdissection to study plant— 
fungal pathogen interactions. In: Sharon A (ed) 
Molecular and cell biology methods for fungi. Humana 
Press, New York 

Gauthier GM, Keller NP (2013) Crossover fungal patho- 
gens: the biology and pathogenesis of fungi capable of 
crossing kingdoms to infect plants and humans. 
Fungal Genet Biol 61:146—157 

Geisler-Lee J, O’ Toole N, Ammar R, Provart NJ, Millar 
AH, Geisler M (2007) A predicted interactome for 
Arabidopsis. Plant Physiol 145(2):317—329 

Godovac-Zimmermann J, Brown LR (2001) Perspectives 
for mass spectrometry and functional proteomics. 
Mass Spectrom Rev 20:1—57 

Goldoni M, Azzalin G, Macino G, Cogoni C (2004) 
Efficient gene silencing by expression of double 
stranded RNA in Neurospora crassa. Fungal Genet 
Biol 41:1016—1024 

Gonzalez-Fernandez R, Prats E, Jorrin-Novo JV (2010) 
Proteomics of plant pathogenic fungi. J Biomed 
Biotechnol 2010:932527 

Grover A, Gowthaman R (2003) Strategies for develop- 
ment of fungus-resistant transgenic plants. Curr Sci 
8(3):330-340 

Gudimella R, Nallapeta S, Varadwaj P, Suravajhala P 
(2010) Fungome: annotating proteins implicated in 
fungal pathogenesis. Bioinformation 5(5):202—207 

Hamada W, Spanu PD (1998) Co-suppression of the 
hydrophobin gene Hcf-/ is correlated with antisense 
RNA biosynthesis in Cladosporium fulvum. Mol Gen 
Genet 259:630-638 

Hamilton JP, Neeno-Eckwall EC, Adhikari BN, Perna NT, 
Tisserat N, Leach JE et al (2011) The comprehensive 


phytopathogen genomics resource: a web-based 
resource for data-mining plant pathogen genomes. 
Database (Oxford) 2011:bar053 

Hammond TM, Keller NP (2005) RNA silencing in 
Aspergillus nidulans is independent of RNA- 
dependent RNA polymerase. Genetics 169:607—617 

Hawksworth DL, Rossman AY (1997) Where are all the 
undescribed fungi? Phytopathology 87:888-89 1 

He F, Zhang Y, Chen H, Zhang Z, Peng YL (2008) The 
prediction of protein-protein interaction networks in 
rice blast fungus. BMC Genomics 9:519 

Horwitz BA, Lev S (2010) Identification of differentially 
expressed fungal genes in planta by suppression sub- 
traction hybridization. Methods Mol Biol 
638:115-—123 

Huggett J, Dheda K, Bustin S, Zumla A (2005) Real-time 
RT-PCR normalization; strategies and considerations. 
Genes Immun 6:279—284 

Inada N, Wildermuth MC (2005) Novel tissue preparation 
method and cell-specific marker for laser microdissec- 
tion of Arabidopsis mature leaf. Planta 221:9-16 

Jain R, Srivastava R (2009) Metabolic investigation of 
host/pathogen interaction using MS2-infected 
Escherichia coli. BMC Syst Biol 3:121 

Jones JD, Dangl JL (2006) The plant immune system. 
Nature 444(7117):323-329 

Kadotani N, Nakayashiki H, Tosa Y, Mayama S (2003) 
RNA silencing in the phytopathogenic fungus 
Magnaporthe oryzae. Mol Plant Microbe Interact 
16(9):769-776 

Kiraly Z, El Hammady M, Pozsar BI (1967) Increased 
cytokinin activity of rust-infected bean and broad- 
bean leaves. Phytopathology 57:93—94 

Lamour KH, Finley L, Hurtado-Gonzales O, Gobena D, 
Tierney M, Meijer HJG (2006) Targeted gene muta- 
tion in phytophthora spp. Mol Plant Microbe Interact 
19(12):1359-1367 

Latijnhouwers M, Ligterink W, Vleeshouwers VGAA, 
West P, Govers F (2004) A Galpha subunit controls 
zoospore motility and virulence in the potato late 
blight pathogen Phytophthora infestans. Mol 
Microbiol 51:925—936 

Li P, Zang W, Li Y, Xu F, Wang J, Shi T (2011) AtPID: the 
overall hierarchical functional protein interaction net- 
work interface and analytic platform for Arabidopsis. 
Nucleic Acids Res 39(Database issue):D1130—D1133 

Liang C, Liebeke M, Schwarz R, Ziihlke D, Fuchs S, 
Menschner L et al (2011) Staphylococcus aureus 
physiological growth limitations: insights from flux 
calculations built on proteomics and external metabo- 
lite data. Proteomics 11(10):1915—1935 

Licata L, Briganti L, Peluso D, Perfetto L, Iannuccelli M, 
Galeota E et al (2012) MINT, the molecular interac- 
tion database: 2012 update. Nucleic Acids Res 
40(Database issue):D857—D861 

Liu Z, Wang Y, Xiao J, Zhao J, Liu M (2014) Identification 
of genes associated with phytoplasma resistance 
through suppressive subtraction hybridization in 
Chinese jujube. Physiol Mol Plant P 86:43-48 

Lossi L, Gambino G, Salio C, Merighi A (2011) Direct in 
situ rt-PCR. Methods Mol Biol 789:111—126 


62 


Luderer R, Takken FL, de Wit PJ, Joosten MH (2002) 
Cladosporium fulvum overcomes Cf-2-mediated resis- 
tance by producing truncated AVR2 elicitor proteins. 
Mol Microbiol 45(3):875—884 

Ludwig-Miilller J (2000) Hormonal balance in plants dur- 
ing colonization by mycorrhizal fungi. In: Yoram K, 
David DD (eds) Arbuscular mycorrhizas: physiology 
and function. Springer, Dordrecht, pp 263-285. ISBN 
978-90-48 1-55 15-6 

Lum G, Min XJ (2011) FunSecKB: the fungal secretome 
knowledge base. Database (Oxford) 201 1:bar001 

Mann M, Hendrickson RC, Pandey A (2001) Analysis of 
proteins and proteomes by mass spectrometry. Annu 
Rev Biochem 70:437-473 

McCallum CM, Comai L, Greene EA, Henikoff S (2000) 
Targeting induced local lesions IN genomes 
(TILLING) for plant functional genomics. Plant 
Physiol 123(2):439-442 

Mehdy MC (1994) Active oxygen species in plant defense 
against pathogens. Plant Physiol 105:467-472 

Meinhardt LW, Lacerda C, Gustavo G, Thomazella DPT, 
Teixeira PJPL, Carazzolle MF (2014) Genome and 
secretome analysis of the hemibiotrophic fungal 
pathogen, Moniliophthora roreri, which causes frosty 
pod rot disease of cacao: mechanisms of the biotrophic 
and necrotrophic phases. BMC Genomics 15:164 

Melissa DA, Jeff JS (2002) From sequence to phenotype: 
reverse genetics in Drosophila melanogaster. Nat Rev 
Genet 3:189-198 

Mendes-Giannini MJ, Soares CP, da Silva JL, Andreotti 
PF (2005) Interaction of pathogenic fungi with host 
cells: molecular and cellular approaches. FEMS 
Immunol Med Microbiol 45(3):383—394 

Mendgen K, Hahn M, Deising H (1996) Morphogenesis 
and mechanisms of penetration by plant pathogenic 
fungi. Annu Rev Phytopathol 34:367-386 

Mitchell TK, Thon MR, Jeong J-S, Brown D, Deng J, 
Dean RA (2003) The rice blast pathosystem as a case 
study for the development of new tools and raw mate- 
rials for genome analysis of fungal plant pathogens. 
New Phytol 159:53-61 

Morissette DC, Dauch A, Beech R, Masson L, Brousseau 
R, Jabaji-Hare S (2008) Isolation of mycoparasitic- 
related transcripts by SSH during interaction of the 
mycoparasite Stachybotrys elegans with its host 
Rhizoctonia solani. Curr Genet 53:67—80 

Morsy M, Gouthu S, Orchard S, Thorneycroft D, Harper 
JF, Mittler R et al (2008) Charting plant interactomes: 
possibilities and challenges. Trends Plant Sci 
13(4):183-191 

Murray F, Llewellyn D, McFadden H, Last D, Dennis E, 
Peacock WJ (1999) Expression of the Talaromyces fla- 
vus glucose oxidase gene in cotton and tobacco 
reduces fungal infection, but is also phytotoxic. Mol 
Breed 5:219—232 

Nakayashiki H (2005) RNA silencing in fungi: mecha- 
nisms and applications. FEBS Lett 579:5950—-5970 

Naseem M, Dandekar T (2012) The role of auxin- 
cytokinin antagonism in plant-pathogen interactions. 
PLoS Pathog 8(11):e1003026 


M. Srivastava et al. 


Naseem M, Philippi N, Hussain A, Wangorsch G, Ahmed 
N, Dandekar T (2012) Integrated systems view on net- 
working by hormones in Arabidopsis immunity 
reveals multiple crosstalk for cytokinin. Plant Cell 
24(5):1793-1814 

Naseem M, Kaltdorf M, Hussain A, Dandekar T (2013) 
The impact of cytokinin on jasmonate-salicylate 
antagonism in Arabidopsis immunity against infection 
with Pst DC3000. Plant Signal Behav 8:e26791 

Naseem M, Kunz M, Dandekar T (2014) Probing the 
unknowns in cytokinin-mediated immune defense in 
Arabidopsis with systems biology approaches. 
Bioinform Biol Insights 8:35—44 

Nowrousian M (2014) Genomics and transcriptomics to 
analyze fruiting body development fungal genomics. 
Mycota 13:149-172 

Orchard S, Ammari M, Aranda B, Breuza L, Briganti L, 
Broackes-Carter F et al (2014) The MIntAct project— 
IntAct as a common curation platform for 11 molecu- 
lar interaction databases. Nucleic Acids Res 
42(Database issue):D358—D363 

Pagani I, Liolios K, Jansson J, Chen IM, Smirnova T, 
Nosrat B, Markowitz VM, Kyrpides NC (2012) The 
Genomes OnLine Database (GOLD) v. 4: status of 
genomic and metagenomic projects and their associ- 
ated metadata. Nucleic Acids Res 40(Database 
issue):D571—D579 

Pasche JS, Mallik I (2013) Development and validation of 
a real-time PCR assay for the quantification of 
Verticillium dahliae in potato. Plant Dis 
97(5):608-618 

Pritchard L, Birch P (2011) A systems biology perspective 
on plant—microbe interactions: biochemical and struc- 
tural targets of pathogen effectors. Plant Sci 
180:584—603 

Rautio JJ, Huuskonen A, Vuokko H, Vidgren V, 
Londesborough J (2007) Monitoring yeast physiology 
during very high gravity wort fermentations by fre- 
quent analysis of gene expression. Yeast 24:741—760 

Rautio JJ, Satokari R, Vehmann-Kreula P, Serkkola E, 
Soderlund H (2008) TRAC in high-content gene 
expression analysis: applications in microbial popula- 
tion studies, process biotechnology and biomedical 
research. Expert Rev Mol Diagn 8:379-385 

Rebrikov Denis V, Desai Sejal M, Siebert Paul D, 
Lukyanov Sergey A (2004) Suppression subtractive 
hybridization. In: Gene expression profiling. Methods 
Mol Biol 258:107-134 

Reski R (1998) Physcomitrella and Arabidopsis: the 
David and Goliath of reverse genetics. Trends Plant 
Sci 3(6):209-210 

Rintala E, Wiebe MG, Tamminen A, Ruohonen L, Penttila 
M (2008) Transcription of hexose transporters of 
Saccharomyces cerevisiae 1s affected by change in 
oxygen provision. BMC Microbiol 8:53 

Robert VA, Casadevall A (2009) Vertebrate endothermy 
restricts most fungi as potential pathogens. J Infect Dis 
200: 1623-1626 

Salwinski L, Miller CS, Smith AJ, Pettit FK, Bowie JU, 
Eisenberg D (2004) The database of interacting 


3 Application of Biotechnology and Bioinformatics Tools in Plant-Fungus Interactions 63 


proteins: 2004 update. Nucleic Acids Res 32(Database 
issue):D449—-D451 

Schauer K, Geginat G, Liang C, Goebel W, Dandekar T, 
Fuchs TM (2010) Deciphering the intracellular metab- 
olism of Listeria monocytogenes by mutant screening 
and modeling. BMC Genomics 11:573 

Schuster S, Fell DA, Dandekar T (2000) A general defini- 
tion of metabolic pathways useful for systematic orga- 
nization and analysis of complex metabolic networks. 
Nat Biotechnol 18(3):326—332 

Schuster S, Pfeiffer T, Moldenhauer F, Koch I, Dandekar 
T (2002) Exploring the pathway structure of metabo- 
lism: decomposition into sub-networks and applica- 
tion to Mycoplasma pneumoniae. Bioinformatics 
18(2):351-361 

Seddas-Dozolme PM, Arnould C, Tollot M, Kuznetsova 
E, Gianinazzi-Pearson V (2010) Expression profiling 
of fungal genes during arbuscular mycorrhiza symbio- 
sis establishment using direct fluorescent in situ 
RT-PCR. Methods Mol Biol 638:137—152 

Sexton AC, Howlett BJ (2006) Parallels in fungal patho- 
genesis on plant and animal hosts. Eukaryot Cell 
5(12):1941-1949 

Shahzad K, Loor JJ (2012) Application of top-down and 
bottom-up systems approaches in ruminant physiol- 
ogy and metabolism. Curr Genomics 13(5):379-394 

Silva J, Chang K, Hannon GJ, Rivas FV (2004) RNA- 
interference-based functional genomics in mammalian 
cells: reverse genetics coming of age. Oncogene 
23:8401-8409 

Slade Ann J, Fuerstenberg Susan I, Loeffler D, Steine 
Michael N, Facciotti D (2005) A reverse genetic, non- 
transgenic approach to wheat crop improvement by 
TILLING. Nat Biotech 23:75-81 

Soanes DM, Talbot NJ (2006) Comparative genomic anal- 
ysis of phytopathogenic fungi using expressed 
sequence tag (EST) collections. Mol Plant Pathol 
7(1):61—70 

Soanes DM, Skinner W, Keon J, Hargreaves J, Talbot NJ 
(2002) Genomics of phytopathogenic fungi and the 
development of bioinformatic resources. Mol Plant 
Microbe Interact 15(5):421—427 

Soderlund C (2009) Computational techniques for eluci- 
dating plant-pathogen interactions from large-scale 
experiments on fungi and oomycetes. Brief Bioinform 
10(6):654—663 

Srivastava M, Akhoon BA, Gupta SK, Gupta SK (2010) 
Development of resistance against blackleg disease in 
Brassica oleracea var. botrytis through in silico meth- 
ods. Fungal Genet Biol 47(10):800—808 

Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz 
A, Tyers M (2006) BioGRID: a general repository for 
interaction datasets. Nucleic Acids Res 34(Database 
issue):D535—D539 

Strobel G, Hess WM (1974) Evidence for the presence of 
the toxin-binding protein on the plasma membrane of 
sugarcane cells. Proc Natl Acad Sci U S A 
71:1413-1417 

Talbot NJ, Kershaw MJ, Wakley GE, de Vries OMH, 
Wessels JGH, Hamer JE (1996) MPGI encodes a fun- 


gal hydrophobin involved in surface interactions dur- 
ing infection-related development of Magnaporthe 
grisea. Plant Cell 8:985—999 

Tang WH, Zhang Y, Duvick J (2012) The application of 
laser microdissection to profiling fungal pathogen 
gene expression in planta. Methods Mol Biol 
835:219-236 

Tollot M, Hoi Joanne WS, van Tuinen D, Arnould C, 
Chatagnier O, Dumas B, Gianinazzi-Pearson V, 
Seddas Pascale MA (2009) An STE12 gene identified 
in the mycorrhizal fungus Glomus intraradices 
restores infectivity of a hemibiotrophic plant patho- 
gen. New Phytol 181:693—707 

Torto TA, Li S, Styer A, Huitema E, Testa A, Gow NA, 
van West P, Kamoun S (2003) EST mining and func- 
tional expression assays identify extracellular effector 
proteins from the plant pathogen Phytophthora. 
Genome Res 13(7):1675—1685 

van Driel KGA, Boekhout T, W6sten HAB, Verkleij AJ, 
Miller WH (2007) Laser microdissection of fungal 
septa as visualized by scanning electron microscopy. 
Fungal Genet Biol 44:466-473 

Van Loon LC, Pierpoint WS, Boller T, Conejero V (1994) 
Recommendations for naming plant pathogenesis- 
related proteins. Plant Mol Biol Rep 12:245-—264 

Venu RC, Zhang Y, Weaver B, Carswell P, Mitchell TK, 
Meyers BC et al (2011) Large scale identification of 
genes involved in plant-fungal interactions using 
Illumina’s —_sequencing-by-synthesis —_ technology. 
Methods Mol Biol 722:167—178 

Verberne MC, Verpoorte R, Bol JF, Mercado-Blanco J, 
Linthorst HJM (2000) Overproduction of salicylic 
acid in plants by bacterial transgenes enhances 
pathogen resistance. Nat Biotechnol 18:779-—783 

Vieira A, Talhinhas P, Loureiro A, Duplessis S, Fernandez 
D, Silva M et al (2011) Validation of RT-qPCR refer- 
ence genes for in planta expression studies in Hemileia 
vastatrix, the causal agent of coffee leaf rust. Fungal 
Biol 115(9):891-901 

Walters DR, McRoberts N (2006) Plants and biotrophs: a 
pivotal role for cytokinins? Trends Plant Sci 
11(12):581-586 

Wang Z, Gerstein M, Snyder M (2009) RNA-Seg: a revo- 
lutionary tool for transcriptomics. Nat Rev Genet 
10(1):57-63 

Washburn MP (2004) Utilization of proteomics datasets 
generated via multidimensional protein identification 
technology (MudPIT). Brief Funct Genomic 
Proteomic 3(3):280—286 

Washburn MP, Wolters D, Yates IIIJR (2001) Large-scale 
analysis of the yeast proteome by multidimensional 
protein identification technology. Nat Biotechnol 
19(3):242—247 

Webling R, Panstruga R (2012) Rapid quantification of 
plant-powdery mildew interactions by qPCR and 
conidiospore counts. Plant Methods 8:35 

Weidmann S, Sanchez L, Descombin J, Chatagnier O, 
Gianinazzi S, Gianinazzi-P V (2004) Fungal elicita- 
tion of signal transduction-related plant genes pre- 
cedes mycorrhiza establishment and requires the dmi3 


64 


gene in Medicago truncatula. Mol Plant Microbe 
Interact 17(12):1385—1393 

Wessels GH, Sietsma JH (1981) Fungal cell walls: a sur- 
vey. In: Tanner W, Loewus FA (eds) Encyclopedia of 
plant physiology, vol 13B, New Series. Springer, 
Heidelberg, pp 352-394 

West P, Kamoun S, van’tKlooster JW, Govers F (1999) 
Internuclear gene silencing in Phytophthora infestans. 
Mol Cell 3(3):339-348 

Whisson SC, Avrova AO, West PV, Jones JT (2005) A 
method for double-stranded RNA-mediated transient 
gene silencing in Phytophthora infestans. Mol Plant 
Pathol 6(2):153-—163 

Wilhelm J, Pingoud A (2003) Real-time polymerase chain 
reaction. Chem Bio Chem 4(11):1120—1128 

Winnenburg R, Urban M, Beacham A, Baldwin TK, 
Holland S, Lindeberg M et al (2008) PHI-base update: 
additions to the pathogen host interaction database. 
Nucleic Acids Res 36(Database issue):D572—D576 

Winzeler EA, Shoemaker DD, Astromoff A, Liang H, 
Anderson K, Andre B, Bangham R et al (1999) 
Functional characterization of the Saccharomyces 
cervisiaeé genome by precise deletion and parallel 
analysis. Science 285(5429):901—906 


M. Srivastava et al. 


Woolhouse M, Gaunt E (2007) Ecologic origins of novel 
human pathogens. Crit Rev Microbiol 33:23 1-242 
Xu JR, Peng YL, Dickman MB, Sharon A (2006) The 
dawn of fungal pathogen genomics. Annu Rev 
Phytopathol 44:337-366 

Yakovlev IA, Hietala AM, Steffenrem A, Solheim H, 
Fossdal CG (2008) Identification and analysis of dif- 
ferentially expressed Heterobasidion parviporum 
genes during natural colonization of Norway spruce 
stems. Fungal Genet Biol 45:498—513 

Yates JR III (1998) Mass spectrometry and the age of the 
proteome. J Mass Spectrom 33:1-19 

Yuan JS, Galbraith DW, Dai SY, Griffin P, Stewart CN Jr 
(2008) Plant systems biology comes of age. Trends 
Plant Sci 13(4):165-171 

Zhu C, Sanahuja G, Yuan D, Farré G, Arj6 G, Berman J 
et al (2013) Biofortification of plants with altered anti- 
oxidant content and composition: genetic engineering 
strategies. Plant Biotechnol J 11(2):129-141 

Zhuang X, McPhee KE, Coram T, Peever T, Chilvers M 
(2012) Rapid transcriptome characterization and pars- 
ing of sequences in a non-model host-pathogen inter- 
action pea-Sclerotinia sclerotiorum. BMC Genomics 
13:668 


Ps 


Kadirvel, S. Senthilvel, S. Geethanjali, M. Sujatha, 


and K.S. Varaprasad 


Abstract 

Genetic markers have long been used for characterization of plant genetic 
diversity and exploitation in crop improvement. The advent of DNA 
marker technology (during 1980s) has revolutionized crop breeding 
research as it has enabled the breeding of elite cultivars with targeted 
selection of desirable gene or gene combinations in breeding programmes. 
DNA markers are considered better over traditional morphology and 
protein-based markers because they are abundant, neutral, reliable, conve- 
nient to automate and cost-effective. Over the years, DNA marker 
technology has matured from restriction based to PCR based to sequence 
based and to eventually the sequence itself with the emergence of novel 
genome sequencing technologies. Trait mapping has been the foremost 
application of molecular markers in plant breeding. Genomic locations of 
numerous genes or quantitative trait loci (QTLs) associated with agro- 
nomically important traits have been determined in several crop plants 
using linkage or association mapping approaches. Plant breeders always 
look for an easy, rapid and reliable method of selection of desirable plants 
in breeding populations. Conventionally, desirable plants are selected 
based on phenotypic observations. The phenotypic selection for complex 
agronomic traits is difficult, unpredictable and challenging. Once the 
marker-trait association is correctly established, the gene- or QTL-linked 
markers can be used to select plants carrying desirable traits, the process 
called marker-assisted selection (MAS). Marker-assisted backcrossing 
(MABC) has been widely used for transferring single major gene or com- 
bination of major genes into the background of elite cultivar; the process 
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4.1 


Introduction 
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refers to gene pyramiding. Marker-assisted recurrent selection (MARS) 
and genome-wide association analysis (GWA) are considered potential 
MAS strategies for improvement of complex traits but still remain as theo- 
retical possibilities in plant breeding. Though molecular markers and 
MAS have promises for improved plant breeding process, the marker-trait 
associations or QTLs are statistical associations, which are influenced by 
several factors such as trait heritability, phenotyping methods, marker den- 
sity, population type and other experimental conditions that might lead to 
false positives. Therefore, a cautiously optimistic approach is necessary to 
consider MAS in crop breeding programmes. In this chapter, the potentials 
of genetic markers in plant breeding are described. 
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automated for convenience and cost-effective- 
ness. During the last three decades, DNA marker 


Historically, plant breeding has been augmented 
with novel tools and methods from time to time. 
In recent years, molecular marker technology 
(especially DNA-based markers) has become 
popular among plant breeders. Molecular mark- 
ers are widely used for characterizing genetic 
diversity in germplasm collections, mapping and 
identification of genes associated with important 
traits, mining of superior alleles and marker- 
assisted selection (MAS) of desirable genes/traits 
in breeding programmes. DNA markers are more 
attractive compared to conventional morphology 
and protein-based markers as they meet the char- 
acteristics of an ideal genetic marker system: 
unlimited in number, insensitive to environmen- 
tal conditions, highly reliable and can easily be 


technology has progressed tremendously; a vari- 
ety of marker types has been developed, which 
basically differ in their ability to detect DNA 
polymorphisms. The recent advent of next-gener- 
ation sequencing (NGS) technologies and bioin- 
formatics tools has enabled high-throughput 
development of sequence-based markers with 
extraordinarily less cost and time. 

Genetic enhancement for agronomically 
important traits is challenging because most of 
them are genetically complex, controlled by mul- 
tiple genes and influenced by environment. 
Classical breeding methods and tools are inade- 
quate to deal with complex traits. Molecular 
markers provide means to precisely identify the 
number of genes controlling complex traits, to 
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locate them on linkage maps and to quantify their 
effects thereby enabling breeders to selectively 
manipulate complex traits in breeding pro- 
grammes. In this chapter, the developments in 
genetic marker technology, its application in trait 
mapping and MAS for crop improvement are 
briefly described. 


4.2 Genetic Markers 

Genetic markers are the biological features, 1.e. 
traits, enzymes/proteins and fragments of DNA 
that are inherited from parent to progeny follow- 
ing the Mendelian segregation. These features are 
used to keep track of an individual or a gene, 
hence called ‘markers’. Genetic markers can be 
classified into four categories, viz. morphological 
markers, cytological markers, protein/isozymes 
and DNA markers. As the knowledge on biology 
improves, the marker system has evolved from 
using mere morphological features as markers to 
the level of using variation at DNA itself. More 
detailed description of different types of markers 
is given in the following sections. 


4.2.1. Morphological Markers 

Historically, plant breeders use easily observable 
morphological traits such as leaf shape, flower 
colour, pubescence, pod/seed colour, seed shape, 
and others to distinguish individuals as well as to 
use as a proxy in selection, when they are linked 
with other agronomic traits. Morphological 
markers have been reported in several crops, viz. 
rice, wheat, soybean, tomato and corn. For exam- 
ple, Khush (1987) has listed more than 300 mor- 
phological traits that can be used for genetic 
analysis in rice. A few associations/linkages 
between morphological and agronomic traits 
have been successfully used for selection in 
breeding populations. As early as 1923, Karl Sax 
found an association between seed colour and 
seed size in Phaseolus vulgaris that helped for 
selecting plants with large seeds. The other 
examples include linkage of semid-warfism with 
yield in rice and wheat, dwarfism with male 


sterility in wheat, which were extensively used in 
rice and wheat improvement during 1990s. 
Morphological markers are still relevant for 
genetic and breeding applications, if they show 
discrete variation in the study materials and the 
trait 1s not affected by developmental stage of the 
plant and environment. 


4.2.2 Cytological Markers 


The variations in banding patterns of the chromo- 
some are called ‘cytological markers’. They are 
used for chromosome characterization, detection 
of mutations and studying taxonomical relation- 
ships. The distribution of euchromatin and het- 
erochromatin is the basis for variations in the 
banding pattern. Generally, the variations are 
visualized in terms of colour, width, order and 
position of the bands created by staining the 
chromosomes. Based on the stains used, different 
banding techniques were developed that include 
Q-banding, G-banding, R-banding, C-banding, 
NOR-banding and T-banding. In recent times, 
fluorescent in situ hybridization (FISH) is used to 
detect and localize the presence or absence of 
specific DNA sequences on chromosomes using 
fluorescence-labelled DNA or RNA as probes. 
Cytological markers were widely used in physi- 
cal mapping and linkage group identification. 
Orellana et al. (1993) have mapped the genes 
controlling seed storage proteins to chromosome 
1R of rye using cytological markers. Santos et al. 
(2006) reported a translocation-based cytological 
marker that is associated with winter hardiness in 
oat. However, similar reports on linkage/associa- 
tion of cytological markers with traits of impor- 
tance are very minimal perhaps due to rare 
occurrence and laborious’ procedures for 
detection. 


4.2.3 Protein Markers 


Some of the enzymes are present in multiple 
forms in an individual and still carry out the same 
function. Such variants of enzymes are called 
‘isozymes’ or ‘allozymes’ and used as genetic 
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markers. The isozymes are enzyme variants that 
are the products of different genes and thus repre- 
sent different loci, whereas enzyme variants pro- 
duced by different alleles of the same gene are 
called allozymes. Isozymes were the first set of 
molecular markers. Enzyme variants differ in the 
amino acid sequences causing differences in size 
and charge, which can be visualized by gel elec- 
trophoresis. This class of markers was widely 
used in population genetic analysis during 1980s. 
However, protein-based markers are not stable 
across tissues, organs and developmental stages, 
and the number of informative marker loci is too 
small to use in gene mapping studies. 
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4.2.4 DNA Markers 


Technological advances have enabled us to visu- 
alize the differences in DNA sequences itself 
across genotypes, which have led to the advent of 
DNA-based markers. DNA markers are the varia- 
tions observed in a particular portion of the DNA 
among the individuals of a species. These varia- 
tions may be due to different mutations such as 
insertions, deletions and substitution or errors in 
replication of tandemly repeated DNA. Since 
1980, several DNA marker technologies have 
been developed (Table 4.1), which differ on the 
basis of polymorphism and the methodology to 


Table 4.1 List of different DNA markers in chronological order of their publication 


Nomenclature* 

Restriction fragment length polymorphism 
Short tandem repeats 

Variable number tandem repeats 
Allele-specific oligonucleotides 
Allele-specific polymerase chain reaction 
Oligonucleotide polymorphism 

Inverse PCR 

Single-stranded conformational polymorphism 
Sequence-tagged sites 

Randomly amplified polymorphic DNA 
Arbitrarily primed polymerase chain reaction 
Sequence-tagged microsatellite sites 
Restriction landmark genome scanning 

DNA amplification fingerprinting 

Expressed sequence tags 

Cleaved amplified polymorphic sequence 
Strand displacement amplification 

Simple sequence length polymorphism 
Degenerate oligonucleotide primer-PCR 
Simple sequence repeats 

Multiple arbitrary amplicon profiling 
Sequence characterized amplified region 
Microsatellite-primed PCR 

Inter-simple sequence repeats 

Selective amplification of microsatellite polymorphic loci 
Single nucleotide polymorphisms 

Random amplified microsatellite polymorphisms 
Single primer amplification reactions 
Amplified fragment length polymorphism 
Allele-specific associated primers 
Site-selected insertion PCR 


Acronym Reference 

RFLP Grodzicker et al. (1975) 

STR Hamada et al. (1982) 

VNTR Jeffreys et al. (1985) 

ASO Saiki et al. (1986) 

AS-PCR Landegren et al. (1988) 

OP Beckmann (1988) 

IPCR Triglia et al. (1988) 

SSCP Orita et al. (1989) 

STS Olsen et al. (1989) 

RAPD Williams et al. (1990) 
AP-PCR Welsh and McClelland (1990) 
STMS Beckmann and Soller (1990) 
RLGS Hatada et al. (1991) 

DAF Caetano-Anolles et al. (1991) 
EST Adams et al. (1991) 

CAPS Akopyanz et al. (1992) 

SDA Walker et al. (1992) 

SSLP Dietrich et al. (1992) 
DOP-PCR Telenius et al. (1992) 

SSR Akkaya et al. (1992) 

MAAP Caetano-Anolles et al. (1993) 
SCAR Paran and Michelmore (1993) 
MP-PCR Meyer et al. (1993) 

ISSR Zietkiewicz et al. (1994) 
SAMPL Morgante and Vogel (1994) 
SNP Jordan and Humphries (1994) 
RAMP Wu et al. (1994) 

SPAR Gupta et al. (1994) 

AFLP Vos et al. (1995) 

ASAP Gu et al. (1995) 

SSI Koes et al. (1995) 


(continued) 
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Table 4.1 (continued) 


Nomenclature* 

Cleavase fragment length polymorphism 

Inverse sequence-tagged repeats 

Directed amplification of minisatellite DNA-PCR 
Sequence-specific amplified polymorphism 
Multiplexed allele-specific diagnostic assay 
Retrotransposon-based insertional polymorphism 
Anchored simple sequence repeats 

Single feature polymorphism 
Inter-retrotransposon amplified polymorphism 
Retrotransposon-microsatellite amplified polymorphism 
Miniature inverted-repeat transposable element 
Three endonuclease-AFLP 

Inter-MITE polymorphisms 

Sequence-related amplified polymorphism 
Diversity arrays technology 

Target region amplification polymorphism 
Promoter anchored amplified polymorphism 
Conserved DNA-derived polymorphism 

Start codon targeted polymorphism 

Inter-primer binding site 

Inter-SINE amplified polymorphism 


“Some of the markers are very similar or synonyms 


detect it. They can be categorized into dominant 
or codominant types based on their ability to dis- 
tinguish heterozygous from homozygous indi- 
viduals. Dominant markers are bi-allelic and 
multi-locus, whereas codominant markers are 
either bi-allelic or multi-allelic and locus specific 
(Fig. 4.1). Some of the widely used DNA mark- 
ers based on hybridization and PCR techniques 
are described below. 


4.2.4.1 Restriction Fragment Length 
Polymorphism (RFLP) 

The RFLP uses the technique of Southern blot- 
ting, a nucleic acid hybridization technique 
developed by Southern (1975). It was first used in 
human linkage mapping (Botstein et al. 1980). 
The basis for polymorphism is the mutation 
occurring in or between restriction sites causing 
differences in the restriction products. RFLP 
markers are locus specific, mostly codominant 
and highly reproducible. It is a robust marker sys- 
tem for synteny analysis across different species. 
It does not require sophisticated equipment, but 


Acronym Reference 

CFLP Brow et al. (1996) 

ISTR Rohde (1996) 
DAMD-PCR Bebeli et al. (1997) 
S-SAP Waugh et al. (1997) 
MASDA Shuber et al. (1997) 

RBIP Flavell et al. (1998) 
ASSR Wang et al. (1998) 

SFP Winzeler et al. (1998) 
IRAP Kalendar et al. (1999) 
REMAP Kalendar et al. (1999) 
MITE Casa et al. (2000) 
TE-AFLP van der Wurff et al. (2000) 
IMP Chang et al. (2001) 

SRAP Li and Quiros (2001) 
DArT Jaccoud et al. (2001) 
TRAP Hu and Vick (2003) 

PAAP Pang et al. (2008) 

CDDP Collard and Mackill (2009) 
SCoT Collard and Mackill (2009) 
iPBS Kalendar et al. (2010) 
ISAP Seibt et al. (2012) 


large amount of high-quality DNA and technical 
expertise are paramount. Its throughput is very 
low, hence not widely used especially in plant 
breeding applications. 


4.2.4.2 Random Amplified 
Polymorphic DNA (RAPD) 

The RAPD was first demonstrated in the 1990s 
and is the simplest marker system to date. This is 
PCR-based technique, which requires only a 
single, short, random primer sequence to amplify 
DNA segments arbitrarily (Williams et al. 1990; 
Welsh and McClelland 1990). RAPD can detect 
several marker loci simultaneously in the form 
of presence or absence of the bands. The basis 
for polymorphism is the changes in DNA 
sequence that inhibit primer binding. RAPD was 
the most widely reported molecular markers till 
a decade ago because it is inexpensive and can 
be performed in any laboratory with limited 
infrastructure. But, it suffers from the problem 
of reproducibility, and many consider it as inap- 
propriate for any applications. However, it may 
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Fig.4.1 Properties of different DNA markers 


be still useful for certain applications such as 
looking at overall diversity of a set of samples or 
confirmation of hybrids. When useful RAPD 
band is identified, this could be converted to 
other types of PCR-based markers, like sequence 
characterized amplified region (SCAR) to 
improve the specificity (Michelmore et al. 1991; 
Martin et al. 1991). There are several variants of 
this technology such as arbitrarily primed PCR 
(AP-PCR) and DNA amplification fingerprinting 
(DAF). 


4.2.4.3 Inter-Simple Sequence Repeats 
(ISSR) 

The ISSR technique uses a single primer designed 
based on the complimentary sequences of a mic- 
rosatellite as core and few selective nucleotides 
as anchor at the 3’ end to amplify the sequences 
between two adjacent microsatellites that are 
oppositely oriented. The basis of polymorphism 
is the differences in length of the sequences 


Biallelic 


Polyallelic 





Biallelic 


P. Kadirvel et al. 





Single locus 





Multi locus 





between two adjacent microsatellites. The major 
advantage of the ISSR is that the primers can be 
synthesized without the prior sequence informa- 
tion. It is a simple technique and a single run can 
produce multiple bands. Like RAPD, ISSR is 
also a dominant marker and suffers from repro- 
ducibility and the possibility of occurrence of 
similar-sized nonhomologous fragments. This 
marker system has potential applications in fin- 
gerprinting and genetic identity tests. 


4.2.4.4 Amplified Fragment Length 
Polymorphism (AFLP) 

The AFLP is generated by digesting the genomic 
DNA with a pair of restriction enzymes and 
selective amplification of restriction fragments 
using PCR. The basis of polymorphism is the 
same as that of RFLP, but this technique allows 
visualization of several polymorphisms simulta- 
neously. The throughput is high. A typical AFLP 
run would produce about 100 bands, and about 
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70-80 % of them would be polymorphic depending 
on the nature of samples. This technique can be 
used for any organism without much initial 
investment in primer or probe development and 
sequence analysis. AFLPs are extremely useful 
as tools for DNA fingerprinting, studying diver- 
sity and germplasm characterization. However, 
the process is laborious and expensive. As that of 
RAPD, it is also a dominant marker though 
claims for codominant inheritance are also made 
based on band intensity. 


4.2.4.5 Microsatellites 

Microsatellites are tandemly repeated units of 
short nucleotide motifs of about 1—6 bp long. 
They are also known as simple sequence repeats 
(SSR), short tandem repeats (STR), simple 
sequence length polymorphisms (SSLP) or 
sequence-tagged microsatellite sites (STMS). 
The copy number of the repeats varies among 
individuals, and therefore, the polymorphism is 
observed in terms of variation in sequence length 
based on the number of repeats. Microsatellites 
are the markers of choice in most areas of molec- 
ular genetics because they are hyper-variable, 
highly reproducible, codominant, locus specific, 
abundant and widely distributed across the 
genome. Microsatellite analysis requires low 
amount of DNA and can be easily automated for 
high-throughput screening. They are highly 
transferable between populations, and the data 
can be exchanged easily across laboratories. It is 
necessary to have DNA sequence information to 
develop SSR markers. In the past, the develop- 
ment costs of SSRs were high due to genomic or 
cDNA library construction and sequencing. With 
the advent of high-throughput sequencing tech- 
niques, it is possible now to develop microsatel- 
lite markers for any organism with little 
investment in time and money. 


4.2.4.6 Single Nucleotide 

Polymorphism (SNP) 
A SNP is a single nucleotide base difference 
between DNA sequences of individuals, which 
occur within coding/non-coding regions of genes 
or in the intergenic regions. They are numerous 
and occur at different frequencies in different 


regions of the genome. SNP becomes the most 
preferred marker system nowadays due to the 
abundance (two to three polymorphic sites every 
kilobase), high level of polymorphism, high- 
throughput capability and cost-effectiveness. The 
SNP markers may be a better choice for the crop 
species especially legumes and oilseeds, wherein 
the microsatellite diversity appears to be very 
low. Even though the possibility of using SNP as 
marker was demonstrated in the 1990s, it has 
become popular only in the recent times due to 
the advent of NGS technologies. The NGS tech- 
nologies have drastically increased the speed at 
which the DNA sequence can be generated, while 
reducing the costs by several folds. Thus, the dis- 
covery of SNPs has become very simple and 
inexpensive, but the development of genotyping 
assays for SNPs 1s still hindered by the low con- 
version rate and high cost. Several SNP genotyp- 
ing options are available which ranges from 
simple cleaved amplified polymorphic sequence 
(CAPS) analysis to very high-throughput, array- 
based methods. The choice of the method largely 
depends on the number of SNPs to be used for 
genotyping. A detailed description on several 
methods of SNP typing has been provided by 
Bagge and Liibberstedt (2008). 


4.2.4.7 Gene-Targeted and Functional 
Markers 

The markers described above may belong to 
either transcribed or non-transcribed regions of 
the genome and are generally developed without 
the prior knowledge of their function. Hence, they 
are called ‘anonymous’ or ‘random markers’. 
Advancement in genomic studies now provides 
opportunities to specifically target genic regions 
of the genome to develop markers, which are 
termed ‘gene-targeted markers’. Examples of 
such markers include IRAP, REMAP, SRAP, 
TRAP, PAPP, CDDP, SCoT, iPBS and ISAP 
(Table 4.1). Furthermore, when DNA sequence 
information from fully characterized genes is 
available, markers can be developed from 
polymorphic sites within such genes causing the 
phenotypic variation. These markers are called 
‘functional markers’. The Functional markers 
possess a specific advantage that it can be used in 
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the segregating population without the issue of 
recombination. The candidate genes for plant 
height (Dwarfs), plant stature (tb/), flowering 
time (DwarfS), vernalization requirement (VF'R2), 
fruit size (fw2.2) and grain quality (GBSS) have 
been used to develop functional markers in rice, 
maize, tomato and rapeseed. The potentials and 
prospects of gene-targeted and functional markers 
have been reviewed by Andersen and Lubberstedt 
(2003) and Poczai et al. (2013). 


4.2.4.8 Genotyping by Sequencing 
Historically, genotyping was done by surveying 
only a portion of the genome using any of the 
marker technologies. Due to reduction in the cost 
of whole genome sequencing, newer approaches 
have been developed to use sequencing itself for 
routine genotyping for genetic studies and breed- 
ing applications. These approaches are called 
‘genotyping by sequencing (GBS)’. In this 
method, a reduced representation library is devel- 
oped from the genomic DNA of each sample and 
sequenced using any one of the NGS platform. 
The variants such as SNPs and InDels are scored 
across the genome among the samples. The GBS 
procedure has been demonstrated with maize and 
barley recombinant inbred populations, where 
roughly 200,000 and 25,000 sequence tags were 
mapped, respectively (Elshire et al. 2011). 


4.2.5 Choice of Markers 


In general, there is no perfect marker. Markers 
vary in their attributes such as abundance, 
genomic distribution, level of polymorphism, 
basis of polymorphism, technical requirements, 
cost, etc., and the choice depends mainly on the 
application. Suitability of different markers for 
selected applications is given in Table 4.2. Other 
factors influencing the selection of markers 
include accessibility, technical expertise, turn- 
around time and level of polymorphism in the 
study material, DNA quantity and quality require- 
ment, transferability between laboratories and 
populations and cost. 
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4.3. Trait Mapping 

Trait mapping has been the foremost application 
of molecular markers in plant breeding (Paterson 
1996). It refers to determining number, genomic 
locations and effects of genes associated with tar- 
get traits (qualitative or quantitative) using 
molecular markers either through linkage or 
association analysis. The information on genes 
and the linked markers would help breeders to 
design suitable strategies for marker-assisted 
breeding. 


4.3.1 Mapping Major Genes 

Traditionally, when classical genetics analysis 
suggests that the trait is under the control of 
major genes, a Series of near isogenic lines (NILs) 
carrying different genes are created by repeated 
backcrossing with a common recurrent parent. 
Subsequently, the NILs and the recurrent parent 
are analysed for polymorphic markers. The link- 
age of polymorphic markers with the target gene 
is established using segregating populations 
(Yoshimura et al. 1993). But the construction of 
NIL is time-consuming and tedious. However, 
this limitation has been overcome by an alterna- 
tive strategy called bulked segregant analysis 
(BSA) developed by Michelmore et al. (1991). 
The BSA involves the use of phenotypic extremes 
of a segregating population, instead of NIL, for 
tagging genes with the following steps: develop- 
ment of segregating populations (F, or F3) by 
crossing two parents that differ for target trait (for 
instance, resistant or susceptible), identification 
of polymorphic markers between parents, pheno- 
typing of segregating populations and identifica- 
tion of phenotypic extremes (resistant or 
susceptible progenies), making of DNA bulks of 
phenotypic extremes (usually about ten DNA 
samples for each phenotype category) and com- 
parison of polymorphic markers between DNA 
bulks of phenotypic extremes along with the 
DNA of parents. The marker alleles that are 
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Table 4.2 Suitability of popular markers for different applications 


Applications 
Genetic diversity analysis 


Comparative mapping 
and synteny analysis 


Genetic mapping 


Hybrid confirmation 


Marker-assisted selection 


Genomic selection 


Markers 
RAPD 
RFLP 
AFLP 


ISSR 


SSR 


SNP 


RAPD 
RFLP 


AFLP 
ISSR 
SSR 
SNP 
RAPD 
RFLP 
AFLP 
ISSR 
SSR 


SNP 
RAPD 
RFLP 
AFLP 
ISSR 
SSR 
SNP 
RAPD 
RFLP 
AFLP 
ISSR 
SSR 


SNP 
RAPD 
RFLP 
AFLP 
ISSR 
SSR 
SNP 


Suitability 
Suitable, easy and inexpensive system 
Suitable but expensive and low throughput for this application 


Provide more data point per run and ideal for this application but slightly 
expensive and nonrandom in its genomic distribution in some species 


Provide more data point per run and easy to perform but may be 
nonrandom in its genomic distribution 


Ideal because of its hyper-variability and multi-allelic nature but would 
be low throughput unless automated and less polymorphic in certain 
species 

Highly suitable because it can survey the entire genome, but assay 
development is expensive 

Not suitable due to its randomness 


Highly suitable because conserved gene sequences could be used as 
probes 


Not suitable 

Not suitable 

A fraction of the markers might be useful 

Not suitable 

Random and not locus specific, not suitable 

Suitable but tedious and low throughput 

Not ideal as they are not locus specific and dominant 
Not ideal as they are not locus specific and dominant 


Highly suitable because they are locus specific and codominant and have 
good genomic distribution 


Highly suitable because of its abundance and codominant nature 
Easy and inexpensive system, if stringent PCR conditions are maintained 
Expensive and cumbersome process 

Expensive and cumbersome process 

Suitable and easy system if agarose gel is used for detection 
Highly suitable 

Expensive unless low-cost assays like KASP are used 

The target band is to be converted to SCAR 

Not suitable due to tedious process and low throughput 

Not suitable 

Not suitable as tracking is difficult 


Highly suitable because of its codominant nature and transferability 
across labs 


Highly suitable when low-cost genotyping assays like KASP are used 
Not suitable in terms of reproducibility 

Not suitable in terms of throughput and number 

Not highly suitable in terms of throughput and tracking 

Not highly suitable in terms of throughput and tracking 

Not highly suitable in terms of throughput 

Highly suitable in terms of number and distribution 
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detected in the resistant parent as well as in the 
resistant DNA bulk are further tested on the large 
segregating population for its linkage with the 
resistance gene. The putative location of target 
gene can be determined by mapping the linked 
marker on a reference linkage map. 


4.3.2 Mapping Polygenes/QTLs 


‘QTL is an acronym for ‘quantitative trait loci’. 
Most of the agronomically important traits are 
quantitative in nature and are controlled by mul- 
tiple genes with variable effects. The ‘QTLs’ are 
genomic regions that are associated with a quan- 
titative trait, which may be found on different 
chromosomes. The process of locating QTLs on 
genome using linked genetic markers is called 
‘QTL mapping’. This can be achieved by statisti- 
cal methods that evaluate the association between 
marker genotype and the variation in trait value 
(phenotype) of individuals of a population. QTLs 
are detected based on the co-segregation of 
marker loci with the phenotypic trait. The co- 
segregation between the marker genotype and the 
trait (phenotype) 1s expected, if the marker loci 
and the genes responsible for the trait are in link- 
age disequilibrium. The earliest report of QTL 
analysis dates back to the 1920s. Sax (1923) pre- 
sented the first evidence for linkage between a 
morphological marker (seed colour or colour pat- 
terns) and a quantitative trait (seed size) in 
Phaseolus vulgaris. Later, Thoday (1961) pre- 
sented the methods for locating QTL on 
chromosomes based on their linkage relation- 
ships to Mendelian marker loci. He postulated 
that the process of QTL identification is limited 
only by the non-availability of sufficient number 
of genetic markers. During the last couple of 
decades, the limitation on number of markers has 
been overcome by the advances in molecular 
biology. At present, it is possible to find and use 
millions of markers for any species. Hence, the 
probability of finding the linkage between the 
genes and the marker loci is very high. Because 
of this opportunity, several attempts have been 
made to identify QTL in many organisms espe- 
cially human, animals and plants. QTL mapping 
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has enormous applications in crop improvement 
(Paterson 1998; Nelson et al. 2013). The immedi- 
ate utility of QTL analysis is to know the genetic 
architecture of a trait based on the number, 
genomic positions and effects of QTLs that 
explain the variation in the phenotypic trait. It 
may also indicate if the trait is controlled by 
many genes of small effect or by a few genes of 
large effect. Once the association between the 
genomic region(s) and the trait is established, the 
candidate genes underlying the trait can be iden- 
tified by high-resolution mapping followed by 
sequencing the target region(s). 


4.3.2.1 Steps in QTL Mapping 

A typical QTL mapping experiment involves the 

following major steps: 

1. Developing/assembling a mapping population 

2. Genotyping the mapping population using 
molecular markers 

3. Construction of linkage map for linkage- 
based QTL analysis 

4. Phenotyping the mapping population 

5. Statistical analysis of genotypic and pheno- 
typic data 


4.3.2.1.1 Mapping Population 

The genetic material used for QTL mapping 
could be a set of either related or unrelated indi- 
viduals, but the individuals should widely differ 
for the trait under study. The populations used for 
QTL analysis can be broadly classified into two 
categories, viz. (1) populations derived from con- 
trolled crosses of two or more parents and (11) 
natural population. 

Several types of populations can be created by 
crossing two or more parents in a defined fashion. 
Among them, the populations produced from two 
parents (biparental mapping populations) that 
include F,s, backcross population (BC), recombi- 
nant inbred lines (RIL), doubled haploids (DH), 
test cross, half sibs, full sibs, introgression lines 
(IL) and near isogenic lines (NIL) are widely 
used for QTL mapping. However, biparental pop- 
ulations have inherent limitations that they pro- 
vide low recombination and it is possible to study 
only two alleles. To overcome these limitations, 
multi-parent-based mapping populations such as 
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nested association mapping (NAM) population 
(Yu et al. 2008) and multi-parent advanced 
generation intercross (MAGIC) population 
(Cavanagh et al. 2008) are used in_ the 
recent times. 

A natural population of unrelated individuals 
from the germplasm collection can also be used 
for QTL analysis based on the principle of link- 
age disequilibrium (LD). It is assumed that the 
unrelated individuals in the germplasm may still 
contain very small chromosomal segments that 
are identical by descent, which leads to nonran- 
dom association of alleles across the genome and 
can be captured through genome-wide marker 
analysis, which is popularly called ‘association 
mapping’ or ‘LD mapping’. An unstructured nat- 
ural population is more ideal for association 
mapping because the presence of subpopulations 
can result in spurious marker-trait associations. 
However, the concern of population structure or 
family relatedness has been overcome with the 
recently developed statistical methods. 

The genetic model for different populations 
are based on the genotype constitution of the 
population. Each of the above-mentioned 
populations has their own advantages and 
disadvantages. Large number of markers may be 
needed to identify the QTL in natural population 
compared to population from controlled crosses 
due to high LD decay in natural populations. 
However, the accuracy and mapping resolution 
will be high in natural population. The early gen- 
eration populations such as F, or F>.3 provide the 
opportunity to estimate the effect of additive and 
dominance gene actions at specific locus, whereas 
only additive gene action can be studied in homo- 
zygous RILs and DH. However, RIL and DH 
being immortal populations, they could be used 
in replicated trials across locations and seasons. 
Hence, the traits that are highly influenced by 
environment can be better studied using RIL. 

Therefore, the choice of population should be 
based on the nature of trait, timeframe of the 
project as well as the availability of resources. 
The size of the population depends on the type of 
population, nature of the trait and objective of the 
study. Several simulation and empirical studies 
indicated that about 500 individuals are required 


for mapping QTL of small effects, whereas 
population of 200-300 individuals would be 
sufficient for identifying QTL with large effects. 


4.3.2.1.2 Genotyping of Mapping 
Populations 

Any type of markers can be used for QTL analy- 
sis. The choice of markers depends on the avail- 
ability of the marker system in the species, ease 
with which the genotyping can be performed and 
the available budget. For QTL identification 
through linkage analysis, it may be sufficient to 
have a marker at every 10 cM interval (Kearsey 
1998). The minimum number of markers for 
genome-wide association analysis mainly 
depends on the extent of LD in the species 
(Kruglyak 1999) and the specific population 
under study. For example, it has been estimated 
that about 275,000 markers are required for 
genome-wide association analysis in sorghum, 
whereas just 2,000 markers would be sufficient 
for Arabidopsis. Even for the same species, the 
required number of markers differs based on the 
level of recombination. For example, association 
analysis in a collection of maize landraces would 
require 750,000 markers, whereas only 50,000 
markers might be sufficient, if the population 
consists of only elite inbreds of maize. For candi- 
date gene-based analysis, only markers in the 
particular genomic regions, viz. promoters, 
introns, exons and 5’/3’ UTRs, would be required. 


4.3.2.2 Phenotyping of Mapping 
Populations 

The accurate and precise measurement of the 
trait is the critical step in QTL analysis. The suc- 
cess of any QTL mapping greatly relies on effi- 
cient and effective phenotyping of large 
population. Measuring the quantitative traits is a 
cumbersome process because the genetic archi- 
tecture of these traits often involves many loci of 
small effect that may interact with each other as 
well as with the environment. The trait must be 
evaluated in replicated trials under different envi- 
ronments using a reliable screening methodol- 
ogy. If possible, each trait is to be dissected into 
component traits, and more objective measure- 
ments are to be taken. There is concern that 
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several hundred QTLs have been reported in pub- 
lication, but only a handful of them are in use. 
One reason for this situation could be the report- 
ing of false-positive QTLs detected using flawed 
phenotypic screens. The traits may not be ame- 
nable for QTL mapping if proper methods for 
quantification are not available. 


4.3.2.2.1 QTL Detection Through 
Statistical Analysis 

QTL mapping heavily relies on the statistical 
model used in the analysis (Lander and Botstein 
1989; Churchill and Doerge 1994; Zeng 1994; 
Doerge et al. 1997; Zeng et al. 1999; Barton and 
Keightley 2002; Zou and Zuo 2006). Different 
analytical methods have been evolved, but the 
use of a particular procedure largely depends on 
the genetic architecture of the trait and the 
genomic information available at the point of 
time. The QTLs could be of ‘main-effect QTLs’ 
or ‘epistatic QTLs’ based largely on the presence 
or absence of epistasis. Main-effect QTLs are the 
ones showing clear additive or dominant effect 
on allelic substitution and are denoted by the spe- 
cific allele of the locus. Such QTLs could be 
identified by single marker analysis or through 
‘interval mapping’, if linkage information 1s 
available. The epistatic QTLs are detected by 
analysing the interactions between alleles at two 
or more loci through epistatic models. 


Single Marker Analysis 

In this method, the individuals are grouped based 
on the marker genotypes. The difference in the 
phenotypic value among the group is statistically 
tested (t-test, ANOVA, regression) (Haley and 
Knott 1992). A significant difference in the group 
means indicates the possible linkage/association 
of the marker (used to group the individuals) to 
the trait. If the backcross population is used, the 
individuals can be grouped into two classes based 
on the marker genotypes. Then, the difference 
among the group means can be tested by simple 
t-test. If F, or other early generation intercross 
population is used, the number of genotypic 
groups will be more than two. In that case, 
ANOVA can be used to assess the significance in 
the mean difference. Single marker analysis is the 
simplest form of QTL analysis. It can be per- 
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formed even when there is no linkage map avail- 
able. However, this method has limitations that 
QTL location cannot be determined and their 
effects may be underestimated. 


Interval Mapping 

Linkage map is essential for this analysis. A QTL 
is assumed in the region flanked by two linked 
markers and tested for its likelihood based on the 
phenotypic variance. The likelihood is calculated 
for QTL position and its effect, based on the 
observed phenotypic data and marker genotypes. 
The significance is tested using a likelihood ratio 
test. Likelihood for a single putative QTL is 
assessed at each location on the genome. The 
limitation in this approach is that QTLs located 
elsewhere on the genome can have an interfering 
effect compromising the power of detection and 
accuracy of effects (Jansen and Stam 1994), 


Composite Interval Mapping (CIM) 

This is the most widely used method. In this 
method, interval mapping is performed using a 
subset of marker loci as covariates. These mark- 
ers serve as proxies for other QTLs to increase 
the resolution of interval mapping, by accounting 
for linked QTLs and reducing the residual varia- 
tion. The key problem with CIM concerns the 
choice of suitable marker loci to serve as 
covariates. 


Multiple Interval Mapping (MIM) 

This method uses multiple marker intervals 
simultaneously. QTL models are built using all 
the QTLs _ identified initially through 
CIM. Alternative models are evaluated based on 
the information content and the best fit is chosen. 
This method allows simultaneous detection and 
estimation of additive, dominance and epistatic 
effects (Kao et al.1999). 


LD-Based Association Analysis 

There are two major strategies for mapping QTL 
using association analysis: (1) genome-wide asso- 
ciation analysis and (11) candidate gene approach. 
Genome-wide association analysis uses anony- 
mous markers from across the genome. In this 
approach, no assumptions are made as to what 
the genes might be, and markers are chosen sim- 
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ply to scan the whole genome for marker-trait 
association. Genome-wide approach requires a 
large number of markers for genotyping the sam- 
ples. Hence, it is difficult to apply for crops with 
a large genome or with limited genomic resources. 

In the candidate gene approach, a list of genes 
presumed to be involved in the trait is assembled 
and markers are chosen within those genes. The 
focus is the search for alleles at these candidate 
genes that result in variation in the trait of inter- 
est. Here, association mapping can be used as a 
validation method based on candidate genes or 
markers linked to QTL previously detected by 
classical methods or to mine novel and superior 
alleles from unexplored genetic materials. 


Software 

There are several software available for QTL map- 
ping. The most widely used software include 
Mapmaker-QTL, QTL Cartographer, Win-QTL 
Cartographer, PLABQTL, R/QTL, MapQTL, 
QGene, MQTL, MapManager/QTX and 
QTLNetwork. Among these, Mapmaker-QTL, 
QTL Cartographer, Win-QTL Cartographer, 
PLABQTL and R/QTL are command-line soft- 
ware. Except MapQTL, all others are free software. 
Mapmaker-QTL is the oldest software and can per- 
form only simple interval mapping. QTL 
Cartographer is the most versatile software and has 
options for performing most of the reported QTL 
mapping methods. R/QTL 1s a very flexible com- 
mand-line software that offers Bayesian (hidden 
Markov model) method of QTL analysis. There are 
specific software for QTL analysis in nonconven- 
tional populations. The software ‘HAPPY’ per- 
forms Multipoint QTL Mapping in genetically 
heterogeneous stocks. TASSEL is the most widely 
used software for marker-trait association in natural 
population (Bradbury et al. 2007). 


4.3.3 New Developments in QTL 
Mapping 


Now, the QTL mapping paradigm has extended 
beyond the trait measured at whole organism 
level. The level of expression of the gene as the 
amount of RNA transcript or the amount of 


protein produced is considered as quantitative 
traits (Damerval et al. 1994). The gene expression 
phenotypes are mapped to particular genomic loci 
by combining microarray or next-generation 
sequencing analyses of expression profiles and 
conventional QTL mapping of the same segregat- 
ing population. This approach is called expression 
QTL (eQTL) mapping (Michaelson et al. 2009). 
The ‘eQTL mapping’ is used to determine the 
genomic regions involved in the regulation of 
transcription and to study the impact of polymor- 
phisms within these regions. Several eQTL stud- 
ies have been reported in crops like eucalyptus 
(lignin-related genes), rice (stress-regulated gene 
expression with QTL for osmotic adjustment) and 
maize (degree of heterosis). 


4.3.4 Caveats in QTL Analysis 


The possibility of detecting the QTL is largely 
dependent on the heritability of the trait being 
mapped, the precision with which the trait can be 
measured and the size and type of the mapping 
population. Accurate phenotyping of a large pop- 
ulation is crucial but seldom achieved. Even 
though the number of markers is rarely a limiting 
factor in QTL mapping experiments nowadays, 
identification of QTLs located in certain regions 
like centromere would still be difficult due to 
recombination suppression in those regions. The 
QTLs identified in a particular cross may be 
missing in a different cross. Even the same cross 
may reveal different QTLs, if the measurements 
are done in different environments. At first, QTL 
iS a mere Statistical entity. It will not be useful, 
unless a biological meaning is associated with 
the QTL. But, our ability to move from QTL to 
gene is not an easy task. The QTL 1s a large chro- 
mosomal segment potentially harbouring several 
genes, and all of them may not have connection 
to the observed phenotype. Pinpointing the gene 
responsible for the trait will be a time- and cost- 
intensive process. It is to be noted that the bio- 
logical processes are oversimplified in QTL 
analyses (Arunachalam and Chandrashekaran 
1993). The current statistical models do not take 
into account allelic heterogeneity and epigenetic 
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inheritance. Epitasis is considered only as the 
portion of the genetic variance that cannot be 
explained by the marginal effects of individual 
loci in the hierarchical genetic effect model. 
Irrespective of the shortfalls, there are several 
success stories of QTL mapping, as reported by 
Price (2006). The advent and rapid evolution of 
genomic technologies provides tremendous 
opportunity for QTL mapping. However, QTL 
mapping should be performed with prudence, 
and it is not a simple click and play with soft- 
ware, as perceived by many researchers. 


4.4 Marker-Assisted Selection 

Plant breeders always look for an easy, rapid and 
reliable method of selection of desirable plants in 
breeding populations. Conventionally, desirable 
plants are selected based on phenotypic observa- 
tions. The phenotypic selection has long been suc- 
cessful for qualitative and simply inherited traits. 
However, in most cases, the agronomic traits are 
quantitative, controlled by polygenes and influ- 
enced by environment, which makes the selection 
difficult, unpredictable and challenging. DNA 
markers provide alternative selection tool for effi- 
cient selection of desirable genotypes in a breed- 
ing programme. Marker-assisted selection (MAS) 
refers to selection of desirable plants based on the 
presence or absence of a DNA marker that is asso- 
ciated or linked to the gene contributing for the 
target trait. During the last three decades, enor- 
mous progress has been made in genetic mapping 
and identification of molecular markers for selec- 
tion of genes/QTLs governing agronomic traits in 
crop plants. The MAS provides an excellent 
opportunity for plant breeders to have several 
ways to improve the efficiency and genetic gains 
in their breeding programmes (Collard and Mackill 
2008; Moose and Mumm 2008). 


When Is MAS Preferred 
in Breeding? 


4.4.1 


The MAS is preferred in breeding to address the 
following critical issues that are often encoun- 
tered by the plant breeders. 
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4.4.1.1 Proxy to Phenotypic Selection 

The MAS improves the effectiveness and reli- 
ability of selection when the phenotyping is par- 
ticularly expensive, laborious, time-consuming, 
destructive and growth stage and environment 
dependent. For instance, genotypes with high- 
oleic trait are selected by analysing seeds using 
biochemical methods such as gas chromatogra- 
phy. Though highly accurate, this method is labo- 
rious, time consuming and destructive. But 
marker assays can be carried out using DNA 
from young seedlings to predict high-oleic trait 
before harvesting of the seeds, and also pheno- 
type of F, or F, plants can be predicted, while 
analysis of individual seeds is not possible by the 
conventional biochemical methods. However, it 
is important to confirm the phenotype of the pre- 
dicted plants at every stage of the breeding pro- 
gramme. Therefore, it must be kept in mind that 
MAS is only a surrogate method and not a substi- 
tute for phenotypic selection. 


4.4.1.2 Shortening the Breeding Cycle 

The MAS helps the breeders to select plants 
carrying desirable genes at heterozygous state 
itself; therefore, selfing can be avoided and a sea- 
son can be saved. This is especially important for 
traits controlled by recessive genes as well as 
crops that are highly season bound. Conventional 
backcrossing will not give any clue about the 
genome status of the backcross progenies. Based 
on a simulation study, Frisch et al. (1999a) 
reported that ~1 % of the backcross progenies 
may have the maximum of recurrent parent 
genome. When MAS is applied, those progenies 
(with ~97 % of recurrent parent genome) can be 
identified with one or two generations fewer than 
that would be required in_ conventional 
backcrossing. 


4.4.1.3 Pyramiding of Multiple Genes 

It has been possible to transfer only one or two 
genes into a single cultivar background through 
conventional breeding methods, especially for 
simply inherited traits as well as in some cases 
for pest and disease resistance owing to the avail- 
ability of pathotypes or biotypes (Khush and Brar 
1991). Those cultivars that carried one or two 
genes succumbed to pest pressure soon after the 
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release for the cultivation. Therefore, pyramiding 
of more genes in the background of cultivars is 
warranted to improve the durability of resistance. 
Pyramiding of multiple genes is almost impossi- 
ble if pathotype or biotype information is not 
available because resistance genes cannot be dif- 
ferentiated at phenotype level. MAS will be 
extremely useful for gene pyramiding when the 
information on the genes and the linked markers 
are available for the donor genotypes. 


4.4.1.4 Breaking Undesirable Linkages 

Sometimes, ‘linkage drag’ and ‘repulsion link- 
ages’ may cause serious hurdles for finding desir- 
able recombinants in a breeding programme. 
Linkage drag refers to the undesirable effects that 
come along with the target trait while introgress- 
ing genes from wild species, which happens 
when genes for undesirable traits are closely 
linked with the target gene. In backcrossing pro- 
gramme, it is possible that a large segment of 
wild chromosome is fixed in the plant selections. 
For instance, 7y/ and Ty3 genes conferring resis- 
tance to tomato yellow leaf curl disease are part 
of a large region (~32 cM) from Solanum 
chilense, and the lines homozygous for this 
region may show deleterious effects (Verlaan 
et al. 2011). Based on a simulation study, Frisch 
et al. (1999b) reported that even after 20 rounds 
of backcrossing, the expected linkage drag 
around the target locus would be around 10 cM, 
when assumed that the target locus is at the centre 
of 100 cM chromosome. Since MAS allows 
selection of plants which are heterozygous for 
target gene region, it is possible to have recombi- 
nation in the region so that size of the linkage 
drag can be reduced further. A repulsion linkage 
refers to linkage of desirable allele of a target 
gene with undesirable allele of another target 
gene. For example, when resistance allele of a 
gene ‘A’ is closely linked with susceptible allele 
of a gene ‘B’, during the selection of resistance 
allele ‘A’, the susceptible allele “b’ also gets 
selected automatically. Tomato breeders have 
observed that the 7y//Ty3 alleles are linked to 
susceptible allele mi-/ of root-knot nematode 
(Meloidogyne incognita) resistance gene Mi-/ 
and 7Jy-2 linked to susceptible allele i-2 of the 


Fusarium wilt resistance gene /-2 (Peter Hanson, 
AVRDC-The World Vegetable Center, Personal 
Communication, 2010). Crossovers between the 
alleles linked in repulsion phases are rare but do 
occur; hence, by genotyping a large population, 
the desirable recombinants can be obtained. 


4.4.1.5 Rapid and Cost-Effective 
Selection 

Plant breeding is a number game. The probability 
of finding an ideal plant improves when a large 
number of segregating populations are generated 
and phenotyped rigorously. When ‘genotypic 
information’ of the parental lines or progenies 
are available to breeders before hand, the unde- 
sirable progenies can be rejected timely, and only 
a small number of useful progenies can be fol- 
lowed up, which can save substantial amount of 
time and cost of a breeding programme. For 
instance, a detailed economic analysis by Kuchel 
et al. (2005) on the use of MAS in an Australian 
wheat breeding programme showed that MAS 
enhanced the genetic gain over phenotypic selec- 
tion as well as reduced the overall cost by 40 %. 
It is also important to note that MAS methods for 
conversion of maize inbred lines for a single 
dominant gene at International Maize and Wheat 
Improvement Center (CIMMYT) were not par- 
ticularly advantageous over phenotypic selection 
in terms of cost-effectiveness (Morris et al. 2003). 
Phenotypic selection was less expensive but slow, 
whereas MAS was expensive but rapid. Therefore, 
according to Morris et al. (2003), MAS will be 
the best breeding method when the operating 
capital is abundantly available; otherwise, the 
conventional phenotypic selection will be the 
best option. 


4.4.2 Strategies of MAS 


Traditional breeding methods such as_back- 
cross, pedigree and recurrent selection can be 
improved with the integration of MAS. With the 
advent of genomic technologies, genome-wide 
MAS strategies are also emerging. The com- 
monly known MAS sstrategies are briefly 
described below. 
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4.4.2.1 Marker-Assisted Backcrossing 
(MABC) 

Backcrossing 1s a commonly used breeding 
method to incorporate one or a few major genes 
into an adapted or elite variety. Conventionally, 
the elite variety (recurrent parent) will be crossed 
with donor parent for the target trait. The back- 
cross progenies will be screened phenotypically 
for the target trait, and a few promising progenies 
will be selected and crossed back to recurrent 
parent. A few backcrosses will recover the elite 
variety with the desirable trait transferred or 
introgressed from the donor parent. The whole 
process can be efficiently managed if molecular 
markers are integrated with backcross breeding 
programme. Collard and Mackill (2008) have 
outlined the strategy of MABC, which is 
described briefly here. 

1. Foreground selection (stage I) — refers to 
selection of a subset of plants that carry desir- 
able allele from the donor parent. This will be 
done by genotyping of BC,F, plants for target 
gene or QTL using linked marker. A single 
marker or flanking markers that are tightly 
linked to the target locus can be used in 
foreground selection. The use of single marker 
may be risky because the recombination 
between the marker and the target locus may 
lead to wrong selection of allele. Therefore, 
the use of flanking markers may reduce the 
probability of losing the target alleles. 

2. Recombinant selection (stage I) — refers to 
further selection among those of BC;F, plants, 
which show heterozygosity in the target gene 
or QTL region. The BC,F;s that have fixed 
alleles may not have recombination in that 
region so that the size of the introgression can- 
not be reduced. Hence, the BC,F, progeny 
that is heterozygous for the target locus will 
be selected. 

3. Background selection (stage HI) — refers to 
final selection of BC,F,s that carry maximum 
proportion of recurrent parent genome. This 
will be done by genotyping the selected back- 
cross progenies using genome-wide markers. 
The above steps may be repeated for two or 

three times till an ideal backcross plant, which 

carries the target locus in heterozygous genomic 
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region and having the maximum of recurrent par- 
ent genome, is recovered. The selected backcross 
plants may be selfed for two or more generations 
to fix the agronomic traits before taking up 
field trials. 


4.4.2.2 Marker-Assisted Pedigree 
Selection (MAPS) 

Pedigree method of breeding involves combining 
of multiple desirable genes from two elite parents 
for the development of improved breeding mate- 
rials and/or new cultivars. When the number of 
genes to be pyramided is many, or the genes are 
tightly linked in repulsion, it 1s practically impos- 
sible to recover the desirable recombinants in a 
single generation using a realistic population 
size; therefore, repeated selection at several gen- 
erations is required. The availability of markers 
tightly linked to the desirable genes makes it pos- 
sible to conduct effective individual selection at 
early generations (F, or F;). This reduces the 
number of lines tested in the later generations and 
increases the desirable genotype frequency in the 
selected progeny (Ye and Smith 2008). 


4.4.2.3 Marker-Assisted Recurrent 
Selection (MARS) 
Recurrent selection is a cyclic breeding system 
for improvement of complex traits, wherein 
superior genotypes are identified and intermated 
to accumulate a large number of favourable 
alleles (Hallauer 1992). MARS uses QTL-linked 
markers to identify ideal genotypes for crossing 
in every selection cycle; thereby, favourable QTL 
alleles from diverse parental sources are accumu- 
lated within a short span of time (Peleman and 
Van Der Voort 2003; Ribaut and Ragot 2007). 
MARS programme may involve at least two 
or more superior parents. A MARS scheme with 
two superior parents aims to identify the most 
favourable QTLs from two parents and accumu- 
late them in a single genotype. Ribaut et al. 
(2010) provide a detailed scheme of a biparental 
MARS programme, which is briefly described 
below. The two selected parents will be crossed, 
and F;-derived populations will be developed and 
advanced to F, generation through single seed 
descent method. The F; plants will be genotyped 
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using molecular markers, and the F, progenies 
will be phenotyped for target traits at multiple 
locations. QTL analysis will be carried out based 
on the F; genotypic data and F, phenotypic data, 
and a set of key QTLs will be identified. A subset 
of F; progenies will be chosen based on the QTL 
information and the phenotypic performance. 
Their corresponding F, plants will be grown and 
genotyped using the nearest marker to the QTL 
peak or flanking markers to select the desirable 
plants for recombination cycle. For example, if 
eight progenies are chosen, they will be crossed 
as pairs (first recombination cycle), the F,s will 
be intercrossed as pairs (second recombination 
cycle) and the final two F;s (third recombination 
cycle) will be subsequently selfed for two or 
more generations to fix the desirable alleles. The 
resulting lines will be genotyped using QTL- 
linked markers, and ideal genotypes that carry 
the most of the desirable QTLs will be identified. 
When multiple sources of parental lines are 
involved in MARS, the source parents will be 
crossed as pairs (several biparental combina- 
tions), F,s will be intercrossed as pairs and the 
final two F,s will be used to start the first 
selection cycle. Compared to _ biparental 
MARS, multi-parental MARS scheme will save 
time and resources and permit rapid accumula- 
tion of favourable alleles from diverse parental 
lines at once. 

During the MARS process, the selection of an 
individual will be based on a molecular score or 
selection index, which is derived from genotypic 
and phenotypic data of the individual. In case a 
single marker is used for selection, the molecular 
score of an individual for use in recurrent selec- 
tion is obtained as the estimate of the statistical 
association between marker genotype and pheno- 
type. In cases where multiple markers are used in 
selection, genotype effects of all the markers are 
summed up to a single molecular score. It will be 
ideal to consider both molecular score and phe- 
notypic information of the individuals to obtain 
maximum genetic improvement (Dekkers and 
Hospital 2002; Eathington et al. 2007). 


4.4.2.4 Genomic or Genome-Wide 
Selection (GWS) 

Genomic selection (GS) or genome-wide selec- 
tion (GWS) is considered a new strategy of MAS 
for plant breeding (Jannink et al. 2010; Nakaya 
and Isobe 2012). In GS, selection of desirable 
individuals will be based on genomic estimated 
breeding values (GEBVs). Conventionally, 
breeding value is predicted using phenotypic data 
from family pedigrees. GS uses an innovative 
method for the estimation of breeding value 
based on genome-wide dense DNA markers. In 
the first step, significant relationships between 
phenotypes and genotypes are predicted using a 
subset of a population (called training popula- 
tion). In the second step, breeding population is 
genotyped using the informative markers, GEB V 
for each individual is derived by summing the 
marker effects and desirable plants are selected 
based on GEBVs. Genomic selection is expected 
to address small-effect genes that cannot be cap- 
tured by traditional MAS. 


4.4.3 Critical Considerations 
for an Effective MAS 
Programme 


The following issues must be kept in mind before 
embarking on a MAS programme. 


4.4.3.1 Choice of MAS Strategy 
and Statistical Issues 

A suitable strategy of MAS must be determined 
based on the goals of a breeding programme, 
heritability of a trait, the number of genes gov- 
erning a trait, strength of linkage between target 
locus and marker, population size and availability 
of resources. Based on _ simulation studies, 
Visscher et al. (1996), Frisch et al. (1999a, b) and 
Prigge et al. (2009) provide the theoretical pos- 
sibilities for the minimum marker distance, 
marker density and population size required 
for selecting at least one desirable individual 
when a single gene is transferred through 
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MABC. According to Frisch et al. (1999b), if the 
goal is to select a BC,F, plant that is heterozy- 
gous for a target gene and homozygous for recur- 
rent parent alleles at both flanking markers with 
99 % probability, minimum number of BC,F, 
plants required would be about 4,066 when the 
target locus is at the distance of 5 cM from the 
flanking markers. In case if the goal is to select a 
BC,F, plant that is heterozygous for a target gene 
and homozygous for recurrent parent allele at 
one flanking marker and heterozygous at the 
other with 99 % probability, the minimum num- 
ber of BC,F, plants required would be about 100 
when the target locus is at the distance of 5 cM 
from the flanking markers. Ishii and Yonezawa 
(2007) have outlined various schemes for pyra- 
miding two or more genes. For instance, one of 
the schemes involves crossing of multiple donors 
(~4) with a single recipient parent followed by 
separate backcrosses. MAS will be applied at 
each backcross generation to select the plants 
carrying target locus (heterozygous state) for fur- 
ther backcrossing. After three rounds of back- 
crossing, crosses between selected plants will be 
made so that the final population is expected to 
contain a genotype having all genes in homozy- 
gous condition. Bernardo and Charcosset (2006) 
and Bernardo and Yu (2007) discuss several 
issues concerning MARS and GWS for breeding 
applications based on simulation studies. 


4.4.3.2 Genotyping Platforms for MAS 

Simple, cost-effective and breeder-friendly geno- 
typing platforms must be available depending 
upon the type of markers, the scale of a MAS 
programme and the resources. Gel-based geno- 
typing platform would be enough and effective 
when markers such as sequence-tagged sites 
(STS), sequence characterized amplicon regions 
(SCAR), simple sequence repeats (SSR) and 
cleaved amplified polymorphic sequences 
(CAPS) are used and only a few genes or a few 
hundred samples are handled in a MAS pro- 
gramme. Single nucleotide polymorphism (SNP) 
markers are ideal for achieving high-throughput 
and automation. A variety of SNP genotyping 
assays, viz. high-resolution melting (HRM) 
(Terracciano et al. 2013), KBiosciences 
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Competitive Allele-Specific PCR (KASPar) 
(Luciana Rosso et al. 2011), Golden Gate 
(Thomson et al. 2012), etc., are available. The 
use of high-throughput marker system may 
reduce the cost per genotypic data point (Herzog 
and Frisch 2011). 


4.4.3.3 Phenotyping Platforms for Trait 
Validation of MAS Progenies 

Effects of gene/QTL may be influenced by the 
genetic background (QTL x genotype interac- 
tion) and the environments (QTL x environment 
interaction) where it needs to be introduced. For 
example, Vadez et al. (2011) reported that effects 
of stay-green QTLs on water extraction, transpi- 
ration efficiency and seed yield varied in different 
genetic backgrounds. There are numerous exam- 
ples to suggest that QTL effects vary across envi- 
ronments. Hence, prioritization of genes/QTLs 
for MAS work must be made after a thorough 
validation across genetic backgrounds and target 
environments to avoid false positives. This would 
demand an effective phenotyping network across 
target environments. 


4.4.3.4 Integrated Breeding Platform 

The fact that MAS has been highly successful 
in private sector suggests its efficacy. However, 
Delannay et al. (2011) observed that the public 
sector in the developing countries has very 
limited progress in MAS due to severe con- 
straints, viz. resource-limited breeding pro- 
grammes, shortage of trained manpower, 
inadequate high-throughput capacity, poor 
phenotyping infrastructure, lack of informa- 
tion and data handling systems/tools. Delannay 
et al. (2011) advocate an integrated breeding 
platform that connects the above-listed com- 
ponents for the successful implementation of 
MAS programmes. 


4.4.4 Cultivar Improvement 
Using MAS 


Globally, substantial efforts have been made to 
develop improved cultivars using MAS in public 
as well as private plant breeding programmes 
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(Eathington et al. 2007; Brumlop and Finckh 
2011). The list is exhaustive and a few prominent 
examples are provided below. 


4.4.4.1 Enhancing Yield Potential 
Improvements of per se yield potential in variet- 
ies and hybrids have stagnated in many crops. It 
is envisaged that further improvement in yield 
potential might require integrated approaches 
involving traditional breeding and MAS to select 
a crop ideotype for a given environment (Francia 
et al. 2005). MAS for yield enhancement in a cul- 
tivar has been demonstrated in barley (Schmierer 
et al. 2004). Yield-enhancing QTLs from wild 
species appear promising (Swamy and Sarla 
2008) but are yet to be exploited for cultivar 
development. 


4.4.4.2 Resistance to Biotic Stresses 

In India, popular rice varieties Samba Mahsuri 
and Pusa Basmati | and parental lines of rice 
hybrid PusaRH1O (PRR78 and Pusa6B) have 
been improved for resistance to bacterial blight 
disease through MABC. Samba Mahsuri has 
been pyramided with bacterial blight resistance 
genes, viz. xa5, xal3 and Xa2/ (Sundaram et al. 
2008). Pusa Basmati | has been improved with 
bacterial blight resistance genes, xa/3 and Xa2/, 
along with blast resistance genes, viz. Piz-5, Pi- 
Kh, Pi-1, Pi-ta, Pi-b, Pi-5 and Pi-9. PRR78 and 
Pusa6B have been pyramided with xa/3 and 
Xa2I along with Piz-5 (Prabhu et al. 2009). A 
highly cultivated wheat variety in India, PBW343, 
has been improved for leaf rust resistance by pyr- 
amiding Lr24, Lr28 and Lr48 genes (Prabhu 
et al. 2009). Jairin et al. (2009) improved a Thai 
aromatic rice cultivar ‘Khao Dawk Mali 105’ 
(KDML105) by transferring Bph3 for resistance 
to brown plant hopper (Nilaparvata lugens) from 
a donor variety, Rathu Heenati. Tomato breeding 
lines pyramided with multiple Jy genes, confer- 
ring resistance to tomato yellow leaf curl virus, 
are spearheading private tomato breeding pro- 
grammes in the Asia-Pacific region (Peter 
Hanson, AVRDC-The World Vegetable Center, 
Personal Communication, 2010). 


4.4.4.3 Tolerance to Abiotic Stresses 
Several popular cultivars have been improved for 
abiotic stress tolerance by transfer of major QTLs 
through MABC. For instance, mega rice varieties 
of India, viz. Swarna, Samba Mahsuri, IR64, 
TDK1 and CR1009, have been improved for sub- 
mergence tolerance using the Sub/ QTL (Neeraja 
et al. 2007; Septiningsih et al. 2009). Similarly, 
Chin et al. (2011) improved Indonesian rice vari- 
eties for high yield under phosphorus-deficient 
conditions by transferring Pup/ QTL. In Vietnam, 
Huyen et al. (2012) and Cuc et al. (2012) 
improved Vietnam elite rice variety ASS996 for 
tolerance to salt with Saltol QTL and submer- 
gence with Sub/ QTL. 


4.4.4.4 Enhancement of Quality Traits 
Development of quality protein maize hybrids 
using the opaque-2 gene has been a commend- 
able success of MAS worldwide. In India, Vivek 
Maize Hybrid 9, a popular single-cross hybrid, 
has been improved for its protein quality through 
MAS. The improved QPM hybrid, Vivek Quality 
Protein Maize (QPM) 9, showed 41 % increase in 
tryptophan and 30 % increase in lysine over the 
original hybrid. The Vivek QPM9 has been 
released for cultivation in India during 2008 
(Gupta et al. 2013). In rice, Yi et al. (2009) 
improved cooking quality traits in Myanmar rice 
cultivar Manawthukha by transferring genes for 
fragrance coupled with intermediate level of 
amylase content from the donor variety, Basmati 
370, through MABC. Chu et al. (2011) reported 
improvement of a peanut variety “Tifguard’ for 
high oleic-linoleic (O:L) ratio along with resis- 
tance to root-knot nematode (RKN) through 
MABC. The O:L ratio of the improved “Tifguard’ 
was about 25-30 % compared to <5 % in the 
original “Tifguard’. Molecular markers associ- 
ated with a mutation causing high oleic level 
(~70-75 %) have been derived from the FAD-2 
gene family members in sunflower (Schuppert 
et al. 2006) and safflower (Liu et al. 2013). The 
potential of these markers for development of 
high-oleic cultivars in sunflower and safflower 
remains to be tested. 
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4.5 Conclusions 

Molecular plant breeding has been recognized as 
the foundation for crop improvement in the 
twenty-first century to increase crop production 
(Moose and Mumm 2008; Tester and Langridge 
2010). The MAS is an important tool for breed- 
ing that would certainly empower plant breeders 
to make their breeding programmes highly effi- 
cient and productive. 

Many of the features of the markers like cost, 
time and expertise have changed over time due to 
technological developments. The advent of NGS 
has removed the divide between the so-called 
‘orphan’ and ‘well-bestowed’ crops in terms of 
availability of markers. Using NGS technologies, 
the complete genome or the transcriptome of a 
set of representative genotypes of useful gene 
pool can be sequenced, and genome-wide genetic 
variants such as SSRs, InDels and SNPs can be 
rapidly identified and easily converted into 
genetic markers for any crop. Hence, there are 
multiple options available for the geneticists and 
breeders, and the selection of marker is mainly 
driven by the genetic materials in hand and the 
intended application. 

Among the MAS strategies, MABC has been 
extensively used to improve widely adopted 
mega cultivars in important crops such as rice, 
wheat, maize and tomato particularly for traits 
that are controlled by major genes or major 
QTLs. Information on agronomically important 
genes/QTLs in orphan crops is very much lim- 
ited, which might perhaps change due to the 
advent of new cost-effective, high-throughput 
genomic tools. The challenge of breaking the 
yield plateau would require manipulations of sev- 
eral complex traits, which are low heritable and 
controlled by a large number of minor genes/ 
QTLs. There is a concern that most of the reported 
QTLs are false or not accurate because the num- 
bers of QTLs are plenty but their use in breeding 
is scanty. The accuracy of small QTLs can be 
improved by designing phenotypic screens that 
improve the heritability, improve understanding 
of genotype by environment interaction and 
refine map positions through candidate gene 
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approach (Price 2006; Xu and Crouch 2008). 
MAS for complex traits would warrant extensive 
application of genome-wide MAS strategies such 
as MARS and GWS, which still remain as theo- 
retical possibilities in crop breeding. 
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Abstract 

Haploid plants provide an excellent example of cellular totipotency as they 
develop from either the male or female gametes without fertilization. 
Haploids can be induced in vivo and in vitro. Androgenesis 1s the develop- 
ment of haploids from male gametes. The two main techniques that are 
employed to generate androgenic plants in vitro are (a) anther culture and 
(b) microspore culture. The application of anther culture is widespread. 
Different developmental pathways of microspore embryogenesis have 
been reported in different plant species and in response to different stress 
treatments and its microspore developmental stage. Genotype or physiol- 
ogy of the donor plant, microspore developmental stage, stress pretreat- 
ment, culture medium and culture conditions affect the androgenesis 
process. Each crop has individual optimal pretreatment regimes and 
in vitro culture requirements. In vivo haploid induction techniques involve 
parthenogenesis and wide hybridization followed by chromosome elimi- 
nation. In maize (Zea mays L.), haploids are commonly produced by 
in vivo haploid induction system and the method has been commercially 
exploited by both public and private sector organizations. Wide hybridiza- 
tion method is limited to potato and cereals. Wheat, barley and potato are 
excellent examples of wide hybridization followed by chromosome elimi- 
nation system of haploid induction. The regenerated plants particularly in 
androgenesis need ploidy analysis to distinguish haploids from non- 
haploid plants. Regardless of the mode of development, doubling of 
haploid plants either spontaneously or by chemical means leads to a 
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homozygous doubled haploid (DH) individual with two identical copies of 
each chromosome. Doubled haploids have been used in _ breeding 
programmes to produce homozygous genotypes in a number of important 
species and the number is increasing. DH populations are ideal for genetic 
mapping and allow the development of high-density marker maps that can 
then be exploited in quantitative trait locus identification. Genetic map- 
ping using DHs had an impact in locating gene control traits for yield, 
quality, agronomy, abiotic and biotic stress. DHs now feature in cultivar 
production in a number of crops, and breeding time is considerably 
reduced. Hence, worldwide use of DH technology as an accelerated 
approach to crop improvement has become routine by many breeding 
companies and laboratories leading to the development of almost 300 new 


varieties. 


Keywords 


Androgenesis * Haploids * Doubled haploids « Plant breeding * Wide 
hybridization ¢ High-density marker maps ¢ Quantitative trait loci 


5.1 Introduction 


Doubled haploid technology is one of the revolu- 
tionary techniques in accelerating the plant 
breeding process. Haploids occur spontaneously 
in nature or are induced artificially. The sponta- 
neous haploids were first reported in Datura stra- 
monium (Blakeslee et al. 1922). Subsequently, 
the reports came in other crop plants (Forster 
et al. 2007). However, the frequency of naturally 
occurring spontaneous haploids is very low and 
therefore is of limited use. It was the in vitro 
induced haploids from Datura anthers (Guha and 
Maheshwari 1964, 1966) which opened the doors 
for haploids in basic plant research. Later, 1n vitro 
haploids were successfully produced in tobacco 
through anther culture (Nataka and Tanaka 1968; 
Nitsch and Nitsch 1969). Success was also 
achieved from microspores in rice via haploid 
callus (Niizeki and Oono 1968). Subsequently, 
the reports came in wheat, barley, maize and 
other crops, but the rate of success was far below 
the levels found in Solanaceae particularly in 
tobacco. Slowly, the number of crop species 
increased, and even new methods of haploid 


induction wide hybridization followed by chro- 
mosome elimination were also reported (Kasha 
and Kao 1970). 

Haploid is a general term used to designate a 
plant containing gametic chromosome number 
(n). In a diploid species, the haploid may also be 
called as monoploid (n= x) as it contains only one 
set of chromosomes. The haploids of a polyploid 
are called polyhaploids because they contain 
more than one set of chromosomes. The dihap- 
loid term is thus used for haploid of a tetraploid 
species where the gametic chromosome number 
is n=2x. It has often been observed that the word 
dihaploid is commonly used for a plant where the 
chromosome number is doubled and which actu- 
ally should be termed as doubled haploid (DH), 
not a dihaploid. 

Rapid strides in research on conversion of 
sterile haploids into fertile doubled haploids led 
to the further development of chromosome dou- 
bling techniques to produce homozygous DH 
plants (Jensen 1974). Tobacco, rapeseed and 
barley are currently the most responsive species 
and model species for doubled haploid 
production. 
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5.2. Haploid Induction Methods 
The two broad systems of haploid induction are: 
In vitro haploid induction: Androgenesis and 
gynogenesis 
Induced 
parthenogenesis 
Wide hybridization 
followed by 
chromosome 
elimination 

The common method used for producing hap- 
loids is anther and microspore culture which has 
been commercially exploited in many major 
crops e.g. tobacco, brassica and rice. In maize, 
haploid inducer stocks are used to produce 
in vivo haploids. In wheat and barley, wide 
hybridization followed by chromosome elimina- 
tion is the preferred method for large-scale 
haploid production. Gynogenesis is the only 
practical method in onion. Palmer and Keller 
(2005) differentiated parthenogenesis from gyno- 
genesis whereby the former involves ‘the normal 
development of endosperm and embryo forma- 
tion occurs in vivo’, whereas in the case of gyno- 
genesis, endosperm degenerates with the passage 
of time and there is a need to rescue the embryo 
in the laboratory conditions. In our chapter, we 
will be discussing androgenesis, induced parthe- 
nogenesis and wide hybridization in haploid 
induction, and ploidy determination methods, 
chromosome doubling and application of dou- 
bled haploids in plant breeding. 


In vivo haploid induction: 


5.2.1 Androgenesis 

During sexual reproduction in plants, the gam- 
etes are formed with half the chromosome num- 
ber, which reunites to restore the normal ploidy 
level. The process of microsporogenesis results 
in the reduction of chromosomes through meiosis 
in vegetative cells to give rise to microspores. 
The normal fate of microspores is to develop into 
pollen grains, but microspores can be induced to 
undergo a developmental process that leads to the 
formation of haploid embryos. The haploid 


embryos subsequently develop into haploid 
plants. This process is called androgenesis or 
microspore embryogenesis. Thus, androgenesis 
is the process of induction and regeneration of 
haploids and double haploids from male gametic 
cells. It is often the method of choice for doubled 
haploid production in crop plants (Sopory and 
Munshi 1997). Anther culture requires good 
aseptic techniques but is generally simple and 
applicable to a wide range of crops (Maluszynski 
et al. 2003). 

Guha and Maheshwari (1964) first success- 
fully discovered that the immature pollen could 
be induced to bypass normal development within 
the anther to produce haploid plants in Datura 
inoxia Mill. Since then considerable efforts were 
made to extend the technique to other species. 
Still many agronomically important crops are 
recalcitrant to androgenesis. Concerted efforts on 
many different plants have resulted in the docu- 
mentation of the microspore embryogenesis pro- 
cess in over 250 species (Maluszynski et al. 2003; 
Dunwell 2010). However, success with andro- 
genesis 1s still restricted to mostly annuals and 
non-woody plants. 

The two main techniques that are employed to 
generate androgenic plants in vitro are the anther 
culture and the microspore culture. The applica- 
tion of anther culture is widespread, where the 
excised immature anthers are dissected out from 
flower buds and cultured on suitable growth 
media under appropriate in vitro conditions. 
Various stages of anther culture in tobacco are 
shown in Fig. 5.1. In microspore culture, micro- 
spores are isolated by mechanical means such as 
crushing of anthers and release into medium by 
magnetic stirring. They can also be isolated as 
naturally shed microspores in the culture medium 
after pre-culture of anthers. Although microspore 
culture is technically more demanding and 1s less 
efficient in some plant species than the culture of 
whole anthers, it has several advantages over 
anther culture. In the anther culture method, 
plants regenerate from somatic tissue of the 
anther wall also rather than from the haploid 
microspore cells. This problem does not arise in 
microspore culture because of isolation of micro- 
spores from anthers before culture. As individual 
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Stage | Stage 2 


Buds selection for culture 





Globular embryo 





Hardening of colchicine treated plants 


Fig.5.1 Anther culture in tobacco 


plants develop from separate pollen cells, the 
chance of chimaera production is also low. 
Further, the culture of individual microspores 
makes possible the tracing of events that occur 
from microspore initiation through to embryo- 
genic development, by cell tracking studies 
allowing for the greater understanding of the pro- 
cess of androgenesis in plants. In the case of 
anther culture, certain factors responsible for 
microspore embryogenesis are provided by the 
anthers which need to be supplemented exter- 
nally for microspore culture. However, in some 
plant species, microspores require the parent 
environment during initial culture period, which 
is available only inside the anther wall. 


DAPI stained uninucleate microspores 


Small plantlets 4 weeks after culture 
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Haploids/doubled haploid plants in greenhouse 


5.2.2 Modes of Androgenesis 


The initial steps of microspore embryogenesis 
are unusual to any other embryogenic system, 
since multicellular structure (MCS) formation 
takes place inside the exine wall of microspores 
(Mordhorst et al. 1997). During the initial phase 
of microspore embryogenesis, several patterns 
of cell division have been identified to take place 
inside the exine wall. The asymmetric division 
of the microspore nucleus resulting in a genera- 
tive and a vegetative cell characterizes the 
A-pathway. In the A-pathway, MCSs are formed 
from repeated divisions of the vegetative cell, 
while the generative cell or its derivatives 
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degenerate and die (Sunderland 1974). This is 
the most widely spread mechanism of MCS for- 
mation during androgenesis and it has been 
described in several plant species, including 
most cereals (Raghavan 1986). In the B-pathway, 
it is the symmetric division of the microspore 
nucleus that gives rise to MCSs (Sunderland 
1974). The B-pathway is known to play a major 
role in rapeseed (Brassica napus L.), potato 
(Solanum tuberosum L.), tobacco (Nicotiana 
tabacum L.) and wheat (Triticum aestivum L.) 
microspore embryogenesis (Zaki and Dickinson 
1991; Rihova and Tupy 1999; Touraev et al. 
1996; Indrianto et al. 2001), and it has been also 
described among barley MCSs (Sunderland and 
Evans 1980). An alternative route to androgene- 
sis 1s defined by the independent divisions of the 
generative and vegetative cells, giving rise to 
MCSs with heterogeneous compositions. This 
modified version of the A-pathway, also termed 
E-pathway (Raghavan 1986), has been described 
in barley, maize, tobacco and pepper (Capsicum 
annuum L.) MCSs (Sunderland et al. 1979; 
Pretova et al. 1993; Touraev et al. 1996; Kim 
et al. 2004). Conversely to the known A- and 
B-pathways, in which the roles of the generative 
and vegetative cell are clearer, in this modified 
version of the A-pathway, both generative and 
vegetative cell appear to contribute equally to 
embryo formation (Raghavan 1986). What leads 
microspores to give preference to a particular 
division sequence is not yet clear. Different 
developmental pathways have been reported to 
vary according to plant species and in response 
to different stress treatments and the microspore 


developmental stage (Raghavan 1986; 
Sunderland et al. 1979; Kasha et al. 2001b; Kim 
et al. 2004). 


Irrespective of the above early pattern of 
microspore divisions, the embryogenic pollen 
grains ultimately become multicellular and burst 
open, gradually assuming the form of a globular 
embryo. This is followed by the normal stages of 
postglobular embryogeny until the development 
of plant. This is called direct androgenesis. 
Examples are Datura and _ Nicotiana. 
Alternatively, the multicellular mass liberated 
from the bursting pollen grain proliferates to 


form a callus which may later differentiate into 
whole plants either on the same medium or on a 
modified medium. This is called indirect andro- 
genesis and is common in most of the species, 
caused by complex media. The callus-derived 
plants also result in genetic variations. 


5.3 Factors Affecting 
Androgenesis 
5.3.1 Genotype and Physiological 


Conditions of the Donor 
Plants 


The androgenic response is genotype specific; 
different varieties and genotypes respond differ- 
ently to anther/microspore culture. The number 
and the vigour of microspores within the anther, 
the nutritional status of the tissues of the anther 
and the endogenous levels of growth regulators 
may to some extent be influenced by the physio- 
logical age and growth condition of the donor 
plant, which in turn will have a bearing on micro- 
spore embryogenic competence. Differences in 
competence of the cultured microspores for 
embryogenic development have been observed 
from field-grown and pot-grown plants (Datta 
2005; Silva 2010) and crop grown under different 
seasons (Foroughi-Wehr and Mix 1979). The 
natural flowering conditions are normally the 
best environment for donor plants to produce 
anthers for successful regeneration experiments 
(Wang et al. 2000). We also observed similar 
results where androgenetic response from field- 
grown plants was higher in comparison to the 
pot-grown greenhouse plants in the case of 
Nicotiana tabacum (unpublished data). Infected, 
diseased and unhealthy plants should be avoided 
as donors as they will reduce the response in 
androgenesis. In general, the anthers from the 
first or early flush of flowers have been found to 
be highly responsive to embryogenic develop- 
ment rather than the anthers from buds in later 
stages or tillers of the plant, and the androgenic 
response can be improved by carefully nurturing 
of donor plants and identifying the favourable 
environmental conditions. 
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5.3.2 Developmental Stage 
of Anthers/Microspores 


The microspore development stage is a major 
factor bearing strong influence on the success of 
anther culture. The exact microspore develop- 
ment stage that responds to inductive treatment 
varies with the species, but is restricted to a rela- 
tively narrow developmental window. There are 
slight differences in the developmental stage of 
microspores inside the anthers; thus, even the 
microspores inside an anther will vary with 
regard to their embryogenic competence. 
Therefore, only a percentage of microspores 
within a single anther would shift their normal 
gametophytic pathway to sporophytic develop- 
ment. For many species, the most amenable stage 
is either the uninucleate stage of the microspore 
or at or just after the first pollen mitosis, which is 
the early binucleate stage. At this time, the tran- 
scriptional status of the microspore may still be 
proliferative and not yet fully differentiated 
(Malik et al. 2007). Lessons learnt from model 
crops suggest that mature microspores are con- 
sidered irreversibly committed to gametogenesis 
when the vegetative cytoplasm of binucleate pol- 
len starts to accumulate starch (Touraev et al. 
1997a). 

It is necessary in anther culture to identify the 
flower buds that need to be harvested, containing 
microspores inside the anthers at the correct stage 
appropriate for embryogenesis. In many species, 
a maturity gradient is observed in flower buds 
within the inflorescence that displays either an 
acropetal or basipetal developmental succession. 
Therefore, a series of buds taken from different 
positions on the inflorescence or buds of different 
sizes in the case of individual flowers need to be 
pre-examined in order to identify the stage of 
microspore development within their anthers. 
This is done by squashing anthers to release pol- 
len and staining with acetocarmine to observe the 
nuclei under the light microscope. DAPI (4’, 
6-diamidino-2-phenylindole) may be also used to 
stain the nuclear DNA, which can then be visual- 
ized using a fluorescent microscope. Such pre- 
examination will be very useful to establish a 
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correlation between easily observable morpho- 
logical traits (e.g. the petal to sepal length ratio in 
tobacco as reported by Sunderland et al. (1974) 
and Sood et al. (2013) and size and appearance of 
buds in pepper by Ozkum-Ciner and Tipirdamaz 
(2002)) with the pollen development stage. This 
can be used as a quick guide to identify flower 
buds containing anthers/microspores suitable for 
androgenesis. 


5.3.3 Stress Pretreatment 


In androgenesis, a physical or chemical pre- 
culture treatment is applied to excised flower 
buds, whole inflorescences or separated anthers 
to trigger the induction of the sporophytic path- 
way, thereby preventing the normal gametophytic 
pathway of pollen development (Hosp et al. 
2007a). A variety of microspore pretreatment 
stresses have been tested and found to enhance 
pollen embryogenesis, although the type, dura- 
tion and time of application of these pretreat- 
ments may vary with the species or even the 
variety (Shariatpanahi et al. 2006). It has been 
shown that stress treatment to anthers also 
enhances the haploid regenerants in crop plants. 
With regard to different explants, the type, levels 
and duration of pretreatments are different, and 
regeneration efficiencies vary as well. Stress pre- 
treatments such as high or low temperature, car- 
bohydrate/nitrogen starvation, high osmotic 
pressure and colchicine treatment are most often 
used at the time of culture initiation and have 
shown to play a vital role for the reprogramming 
of microspores. During the plant life cycle, stress 
in general seems to be a common trigger for 
phase change. First, stressed microspores 
undergo a transition towards a more dedifferenti- 
ated state. This transition is exhibited in changes 
in their overall morphology, cellular enlarge- 
ment, typical ‘star-like’ vacuolization and cyto- 
skeletal rearrangements resulting in the migration 
of the nucleus towards a more central position 
(Touraev et al. 1997a; Raina and Irfan 1998). 
Second, the degradation of cellular components 
and certain proteins takes place and is considered 
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a prerequisite for developmental reprogramming 
in many organisms (Wang et al. 1999; Maraschin 
et al. 2005). Third, in tobacco and rapeseed, the 
stress-treated microspores undergo a cell cycle 
arrest, which is relieved when microspores are 
transferred to non-stress conditions (Zarsky et al. 
1995; Binarova et al. 1997). Some _ reports 
described that stress pretreatment is not only 
needed for switching the development fate but 
also influence of divisions and embryos, green 
plant production and spontaneous doubling effi- 
ciency (Kasha et al. 200la; Oleszczuk et al. 
2006). The mechanisms underlying the stress- 
induced switch from gametophytic to sporo- 
phytic pathway on cellular, biochemical and 
molecular processes taking place during transi- 
tion to the embryogenic state have been reported 
by Shariatpanahi et al. (2006). 

The mechanisms of androgenesis for induc- 
tion and embryo formation process are correlated 
to cell cycle change with different biological sys- 
tems. Wang et al. (2000) mentioned that during 
androgenesis, the haploid microspore divides and 
develops into an embryo and subsequently into a 
new haploid plant. Maraschin et al. (2006) 
showed that the embryogenic development cell 
cycle changes in barley during androgenesis is 
divided into three main characteristics, e.g. (1) 
acquisition of embryogenic potential, (11) initia- 
tion of cell division and (111) pattern formation of 
embryos. Touraev et al. (1996) described that 
after the asymmetric division, the vegetative cell 
become arrested in the G1 phase of the cell cycle, 
while the generative cell progresses into mitosis 
and divides again to produce two sperm cells. 
Induction of androgenesis by stress is able to 
overcome the developmentally regulated cell 
cycle arrest, as the vegetative cell re-enters 
S-phase during stress treatment, and microspores 
progress into G2/M transition in culture. 

Specific microspore developmental stages, 
combination of stress pretreatment and proper 
density of microspores are very essential for the 
cell development to embryo formation (Heberle- 
Bors 1989; Hoekstra et al. 1997). It has been 
mentioned that the type, duration and time of 


application of stress pretreatments may vary with 
the species (Datta 2005). After stress pretreat- 
ment, a rapid loss of chlorophyll and degenera- 
tion of tapetum and anther wall of cells were 
observed, and these types of cells are apparently 
most sensitive to stress and enter a cell death 
pathway (Wang et al. 1999). The death of these 
cells may release signals that are able to trigger 
susceptible microspores to enter a sporophytic 
development pathway. Corral-Martinez et al. 
(2013) also showed that the commitment of 
microspores to embryogenesis is associated with 
both massive autophagy and excretion of the 
removed material. It is also hypothesized that 
autophagy would be related to the need for a 
profound cytoplasmic cleaning and excretion 
would be a mechanism to avoid excessive 
growth of the vacuolar system. 


5.3.4 Culture Medium and Culture 
Conditions 


Successful induction of embryogenesis from cul- 
tured microspores or anthers depends to a large 
extent on the composition of the culture medium. 
The requirements of a given species are identified 
mainly through an empirical process of trial and 
error. The source of carbon, macronutrients (par- 
ticularly the form in which nitrogen is supplied in 
the medium) and micronutrients may determine 
if embryogenesis will be initiated or not. The 
type and concentration of the growth regulators, 
particularly the auxins and cytokinins, as well as 
their interactive presence can be the deciding 
media factor that would influence pollen embryo- 
genesis. Solidifying agent should be considered 
an integral part in the case of solid culture 
medium (Dunwell 1985). Standard media have 
been developed for different species although 
specific genotypes may have their individual 
requirements. The more widely used basal media 
for anther culture are the Nitsch and Nitsch 
(1969), MS (Murashige and Skoog 1962), 
N6 (Chu 1978) and B5 (Gamborg et al. 1968). 
These media are used often in their original form 
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but sometimes modified by supplementing or 
subtracting one or more components to better suit 
a given plant species or genotype. 

It is well known that the culture atmosphere 
affects the yield of microspore embryos (Dunwell 
1979). The temperature at which the cultures are 
incubated, light/dark conditions, the density of 
anthers or microspores in a culture vessel and 
other such post culture environmental conditions 
may have a subtle effect on the success of micro- 
spore embryogenesis, depending on the plant 
species or genotype. The temperature at which 
anther cultures are incubated has been shown to 
be an important factor in rice (Okamoto et al. 
2001). However, very few investigations have 
been carried out to manipulate culture tempera- 
tures for enhanced anther culture efficiency. 
Light is another environmental factor that has a 
bearing on anther culture success because it is a 
stimulus that influences in vitro pollen morpho- 
genesis (Reynolds and Crawford 1997). 
Generally, anther or microspore cultures are 
maintained in the dark during the initial phase of 
culture, particularly when regeneration occurs 
through callus, and transferred to illuminated 
conditions at a later stage. For some plant spe- 
cies, diurnal alternation of incubation for several 
hours in the light and then darkness has been ben- 
eficial (Germana et al. 2005; Sunderland 1971). 
However, optimal conditions need to be deter- 
mined for each system. 

Microspore culture has provided a wealth of 
information about the mechanisms underlying 
microspore embryogenesis. Studies of micro- 
spore cultures led to the identification of the 
stress in various forms that converts microspores 
from the gametophytic to the sporophytic path- 
way or from a cell with limited developmental 
potential into a totipotent or embryonic stem cell 
(Shariatpanahi et al. 2006). Functional genomics 
approaches that determine genes associated with 
androgenic induction revealed that a collection 
of genes are up-regulated and down-regulated 
during the switch in developmental pathways 
from gametophyte to sporophyte development 
(Hosp et al. 2007b). Several genes have been 
isolated and characterized as being involved in 
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the reprogramming of microspores from the 
gametophytic to the sporophytic phase, making 
microspore embryogenesis an interesting system 
in which to study cellular reprogramming, phase 
change and totipotency in higher plants (Hosp 
et al. 2007a, b; Tsuwamoto et al. 2007). 

Although the application of anther culture is 
widespread, the processes involved are poorly 
understood. There is a need to study, understand 
and manipulate microspore embryogenesis to 
develop genotype-independent methods. There 
is also a need to ensure direct embryogenesis 
to eliminate an intermediary callus phase that 
can promote gametoclonal variation among 
regenerants. 


5.4 In Vivo Haploid Induction 


5.4.1 Induced Parthenogenesis 
in Maize 


In maize (Zea mays L.), DH lines are commonly 
produced by in vivo maternal haploid induction 
(Bordes et al. 1997; Prigge and Melchinger 
2012). After pollinating the source germplasm 
with pollen inducers, a fraction of 8—10 % of the 
developing seeds have a haploid embryo (Rober 
et al. 2005). At present, discrimination of hap- 
loid seeds from diploid F; seeds resulting from 
normal cross-fertilization relies largely on the 
dominantly inherited marker gene R1-nj carried 
by the inducer. The marker gene causes purple 
coloration of the scutellum and aleurone of dip- 
loid seeds (Nanda and Chase 1966; Neuffer et al. 
1997). Thus, it can be used as an embryo- and 
endosperm-specific marker to distinguish hap- 
loids from diploids seeds as well as seeds origi- 
nating from unintended outcrossing. The use of 
the R1-nj marker system for haploid detection in 
combination with colchicine treatment of hap- 
loid seedlings for chromosome doubling has 
paved the way for routine application of the DH 
technology in private and public maize breeding 
programmes during the past decade (Seitz 2005; 
Smith et al. 2008). The visual scoring of the 
R1-nj marker expression is very labour intensive. 
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Moreover, variable colour intensity of the 
embryo marker hampers an _ unequivocal 
classification of seeds. The classification system 
fails if the source germplasm harbours inhibitor 
alleles such as C1-I (Paz-Ares et al. 1990), which 
suppress the expression of the R1l-nj marker 
(Coe 1962). For rapid and reliable discrimina- 
tion of haploid from diploid seeds, an alternative 
method based on differences in oil content stem- 
ming from pollination with high oil inducers 
has been developed (Melchinger et al. 2013). 
This technology has applicability to any maize 
germplasm and is amenable to automated high- 
throughput screening. 


5.4.2 Wide Hybridization Followed 
by Chromosome Elimination 
for Haploid Induction 


Wide crossing between species has been shown 
to be a very effective method for haploid induc- 
tion and has been used successfully in several 
cultivated species. It exploits haploidy from the 
female gametic line and involves both interspecific 
and intergeneric pollinations. Haploid embryos can 
be produced in plants after pollination by dis- 
tantly related species as shown in barley (Kasha 
and Kao 1970) and wheat (Laurie et al. 1990). 

In barley, wide hybridization between culti- 
vated barley (Hordeum vulgare, 2n=2x=14) as 
female and wild H. bulbosum (2n=2x= 14) as the 
male results in haploid production. The double 
fertilization takes place normally and hybrid 
zygote is formed. However, during early embryo- 
genesis, chromosomes of the wild species are 
eliminated leaving a haploid embryo. The rapidly 
dividing endosperm also suffers chromosomal 
elimination and usually aborts early in seed 
development. The haploid embryos are excised 
and grown in vitro. The ‘bulbosum’ method was 
the first haploid induction method to produce 
large numbers of haploids across most genotypes. 
The method was quickly adopted and improved 
by researchers and barley breeders worldwide 
(Devaux 2003). Maize pollen could also be used 
for the production of haploid barley plants at 
lower frequencies. 


Wheat is another crop where’ wide 
hybridization has been used for haploid produc- 
tion; chromosome elimination has been studied 
and investigated to a great extent and is being uti- 
lized for large-scale practical applications. 
Barclay (1975) was the first person who recov- 
ered wheat haploids in wide crosses between the 
wheat variety Chinese Spring and Hordeum bul- 
bosum. The technique was, however, genotype 
specific due to the presence of the dominant 
crossability inhibitor genes Kri and Kr2 in wheat, 
which are expressed in many wheat varieties. 
Later, wheat x maize system for haploid induc- 
tion was reported (Laurie and Bennett 1987), 
which was genotype nonspecific because of the 
insensitivity of maize pollen to the action of Kr/ 
and Kr2 genes, thereby rendering this system 
more efficient and of practical value. 

In wheat x maize hybridization, hybrid embryo 
develops after pollination, but, in the further pro- 
cess, the maize chromosomes are eliminated and 
haploid embryos are formed. The endosperm 
aborts and haploid embryos cannot develop nor- 
mally. Such haploid embryos are supplemented in 
situ with growth regulator 2,4-dichlorophenoxy- 
acetic acid to maintain the stage suitable for 
embryo isolation and in vitro culture. This method 
enables the production of large numbers of wheat 
haploids from any genotype. Pollination with 
maize is also effective for inducing haploid 
embryos in several other cereals, such as barley, 
triticale (x Triticosecale) (Wedzony 2003), rye 
(Secale cereale) (Immonen and Tenhola-Roininen 
2003) and oats (Avena sativa) (Rines 2003). 
Several species, for example, H. bulbosum, teo- 
sinte, sorghum and pearl millet, have been tested 
as pollinators, but currently the most popular pol- 
len donor is maize (Zea mays). 

Although the wheat x maize system of haploid 
production is quite successful, the maize crop 
needs to be grown under controlled conditions so 
as to coincide flowering with wheat, thereby 
increasing the cost of haploid production. 
Therefore, wheat x Imperata cylindrica system 
was shown to be an efficient alternative to wheat 
x maize system for obtaining the high frequency 
of haploid and doubled haploids in wheat and 
triticale (Chaudhary et al. 2005; Chaudhary 2007, 
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2008a, b, 2010). I. cylindrica is a winter season 
plant and coincides well for flowering with that 
of wheat and triticale under natural conditions. It 
doesn’t require repeated sowings and available 
under natural conditions in almost all parts of the 
world where wheat is cultivated. Moreover, /. 
cylindrica is also genotype nonspecific for 
hybridization with any variety of wheat, triticale 
or their derivatives, and the pollen of I. cylindrica 
is readily available in abundance during the 
wheat hybridization period. 

A similar system operates in potato (Solanum 
tuberosum) (De Maine 2003). In this case, culti- 
vated potato, S. tuberosum, 1s tetraploid (2n= 4x) 
crossed with the diploid relative Solanum 
phureja. In these crosses, both sperm nuclei can 
fuse with the central cell of the ovule to form a 
functional endosperm that initiates and supports 
the parthenogenic development of the unfertil- 
ized egg (De Maine 2003). The tetraploid female 
S. tuberosum produces an embryo sac contain- 
ing one egg cell and two endosperm nuclei, all 
with the genetic constitution n=2x, while the 
diploid pollinator S$. phureja produces two 
sperms of the genetic constitution n= x or 2x. 
After pollination, dihaploid embryos can 
develop from unfertilized egg cells, which are 
supported by a 6x endosperm formed by the 
fusion of polar nuclei with both reduced sperm 
cells. The frequency of seed containing DH 
embryos is low, but these can be easily differen- 
tiated from hybrid seed by exploiting a colour 
marker gene carried by the pollinator line. 
Dihaploid potatoes can be used for breeding 
purposes, including alien germplasm introgres- 
sion or selection at the diploid level, but such 
plants are not homozygous. Haploids have a sig- 
nificant role in potato breeding programmes of 
quite a few companies, since they enable inter- 
specific hybridization, which would not other- 
wise be possible due to differences in ploidy 
levels and endosperm balance numbers. The 
gene pool of potato can be broadened, and cer- 
tain valuable traits, such as disease resistance 
characters from the wild solanaceous species, 
can be more efficiently introgressed into culti- 
vated potato (Rokka 2009). 
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It has advantages in that the methods involved 
(emasculation, pollination, embryo culture) are 
familiar to breeders, cross combinations can be 
manipulated to maximize haploid embryo pro- 
duction by eliminating genotypic dependency on 
the maternal side, and there is no seedling mor- 
tality caused by albinism (a major problem in 
microspore-derived haploids of cereals). 
However, flowering times of both parents need to 
be matched (seasonal limitations) and additional 
chemical treatments, such as colchicine, are 
required to double up the chromosome comple- 
ment of haploid plants. In species other than 
cereals, inefficient chromosome doubling is con- 
sidered to be the main bottleneck of this 
technology. 


5.4.3 Ploidy Determination 
of Regenerants 


Non-haploid embryos and plantlets have been 
observed in anther culture of different genotypes 
(Dunwell 2010). Therefore, ploidy determination 
is of highest importance in haploid induction pro- 
grammes. Three are two methods for detection of 
haploid and non-haploids. 


5.4.4 Direct Method 


This involves mitotic or meiotic cell chromo- 
some counting of induced plants. Root tips are 
the most convenient source of mitotic cells 
where the chromosome counting is easier and 
faster. When the roots are not available, young 
floral buds, leaves or callus can be used. Basic 
principles for handling the mitotic chromosomes 
of all plant species are similar and consist of 
material collection, fixation and _ staining. 
However, slight modifications are required 
depending upon the plant species. The chromo- 
some preparation varies with the plant species 
and chromosome size. For example, tobacco 
chromosome spread is difficult in comparison to 
rice and wheat spread. 
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5.4.5 Indirect Method 

There are a number of indirect methods for ploidy 
determination. These include flow cytometry, 
stomatal size, chloroplast number of the guard 
cells and morphological observations. Among 
these methods, stomatal size and chloroplast 
number are simple, inexpensive and less labour 
intensive, while recording morphological obser- 
vations like plant height, leaf dimensions and 
flower morphology requires long time. 

A large population of haploids need to be 
screened for use in plant breeding studies. This 
requires an efficient high-throughput ploidy anal- 
ysis system. Flow cytometry has emerged as a 
powerful tool for large-scale speedy genome 
analysis. Flow cytometry provides a rapid and 
simple option for large-scale ploidy determination 
as early as in the in vitro culturing phase (Bohanec 
2003). It also enables the detection of mixoploid 
regenerants and the determination of their pro- 
portion. We have analyzed tobacco haploids and 
doubled haploids plants using flow cytometry 
(Fig. 5.2). 

Several markers can be used for assessing the 
origin of diploids, depending on their availability 
for a particular plant species. In the past, evalua- 
tion of regenerants mainly relied on phenotypic 
markers, progeny testing after self-pollination 
and isozyme analysis. Nowadays, DNA molecu- 
lar markers, such as AFLP (amplified fragment 
length polymorphism), RAPD (random amplified 
polymorphic DNA), SCAR (sequence character- 
ized amplified region) or SSR (simple sequence 
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repeat), are commonly used for homozygosity 
testing and assessment of plant origin. There is a 
considerable difference in the interpretation 
between dominant or codominant electrophoretic 
profiles. Codominant molecular markers, as well 
as isozyme markers, have the advantage that a 
single locus, when heterozygous in donor plants, 
might be used for homozygosity determination. 
In contrast, a more complex profile is analyzed 
with dominant markers. In such a case, bands 
missing from the donor profile indirectly indicate 
homozygosity. 


5.4.6 Chromosome Doubling 
of Haploids 


To use haploid plants in breeding programmes, it 
is necessary to have high chromosome duplica- 
tion rates of haploid plants. Haploids are gener- 
ally smaller, less vigorous, more sensitive to 
disease and stress sources and, most importantly, 
sterile. Duplication of the chromosome comple- 
ment is therefore necessary. Conversion of sterile 
haploid plants into fertile homozygous doubled 
haploid plants requires spontaneous or induced 
chromosome doubling. 


5.4.7 Spontaneous Chromosome 
Doubling 


In anther-/microspore-derived plants, spontane- 
ous duplication of chromosomes may occur in 
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Fig.5.2 Detection of haploids and doubled haploids using flow cytometry analyses 
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cultures, often in a sufficient proportion, thus 
eliminating the need for chemically induced dou- 
bling. Spontaneously doubled plants are some- 
times preferred because of the fear that the 
duplication process might induce undesired 
mutations. This percentage may be largely influ- 
enced by the in vitro conditions used to induce 
androgenesis. However, we as well as other 
authors (Kasha 2005; Kasha et al. 2006) feel that 
such a process seems far from being spontane- 
ous, as many different in vitro or ex vitro factors 
may be influencing duplication. As pointed out 
by Henry (1998) and Kasha (2005), the micro- 
spore stage at the time of anther/microspore inoc- 
ulation, the stress treatments or the culture 
conditions used to induce androgenesis do affect 
the frequency of chromosome doubling. Several 
factors, like duration of inductive conditions, 
temperature (heat and cold), mannitol pretreat- 
ments and colchicine and other antimitotic drugs, 
used to induce androgenesis, are known to influ- 
ence the frequency of genome duplication (Zhao 
and Simmonds 1995; Henry 1998; Kasha et al. 
2001a; Zhou et al. 2002a, b; Kasha 2005; Shim 
et al. 2006). The use of plant hormones for 
in vitro cultures has also been directly related to 
DNA duplication events (Joubes and Chevalier 
2000; Magyar et al. 2005). Even the type of 
explant used to induce androgenesis can have an 
impact. A striking example can be found in the 
work on Brassica rapa of Sato et al. (2005), who 
found that microspore culture yields up to a 
threefold higher rate of duplication than anther 
culture. Similar results have been obtained in B. 
napus (Lichter et al. 1988) and B. oleracea (Wang 
et al. 1999). With respect to androgenesis, endo- 
reduplication has long been suggested as a mech- 
anism to explain the spontaneous chromosome 
doubling and polyploidy in certain species (for 
details of endoreduplication in androgenesis, see 
Segui-Simarro and Nuez 2008). However, it has 
been inferred that the early stages of androgene- 
sis induction and microspore embryogenesis do 
not match many of the characteristics of cells 
typically undergoing endoreduplication (Segui- 
Simarro and Nuez 2008). The evidences in favour 
of nuclear fusion during androgenesis have 
increased in the last few years. However, most of 
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the reports demonstrating nuclear fusion do not 
exclude the possibility of endoreduplication 
(Kasha 2005; Shim et al. 2006). But in some spe- 
cies, mainly cereals, evidence for nuclear fusion 
is conclusive (Gonzalez-Melendi et al. 2005; 
Shim et al. 2006). 

Because of the spontaneous chromosome dou- 
bling occurring in the haploid calli and embryos, 
ploidy level analysis cannot always identify 
pollen-derived plants. In fact, when in vitro 
anther or ovary cultures are performed, diploid 
plants can be homozygous doubled haploids or 
heterozygous diploids produced by the somatic 
tissue. For example, anther culture can be 
employed to obtain somatic embryos and clonal 
plant propagation in many genotypes (Germana 
2003, 2005, 2011). Isozyme analyses, RAPD 
markers, sequence characterized amplified region 
(SCAR) and simple sequence repeats (SSRs) 
have been utilized to assess homozygosis and to 
determine the gametic origin of callus and regen- 
erated plantlets, irrespective of their ploidy. 


Induced Chromosome 
Doubling 


5.4.8 


There are always a number of induced embryos 
which do not undergo doubling and finally 
become a haploid plant. This number of individu- 
als may oscillate enormously among species. 
Within a species, there are differences among 
genotypes as well. For maternally derived hap- 
loid plants, the rate of spontaneous doubling is 
often much lower or entirely absent, so a dou- 
bling procedure is essential. The most frequently 
used application is treatment with  anti- 
microtubule drugs, such as colchicine (colchicine 
is a natural alkaloid extracted from several spe- 
cies of the genus Colchicum, especially C. 
autumnale), which inhibits microtubule polymer- 
ization by binding to tubulin. Although colchi- 
cine is highly toxic, used at a millimolar 
concentration and known to be more efficient in 
animal than in plant tissues, it is still the most 
widely used doubling agent. Other options are 
oryzalin, amiprophosmethyl (APM), trifluralin 
and pronamide, all of which are used as herbicides 
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and are effective in micromolar concentrations. 
The method of application for effective duplica- 
tion varies depending on the genotype. 
Colchicine concentration and treatment dura- 
tion seem to have a key role (Caperta et al. 2006). 
The right combination of time and concentration 
is critical for the promotion of either c-metaphase 
arrest or chromosome doubling followed by cell 
cycle progression. Short-term exposures to col- 
chicine seem to work better than prolonged treat- 
ments in terms of doubling frequency and 
avoidance of polyploidies or embryo abnormali- 
ties (Chen et al. 1994; Zhou et al. 2002a, b). 
Genotype is another factor that influences the 
final action of colchicine. The most common 
stages to apply the treatment are (1) application 
to parts or the whole haploid plant, once regener- 
ated, (2) application to developing microspore- 
derived haploid embryos and (3) application 
during androgenesis induction as a component of 
the induction medium. Haploid plants can be 
totally sunk into colchicine solutions for varying 
times, never longer than a few hours. However, 
this method yields poor efficiency of doubling 
and may carry a significant amount of plant 
abnormalities and/or death. An additional draw- 
back is the generation of a considerable volume 
of toxic waste that must be properly disposed of. 
An alternative is to apply colchicine to specific 
plant organs such as roots, axillary buds or apical 
meristems. These in planta methods are easy but 
seem not to be an efficient wide-range strategy. 
Colchicine can also be applied in vitro to 
developing androgenic embryos (Pintos et al. 
2007) or calli (Wan et al. 1989; Ouyang et al. 
1994), but it implies low survival rates and chi- 
maerism. However, what seems to be more effi- 
cient is to directly apply colchicine as soon as 
possible during the early stages of haploid 
embryogenic development. When applied before 
embryo/callus formation, the generation of chi- 
maeric individuals can be greatly avoided. 
Besides, early application is much more eco- 
nomic due to the small amount of reagent used 
and the reduced volume of waste generated. In 
wheat, the incorporation of colchicine during the 
first hours of anther or microspore culture 
improved the doubling rate (Hansen and Andersen 


1998; Soriano et al. 2007) and colchicine concen- 
trations of 1 mM for 24 h or 0.3 mM applied for 
48 or 72 h during the first hours of microspore 
culture, resulting in pronounced increase in the 
proportion of doubled haploids in_ triticale 
(Wurschum et al. 2012). However, in the case of 
melons, in vivo colchicine application has been 
found better in terms of enhanced doubling effi- 
ciency in comparison to in vitro treatments 
(Yetisir and Sari 2003; Solmaz et al. 2011). In 
vitro trifluralin treatment to microspores of 
Brassica napus resulted in higher doubling effi- 
ciency in comparison to in vivo and in vitro col- 
chicine and in vitro oryzalin application (Klima 
et al. 2008). Trifluralin has been suggested as 
more efficient, cheaper and less toxic substitution 
of widely used antimitotic agent colchicine for 
production of doubled haploid lines in Brassica 
sp. (Abraha et al. 2008). N,O has been used to 
induce chromosome doubling in a variety of spe- 
cies, mostly cereals. However, N,O is considered 


to be less effective than colchicine in 
dicotyledons. 
5.5 Applications of Doubled 


Haploids 


Haploids and double haploids have very impor- 
tant applications in the field of plant breeding, 
genetic and molecular studies. Some of these 
applications are detailed below. 


5.5.1 Development of Cultivars 

Conventional breeding programme involving 
pedigree method of selection normally takes 
7 years to produce a variety in self-pollinated 
crops (excluding the testing period). Doubled 
haploids overcome many limitations of conven- 
tional breeding (Fig. 5.3). It is the fastest route to 
homozygosity and can be performed in any gen- 
eration in breeding programme. Since DH lines 
are homozygous, there are no dominance effects 
on selection. Thus, DH breeding accelerates the 
breeding programmes and shortens the time up to 
3—5 years to release the cultivars. These DH lines 
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Fig.5.3 Reduction in breeding cycle-conventional vs DH breeding 


can be directly released as a variety for general 
cultivation and/or used as parents in breeding pro- 
grammes to develop inbred lines for hybrid seed 
production in the case of cross-pollinated species. 
The DH lines can also serve as useful germplasm 
in generating new cross combinations. 

There is currently much interest in expanding 
DH for F1 hybrid production to high value crops 
such as medicinal and aromatic plants. Because 
breeding has been neglected in these species, DH 
for Fl production has the potential to make sig- 
nificant advances in providing high, stable and 
predictive yields of the raw biochemicals pro- 
cessed by pharmaceutical and _ nutraceutical 
industries (Ferrie 2006). Furthermore, DH is an 
indispensable tool to obtain homozygosity in 
species suffering from inbreeding depression 
(implying therefore that they cannot easily pro- 
duce fertile homozygous lines by _ self- 
pollination), such as rye (Secale cereale) 
(Immonen and Anttila 1996) and forage grasses 
Festuca and Lolium (Nitzche 1970), as well as in 
perennial woody species, generally characterized 
by a long juvenile period, a high degree of het- 
erozygosity, large size and, often,  self- 
incompatibility (Germana 2006, 2009) and 


generally in plants with barriers to repeated self- 
ing (e.g. dioecy and self-incompatibility), where 
it is not possible to obtain homozygous breeding 
lines through conventional methods. In A. offici- 
nalis, a dioecious crop species, an inbred popula- 
tion is produced through sib crosses between 
pistillate and staminate plants which yield 50 % 
males and 50 % females. However, the commer- 
cially desirable features of this crop are uniform 
male population with spears having low fibre 
content. Anther culture was used to produce hap- 
loids of this species and this was diploidized to 
raise homozygous males. These are called as 
super-males. 

The first crop plant released by DH technol- 
ogy was the rapeseed (Brassica napus) cultivar 
Maris Haplona (Thompson 1972), followed by 
the cultivar Mingo in barley (Hordeum vulgare), 
obtained in Canada in the late 1970s (Ho and 
Jones 1980). Worldwide, the use of DH technol- 
ogy has become routine by many breeding com- 
panies and _ laboratories involved in_ the 
improvement of a wide range of crops (Tuvesson 
et al. 2007), resulting in the development of 
almost 300 new varieties (these data are probably 
underestimated because of the scarce information 
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from some parts of the world and from some 
private companies) (Thomas et al. 2003; Dunwell 
2010). Most of the DH varieties are in barley 
(96), followed by rapeseed (47), wheat (20) and 
the rest in pepper, rice, tobacco, eggplant, melon, 
triticale and asparagus. DH cultivars are now a 
feature in many crop species; for barley (H. vul- 
gare) it is estimated that 50 % of contemporary 
cultivars in Europe are produced via DH (Forster 
et al. 2007). In vegetable crops, DHs are used 
prominently as parents for Fl hybrid seed 
production. 

However, a number of comparisons have 
shown that DH populations were comparable to 
pedigree breeding and single-seed descent-derived 
populations. The technology has not been rou- 
tinely used in public breeding programmes. The 
main reason for non-inclusion of DH as primary 
technology for cultivar release universally could 
be its resource intensiveness, lack of skilled 
workers and high initial setup cost. However, the 
commercial plant breeding sector has included 
the DH programme especially in_ hybrid 
breeding. 


5.5.2. Backcross Breeding 


Backcross conversion is a standard plant breed- 
ing method for improving an elite line defective 
in a particular trait like disease resistance, which 
is transferred usually from a hardy or wild geno- 
type (donor) to a high-yielding but susceptible 
genotype (recipient) of the same species. This is 
done by crossing donor and recipient parents dur- 
ing the first year and recurrent backcrossing of Fl 
hybrid to recipient parent to reconstitute/restore 
the genes of high yield in new cultivar. It takes 
5—6 years to recover the genes of recipient parent. 
Another problem in backcross breeding is identi- 
fication of gene of interest or disease resistance at 
each generation particularly if disease resistance 
is controlled by recessive gene. In this case we 
have to self each BC generation to generate reces- 
sive homozygotes for gene expression. This takes 
double time as compared to dominant gene. 
Contrary to the problems associated with conven- 
tional backcross method, DH technology com- 
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bined with molecular markers not only solves the 
problems mentioned above but also decreases the 
product development period significantly. A 
molecular marker is used to identify a gene of 
interest in a backcross generation and DH is 
developed to get the homozygosity in a single 
generation. The method has been used success- 
fully to introgress stripe rust resistance in barley 
(H. vulgare) (Chen et al. 1994; Toojinda et al. 
1998). Backcross inbred lines (BILs) are pro- 
duced by backcrossing, and making DH from this 
backcross is very useful and ideal for gene map- 
ping. Hence, double haploidy combined with 
marker-assisted selection (MAS) increases the 
efficacy and precision of selection of desirable 
traits. It is also helpful in stacking resistance 
genes. 


5.5.3. Mutant Selection 


The use of haploid systems for mutant induction 
and selection has been listed among the most 
important applications of haploid development 
technologies (Kasha 1974). On an average the 
isolation of a homozygous mutant line in seed- 
propagated crop takes three to four generations, 
whereas the DH system can shorten the process 
by half. All mutated traits are expressed in the 
first generation after mutagenic treatment, both 
at haploid and doubled haploid levels. DH 
mutant lines selected on the basis of their phe- 
notype are genetically fixed and do not segre- 
gate in the next generation. The frequency of 
desired mutant in DH population will be much 
higher than in conventional M, generation due 
to homozygous condition for mutants. Haploids 
seem to be a good platform for selection of 
mutants under in vitro conditions. The popula- 
tion of haploid cells can provide an opportunity 
of identifying a rare mutation event by screen- 
ing a large mutagenized population in vitro. 
Two approaches of applying mutagenesis to 
haploids in in vitro culture have been suggested 
by Maluszynski et al. (1996). In the first case, 
haploid cells or tissues can be directly given the 
mutagenic treatment, and in the alternative 
approach, seeds are treated with mutagen, and 
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M1 plant gametes are used as donors for the 
haploid culture. 

Microspores offer an interesting alternative 
for mutation treatment. By targeting the mutation 
treatment at single gametic cells and then induc- 
ing embryogenesis and DH plant production, it is 
possible to create a population of homozygous 
mutant lines directly (Szarejko and Forster 2006) 
which provide the possibility to screen both 
recessive and dominant mutants in the first gen- 
eration. DH also increases the selection effi- 
ciency of desired mutants due to the gametic 
segregation ratio (1:1) and lack of chimaerism. 
However, the disadvantage is that homozygous 
lethal mutations would be eliminated which 
otherwise are retained in the heterozygous 
condition. 


5.5.4 Reverse Breeding and Male 
Sterility 


DH plants also play an integral part in reverse 
breeding, a recently developed technology. The 
protocol for ‘reverse breeding’ was proposed by 
Winker et al. (2007). This technology involves 
the recovery of DHs from microspores of plants 
in which no or limited recombination § has 
occurred. First, the superior hybrid genotypes are 
identified among the segregating population. 
Then using genetic transformation, a gene for 
induced suppression of meiotic recombination is 
introduced. The resulting recombinant inbred 
populations can be screened via molecular mark- 
ers to identify those with complementary combi- 
nations of chromosomes to allow an original 
heterozygous parent of the DH to be recon- 
structed by hybridizing the two individuals. 
Consequently, different parents with different 
chromosome constitutions can be identified to 
reconstruct existing Fl hybrids. 

Reversible male sterility and doubled haploid 
production have been used in tobacco for F1 
hybrid production. The technology has been 
developed based on metabolic engineering of 
glutamine in developing tobacco anthers and pol- 
len (Ribarits et al. 2007, 2009). Cytosolic gluta- 
mine synthetase (GS1) was inactivated in tobacco 


S. Sood and S. Dwivedi 


by introducing mutated tobacco GS genes fused 
to the tapetum-specific TA29 and microspore- 
specific NTM19 promoters. Pollen in primary 
transformants aborted close to the first pollen 
mitosis, resulting in male sterility. A non- 
segregating population of homozygous doubled 
haploid male-sterile plants was generated through 
microspore embryogenesis. Fertility restoration 
was achieved by spraying plants with glutamine 
or by pollination with pollen matured in vitro in 
glutamine-containing medium. The combination 
of reversible male sterility with doubled haploid 
production results in an innovative environmen- 
tally friendly breeding technology. This technol- 
ogy precludes the release of transgenic pollen 
into the environment. 


5.5.5 Doubled Haploids 
in Quantitative Genetics 
Studies 


DH lines are valuable and the most important 
material for quantitative inheritance studies as 
these are completely homozygous and there is no 
heterozygosity present in these lines. This makes 
them best fit for studying ‘quantitative inheri- 
tance’ (Touraev et al. 2009; Mochida et al. 2004). 
Important components of quantitative inheritance 
like number of genes controlling a quantitative 
trait, gene interactions among different genes, 
gene linkages and chromosome locations can be 
studied using DH lines (Choo 1981a; Jui et al. 
1997). The number of genes controlling a poly- 
genic trait can be determined by taking the square 
of deviation of extreme DH plants from sample 
mean and dividing it by genotypic variance or 
additive variance of DHs (Choo 1981b). For link- 
age studies, DH lines from F1 and F2 generations 
are developed, and means, variances and skew- 
ness are compared for both types of DH lines 
produced which gives the idea about linkage 
among the genes. This method was proposed by 
Snape and Simpson (1981). They also concluded 
that if skewness of a DH population is zero, it 
means there is no gene interaction present. If 
skewness is greater than zero, complementary 
gene interaction is present and less than zero 
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means duplicate gene interaction is present. So 
we can detect additive gene interaction and mul- 
tiple gene action by this analysis. 


5.5.6 Doubled Haploids in Genetic 
Mapping and Genomics 


DHs are being routinely used for basic research 
in areas such as genomics, gene expression and 
genetic mapping (Ferrie and Caswell 2011). DH 
populations are ideal for genetic mapping due to 
their complete homozygosity, ability of repeating 
the experiment on several times and at different 
labs and less time required to produce a large DH 
population. Map construction is relatively easy 
using a DH population derived from a F, of two 
homozygous parents due to simple segregation 
ratio (1:1) similar to backcross (Snape 1976). 
Moreover, using DH lines for QTL mapping pro- 
vides an opportunity to score the polymorphism 
in complete homozygous conditions in several 
environments in different years (Young 1994; 
Paterson 1996). DHs have been a key feature in 
establishing genetic maps in a range of species, 
notably barley (H. vulgare), rice (Oryza sativa), 
rapeseed (B. napus) and wheat (7: aestivum) 
(Forster and Thomas 2005). In out-pollinating 
species, genetic analysis can be simplified by 
using at least one DH parent in the initial cross to 
produce a segregating population — this has been 
successful in vegetables such as the Brassica 
oleracea complex, namely, cabbage, cauliflower, 
broccoli and Brussels sprouts (Sebastian et al. 
2002). Marker maps provide a platform for trait 
mapping, which is of particular interest to plant 
breeders. The development of molecular marker 
map in DH populations led to a rapid deployment 
in QTL studies. The first QTL studies in barley 
were published within a year of their respective 
marker maps (Heun 1992; Hayes et al. 1993). For 
complete homozygous lines (like DH lines), an 
MQTL software has been developed for QTL 
analysis at multi-locations (Tinker and Mather 
1995). In responsive crop species, such as barley, 
the process of producing a segregating DH 
population from an F, hybrid to find marker-trait 
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associations has become routine (Tuvesson et al. 
2006). 

In rice, DH population has been used for QTL 
mapping, and QTLs have been found for yield; 
quality; agronomic, developmental traits; and 
biotic stress (Forster and Thomas 2005). 
Similarly, the QTLs have also been detected in 
wheat, barley, maize, mustard, rapeseed, flax, 
pepper, etc. (Forster and Thomas 2005). 

DHs are also ideal in establishing marker-trait 
associations in bulked segregant analysis (BSA), 
a technique that compares individuals from two 
extremes of a population distribution for a given 
trait, for example, disease resistance and suscep- 
tibility. As with quantitative trait mapping, accu- 
racy in BSA is. dependent upon robust 
phenotyping; because DHs can be repeatedly 
tested, reliable data, including field data from tri- 
als over several seasons, can be obtained. DNA 
from the selected DHs can then be genotyped 
progressively with a range of marker systems. 
BSA and DH analysis has been successful in 
establishing marker-assisted selection for several 
breeding traits, mainly disease and pest resis- 
tance, but also quality traits (Michelmore et al. 
1), 

DHs play an important role in genomics. 
Expressed sequence tags (ESTs) are used for 
searching of candidate genes. These ESTs can be 
mapped for their position relative to QTLs, and 
their physical chromosomal location can be 
identified via BAC libraries. DH populations 
play a vital role in integrating genetic and physi- 
cal maps, thereby providing precision in target- 
ing candidate genes. In rice, ESTs have been 
found to be co-located with major genes and 
QTLs for resistance to major diseases in the IR 
64x Azucena DH map (Wang et al. 2001). 
Induced mutation provides another method of 
linking genes to phenotypes. The mutant popula- 
tions for TILLING (Targeting Induced Local 
Lesions in Genomes) have been developed in 
several species for forward and reverse genetics. 
It is important that mutant populations are 
derived from an inbred or a homozygous line 
preferably DH line to avoid the detection of false 
positives owing to inherent variation in the start- 
ing material. 
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5.5.7. Production of Transgenic 
Plants 


Transgenes can be incorporated into the haploid 
microspore system to produce homozygous 
transgenic plants. Both haploid and DHs can be 
used for transformation of desirable gene, and 
thus, transgenic plant having stability are pro- 
duced which can be used for transfer of a desir- 
able gene, genetic studies like mapping and gene 
identification (Forster et al. 2007). Although the 
primary target is the unicellular microspore, cells 
or explants at all stages from microspore embryo- 
genesis and regeneration have been used as recip- 
ients for gene delivery, the choice being 
determined by regeneration potential after treat- 
ment. Many transformation techniques have been 
tried, including microinjection, electroporation, 
particle bombardment and Agrobacterium 
tumefaciens-mediated transformation (Touraev 
et al. 2001). Jahne et al. (1994) were the first to 
produce homozygous transgenic plants using 
biolistic bombardment of barley microspores. In 
tobacco, particle bombardment has been directed 
at embryogenic and non-embryogenic micro- 
spores and immature pollen (Stoger et al. 1995; 
Touraev et al. 1997b). In wheat, DH transgenic 
plants were produced for drought tolerance using 
the transgene (HVAJ), and transgenic plants had 
higher tolerance to simulated water stress. The 
transgene was stably integrated and expressed 
over generations (Chauhan and Khurana 2011). 
The problem with transformation process 1s that 
once a transgene is transferred, transgenic plants 
need to be stabilized for homozygosity to study 
the phenotypic effects of transferred gene. This 
requires additional time and resources. Thus, 
combining DH with transgenic technology can 
enhance the efficiency and speed up the produc- 
tion of transgenics. 
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Abstract 

Horticultural crops are important for nutritional and livelihood security of 
farmers in the developing countries. Generating improved varieties 
adapted to modern production, transportation and storage requirements 
can make a significant impact on these economies. Compared to progress 
achieved in cereals and grasses, genetic improvement in horticultural 
crops has lagged behind. A significant review of the status and availability 
of genomic resources in these crops can be utilised efficiently to conserve 
and explore existing genetic diversity, to understand relationship between 
genotype-phenotype and to accelerate breeding. Recent advances in auto- 
mation and high-throughput sequencing can be used in decoding as yet 
uncharacterized, complex genomes. In this review, we explore the various 
tools available under genomics, with emphasis on underutilised tropical 
and subtropical horticultural crops. 
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Phenotyping 

6.1 Introduction 

The importance of horticulture in improving 
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plantation crops and medicinal and aromatic 
crops, 1s an important component of the agricul- 
ture sector. Advances made in the field of molec- 
ular biology are being effectively used in 
horticultural crop improvement programmes. 
With particular reference to fruit crops, we pro- 
vide herewith advances made in tropical and sub- 
tropical crops. Genomic resource development, 
in the form of whole genome sequencing, 
expressed sequence tags (ESTs), genomic survey 
sequences (GSS) and high-throughput genome 
sequences (HTGs), is helping growth and genetic 
improvement of these crops. Resources such as a 
large number of ESTs, large-insert genomic 
libraries, plenty of molecular markers and high- 
density genetic maps are being used for sequenc- 
ing and annotation/identification/mapping and 
cloning of genes or quantitative trait loci (QTLs) 
and marker-assisted selection (MAS) in impor- 
tant horticultural crops. Availability of next- 
generation sequencing (NGS) technologies like 
FLX-454, Ulumina, SOLiD and PacBio has 
accelerated the pace at which genomic resources 
are being generated for economically important 
horticultural crops (Van et al. 2013). 


6.2 Genome Sequencing 


Projects 


Whole-genome sequencing and annotation proj- 
ects have significantly improved our knowledge 
of genome composition, organisation and evolu- 
tion. Our understanding of plant origin and evo- 
lution through genome duplication, ancestral 
rearrangements and unexpected polyploidization 
events has made rapid progress in recent years 
due to the analysis of plant genomes. Many 
sequencing projects have been initiated 1n several 
horticultural crops of importance. Some of the 
most noticeable projects are tomato genome 
sequencing project (www.sgn.cornell.edu/about/ 
tomato), potato genome sequencing consortium 
(www.potatogenome.net), papaya genome 
sequencing project (www.asgpb.mhpcc.hawaii. 
edu/papaya/) and grape genome sequencing proj- 
ect (www.vitaceae.org). Genome Database for 
Rosaceae (GDR) provides centralised access to 
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Rosaceae genomics, genetics and breeding data, 
and analysis tools include apple, peach, straw- 
berry, rose and other crops of Rosaceae family 
and many more. 

Papaya whole genome sequencing project has 
been completed and published the genome 
sequence in early 2008 (www.asgpb.mhpc. 
hawaii.edu/papaya) (Ming et al. 2008). 
Transgenic female SunUp variety was sequenced 
to 3x coverage using whole genome shotgun 
(WGS) with Sanger sequencers. Genome size of 
papaya is 372 Mbp, about three times the size of 
Arabidopsis, with which it shared a common 
ancestor about 72 million years ago (Wikstr6m 
et al. 2001). Some interesting features emerged 
from genome sequencing data, like papaya hav- 
ing about 25 % fewer genes than Arabidopsis, 
with reduction across most gene families and bio- 
synthetic pathways. The lower gene number is 
believed to be due to the absence of the recent 
genome wide duplication of papaya genome, 
unlike in Arabidopsis (Ming et al. 2008). The 
much awaited banana genome, based on double- 
haploid Musa acuminata cv. Pahang, was pub- 
lished in June 2012. The genome was sequenced 
to a depth of 20x using a combination of 454 and 
Sanger sequencing and 50x using [lumina short- 
read sequencing technology. The genome is 
organised into 11 pseudomolecules, covering 
332 Mbp accounting for 63 % of the estimated 
total banana genome size of 523 Mbp. An addi- 
tional 140 Mbp of unanchored scaffolds and con- 
tigs (33 % of the estimated total genome size) are 
also included in the current genome release 
(D’ Hont et al. 2012). Among citrus species, the 
draft genome of sweet orange cv. valencia has 
been recently published (Xu et al. 2013), while 
the genome of clementine orange is unpublished 
and still not fully assembled. The genome of hap- 
loid clementine has been developed by in situ 
parthenogenesis of Clemenules clementine 
induced by irradiated pollen of fortune mandarin, 
followed by direct embryo germination in vitro 
by the International Citrus Genome Consortium 
(UICGC http://www.citrusgenome.ucr.edu/). 
Using Sanger whole genome shotgun approach, 
301.4 Mbp of genome spread over 1,398 scaf- 
folds with 2.1 % gap at 7x coverage has been 
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generated. Over 96 % of the assembly is 
accounted for by the nine chromosome 
pseudomolecules. Similarly, a double-haploid 
(dihaploid) line derived from the anther culture 
of valencia sweet orange has been used to reduce 
the complexity of the sequenced genome (Xu 
et al. 2013). In addition to this, transcriptome 
sequencing of four representative tissues using 
shotgun RNA sequencing (RNA-seq) to capture 
all transcribed sequences and _ paired-end-tag 
RNA sequencing (RNA-PET) to demarcate the 5’ 
and 3’ ends of all transcripts has been conducted. 
On the basis of DNA and RNA sequencing data, 
characterisation of the orange genome for its 
gene content, heterozygosity and evolutionary 
features has been accomplished. 

Availability of the full genome sequence of 
several crop species is helping plant biologists to 
answer many questions about the evolution of 
species, genome structure and genetics. Also, it 
helps plant breeders conserve and explore biodi- 
versity. In order to illustrate how knowledge 
about genomes is directly impacting crop 
research, we can consider the examples in rice, 
maize, tomato, potato, sugar beet and apple. In 
rice, once the genome sequence information 
became available, it was immediately used to 
help elucidate a major QTL for rice grain pro- 
duction, which was found to be a cytokinin ox1- 
dase (Ashikari et al. 2005). Later, a transcription 
factor controlling the expression of the gene was 
identified as DST (Li et al. 2013). The re- 
sequencing of many accessions within a species 
is enabling high-throughput genotyping to iden- 
tify a large array of SNPs and subsequently to 
produce haplotype maps (Barabaschi et al. 
2012). In maize, this technique has been used to 
develop powerful haplotype maps using genome 
information (Gore et al. 2009), and also QTLs 
for biomass and bioenergy using whole genome 
and metabolic prediction have been identified 
(Riedelsheimer et al. 2012). The tomato genome 
has elucidated the esterase responsible for differ- 
ences in volatile ester content in different tomato 
species (Goulet et al. 2012). A gene underlying 
the uniformity ripening locus has also been iden- 
tified to be Golden 2-like transcription factor 
which is responsible for chlorophyll distribution 


in unripe fruits (Powell et al. 2012). The 
cultivated tomato genome along with the draft 
genome of its wild relative S. pennellii has 
shown the evolution of terpene biosynthesis in 
this genus (Matsuba et al. 2013). The informa- 
tion generated will undoubtedly help in breeding 
tomatoes with improved quality traits. The 
genome of another member of Solanaceae fam- 
ily potato has been used for identification of 
transcription factors involved in the regulation of 
plant maturity and life cycle (Kloosterman et al. 
2013). The sugar beet genome has been used to 
identify the biological factors controlling flower- 
ing time in order to create a ‘winter beet’ which 
would be able to avoid bolting during winter 
thereby giving more yields (Pin et al. 2010). 
Among fruit crops, in apple, 27 cultivars repre- 
senting worldwide breeding germplasm have 
been re-sequenced at low coverage with Illumina 
Genome Analyzer II. These sequences were 
aligned with the whole genome sequence of 
‘Golden Delicious’ to identify a total of 2, 
113,120 SNPs. Applying stringent filtering crite- 
ria, SNPs exhibiting even distribution across the 
apple genome and a range of minor allele fre- 
quencies were chosen giving rise to a panel of 
7,867 apple SNPs for International RosBREED 
SNP Consortium (IRSC) apple 8K SNP array. 
Of these, 5,554 were reported to be polymorphic 
in segregating families and the germplasm col- 
lection (Kumar et al. 2012). This genomic 
resource is proving helpful in marker-locus-trait 
association discovery, description of genetic 
architecture of quantitative traits, investigation 
of genetic variation and genomic selection in 


apple. 


EST Resources 
in Horticultural Crops 


6.3 


Expressed sequence tags (ESTs) are short 
(200-800 nucleotide base in length), unedited, 
randomly selected single-pass sequence reads 
derived from cDNA libraries. EST and comple- 
mentary DNA (cDNA) sequences provide 
direct evidence for all the sampled transcripts. 
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These are the most important resources for 
transcriptome exploration. ESTs enable gene 
discovery, complement genome annotation, aid 
in gene structure identification, establish viabil- 
ity of alternative transcripts, characterise single 
nucleotide polymorphism (SNP) and facilitate 
proteome analysis (Jongeneel 2000; Rudd 
2003; Dong et al. 2005). High-throughput ESTs 
can be generated at reasonably low cost from 
either 5’ or 3’ end of a cDNA clone to gain 
insight into the transcriptionally active regions 
in any organism. In plants having a large 
genome size, it is cumbersome to sequence the 
whole genome; but, EST data sets can be uti- 
lised to complement or serve as an alternative 
to genome sequencing, earning the label ‘poor 
man’s genome’(Rudd 2003). However, ESTs 
are subject to sampling bias resulting in under- 
representation of rare transcripts, often account- 
ing for only 60 % of an organism’s genes 
(Bonaldo et al. 1996). ESTs have been devel- 
oped in several plant species, and this DNA 
sequence information has been deposited in 
online databases. In several fruit crops, 
sequence data for several fully characterised 
genes and full length cDNA clones have been 
generated. At the National Centre for 
Biotechnology Information (NCBI) EST data- 
base (dbEST; www.ncbi.nlm.nih.gov/dbEST/Y), 
there are 77,528, 44,565 and 234,474 ESTs of 
papaya, Musa and sweet orange, respectively. 
These fruit crops are of worldwide economic 
importance; therefore, they are being studied 
extensively at the genomic level. 

With the advent of NGS, it is expected that 
the number of ESTs will significantly increase, 
since these sequences form an important 
resource for functional genomics. ESTs derived 
from various kinds of tissues, including tissues 
of organisms in a range of developmental stages 
or under stress, could significantly facilitate 
gene discovery as well as gene structure annota- 
tion, large-scale expression analysis and the 
design of expressed gene-oriented molecular 
markers and probes for microarrays until the 
whole genome is sequenced (Zhang et al. 2004; 
Kawaura et al. 2006). 
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6.4  Transcriptome/Gene 


Expression Studies 


In fruit crops with large EST collections, the next 
logical step is expression analysis of genes using 
techniques like SAGE, microarray, tilling array 
and RNA-seq. Once available, the microarray can 
be exploited to study genes involved in disease 
resistance, fruit ripening, origin and evolution of 
reproductive organs, etc. The transcriptome can 
also be used as a proxy for a complete genome 
and sequenced using NGS technology at a lower 
cost, as the transcriptome size can be a fraction of 
the total genome size as demonstrated in maize 
which has 2.3 GB genome, but the size of anno- 
tated genes is a mere 97 Mb or 4 % of the genome 
(Schnable et al. 2009). A representation of a 
majority of all genes in de novo assembly can be 
obtained through sampling of several core tissues 
with diverse biological functions, e.g. root, leaf, 
flower and fruit at lower cost. 

Several studies have been made on economi- 
cally important fruits like papaya, banana, citrus, 
avocado and pineapple. In papaya, analysis of 
ripening-related genes has been carried out using 
a cross-species (XSpecies) microarray technique 
based on phylogenetic proximity between papaya 
and Arabidopsis thaliana (Fabi et al. 2012). 
Transcriptome analysis resulted in the identifica- 
tion of 414 ripening-related genes. An expression 
of some of these genes was validated using 
qPCR. The transcription profile was compared to 
tomato and grape, which revealed several simi- 
larities between ripening in papaya and tomato, 
especially with respect to primary metabolism, 
regulation of transcription, biotic and abiotic 
stress and cell wall metabolism. In another study, 
the identification of genes responsible for sex 
determination in papaya, namely, male, female 
and hermaphrodite plants, was done using the 
digital gene expression analysis by 
Ht-SuperSAGE (Matsumura et al. 2010) which 
resulted in the identification of a large number of 
tags by sequence analysis using SOL1D3 which 
were mapped on _ papaya primitive sex 
chromosome. 
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In avocado, transcriptome sequencing studies 
have been undertaken in a big way to discover 
candidate genes. Identification of defence-related 
genes after infection with Phytophthora cinna- 
momi which causes root rot in avocado (Mahomed 
and Van den Berg 2011) and Colletotrichum 
gloeosporioides responsible for anthracnose dis- 
ease, using Roche 454 pyrosequencing (Djami- 
Tchatchou and Straker 2012), has been done. 
Floral transcriptome of avocado is of interest to 
biologists since it is considered as a basal angio- 
sperm, with weakly differentiated floral organs. 
Using expression profiles of flowers at different 
stages of development, it has been shown that 
overlapping transcriptional programmes charac- 
terise the floral transcriptome of avocado com- 
pared to Arabidopsis where floral gene expression 
domains are typically organ-specific (Chanderbali 
et al. 2009). Transcription analysis is also useful 
for the development of molecular markers like 
the SNP and SSRs. In an unpublished work by 
John Ohlrogge of Michigan State University, a 
large EST library has been prepared for ‘Hass’ 
genotype of avocado from different tissues like 
leaves, flowers, mesocarp, etc. using Roche 454 
sequencing. This is being used as a reference 
transcriptome, while RNA from leaves and flow- 
ers of other varieties like ‘Bacon’, ‘Simmonds’ 
and “Tonnage’ has been pooled and sequenced 
using [lumina GAII. These reads have been 
aligned to ‘Hass’ reference to develop single- 
nucleotide polymorphisms and indels (Arias 
et al. 2012). 

Musa (which includes both bananas and plan- 
tains) is an important genus for global export and 
local markets. Musa paradisiaca (ABB group, 
plantain) has been shown to possess superior 
stress resistance (drought, chilling, diseases, etc.) 
compared to Musa acuminata (AAA group, 
banana). The de novo transcriptome assembly 
has been used to gain a comprehensive overview 
of plantain transcriptome to identify resistance 
genes/proteins for use in banana breeding pro- 
grammes. Fungal infections like Fusarium oxys- 
porum f. sp. cubense (Foc) cause devastating 
disease in this crop. Recently, efforts have been 
made to understand the mechanism of Foc 


infections (Wang et al. 2012) and the genetic 
basis of resistance to Fusarium oxysporum f. sp. 
cubense tropical race 4 by transcriptome profil- 
ing of resistant and susceptible Cavendish banana 
roots (Li et al. 2012). Using RNA-seq and digital 
gene expression (DGE) technique, these studies 
have shed light on the transcriptional changes in 
roots following infection and large differences in 
the transcriptome profile of resistant somaclonal 
variant and its susceptible wild type. The charac- 
terisation of tolerance to Fusarium oxysporum f. 
sp. cubense infection in banana using suppres- 
sion subtractive hybridization and gene expres- 
sion analysis along with host response, pathogen 
diversity, current understanding of biochemical 
and molecular changes which occur during host- 
pathogen (Foc) interaction and strategies to 
develop resistant genotypes has been reviewed 
(Swarupa et al. 2013, 2014). 

In pomegranates, the fruit peel is a rich source 
of bioactive compounds, and work has been done 
on transcriptome analysis of peel at different 
stages of fruit development to unravel the genes 
and pathways involved in the synthesis of these 
compounds; also, genic SSR markers have been 
developed in this study (Ono et al. 2011). 
Recently, in Litchi, de novo assembly and char- 
acterisation of fruit transcriptome has been done 
to identify differentially regulated genes in fruit 
in response to shading. This study also sheds 
light on the molecular mechanism underlying 
severe fruitlet abscission process under the shade 
which leads to very low yield and significant eco- 
nomic losses for growers (Li et al. 2013). 

In citrus, extensive gene expression studies 
have been conducted on different species as listed 
by NCBI (January 2013). A maximum number of 
studies (13) have been carried out on sweet 
orange (Citrus sinensis) due to its commercial 
importance, followed by C. reticulata (4), C. par- 
adisi (3), C. limona (2), C. clementia (2), C. trifo- 
liata (2) and one each in C. medica, C. maxima, 
C. aurantifolia and C. limon. These studies cover 
a wide range of problems like differential gene 
expression due to infection with pathogens in 
response to peel wounding, water deficit, etc. 
Comparative transcript profiling has been done 
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for seedless and wild-seeded orange, C. reticu- 
lata Blanco and stylar canal cells of C. clementia, 
to identify candidate genes implicated in 
self-incompatibility. 

With the goal of discovering genes controlling 
consumer and grower traits in mango, gene dis- 
covery project has been started by the Queensland 
Primary Industries and Fisheries (Holton 2010). 
Traits like fruit quality and tree architecture are 
being focussed on using a multidisciplinary 
approach of sequencing expressed genes via 
ESTs and serial analysis of gene expression 
(SAGE), next-generation sequencing producing 
low-pass genome coverage, identifying candidate 
genes for fruit quality (aroma and colour) and 
tree architecture. Recently, the leaf transcriptome 
and chloroplast genome of mango have been 
characterised using NGS (Azim et al. 2014). A 
total of 24,593 annotated unigenes (80 % of total) 
has been reported and C. sinensis identified as the 
closest neighbour with 37 % matched sequences. 
The chloroplast genome in mango was reported 
to contain 139 genes, out of which 91 are coding 
protein. 

In other crops like pineapple, persimmon, etc., 
few transcriptome studies have been done, while 
such resources and studies are yet to be taken up 
for several other horticultural crops of interest. 
Because of the various key functionalities of 
transcriptome-based gene discovery studies, it is 
essential to integrate these with other studies, like 
diversity analyses and combining transcript 
expression with metabolite levels and other 
‘omics’ fields. By doing this, transcriptomics can 
become more useful than a descriptive gene cata- 
loguing exercise and can aid in identifying key 
genes in a pharmaceutically/nutritionally relevant 
pathway (Hirsch and Buell 2013). 


Molecular Markers: Marker- 
Assisted Selection (MAS) 
and Genetic Diversity 


6.5 


Molecular marker techniques (employing RFLP, 
RAPD, microsatellites, AFLP, SNP and other 
kinds of marker systems) represent a suite of 
powerful research tools for characterisation and 
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management of genetic polymorphism (variation 
and diversity) in plant breeding and germplasm 
characterisation programmes. Where they are 
available and cost-effective, molecular markers 
can have a wide number of applications in plant 
breeding, the most commonly considered being 
marker-assisted selection (MAS). The FAO- 
BioDeC suggests that the earliest generation 
forms of DNA-based molecular markers (RAPD, 
RFLP) are more widely used than the more 
recently developed types of molecular markers 
(microsatellite and AFLP markers). 

MAS is based on identification and use of 
markers which are linked to the gene(s) control- 
ling the trait of interest (FAO 2003). By virtue of 
that linkage, selection may be applied to the 
marker itself. The advantage consists in an oppor- 
tunity for speeding up the application of selection 
procedure. For instance, a character which is 
expressed only at the mature-plant stage may be 
selected at the plantlet stage if selection is applied 
to a molecular marker. Also, selection may be 
applied simultaneously to more than one charac- 
ter, and selection for a resistance gene can be car- 
ried out without needing to expose the plant to 
the pest, pathogen or deleterious agent. Finally, 
when there is linkage between a molecular 
marker and a quantitative trait locus (QTL), 
selection may become more efficient and rapid. 
The construction of detailed genetic and molecu- 
lar maps of the genome of the species of interest 
is a prerequisite for most forms of MAS. However, 
the current cost of the application of these tech- 
niques is significant, and the choice of one tech- 
nique rather than others may be dictated by cost 
factors. There are still very few examples of crop 
varieties in farmers’ fields which have been 
developed based on MAS, largely because of the 
currently prohibitive cost of incorporating large- 
scale MAS into the budgets of most plant breed- 
ing programmes. 

In addition to MAS, molecular markers can be 
used in germplasm characterisation. Compared 
to morphological and protein markers, DNA- 
based genetic markers are often considered to be 
the most useful for genetic diversity studies 
because they are highly polymorphic and 
heritable (their expression is not affected by 


6 Plant Molecular Biology Applications in Horticulture: An Overview 119 


environmental variability). In recent years, 
different molecular markers (RAPD, RFLP, 
AFLP, SSRs and ISSR) have been employed for 
the investigation of cultivar origins and taxo- 
nomic relationships of several plant species 
(Manoj et al. 2010). Molecular approaches, such 
as RAPD for the estimation of genetic diversity 
(Chen et al. 2007; Feria-Romero et al. 2009), 
ALFP for genetic characterisation of genotypes 
(Hernandez-Delgado et al. 2007) and SSRs for 
cultivar identification and linkage mapping 
(Risterucci et al. 2005), have been reported in 
crops such as guava, papaya, pomegranate, 
mango and several others. The FAO-BioDeC 
indicates that molecular markers are being exten- 
sively used in Latin America with 93 trials and 
165 molecular marker projects at the research 
phase in nine countries. The assessment of the 
genetic diversity of Andean local roots and tubers 
in Latin America using molecular markers is an 
interesting development. Species reported to be 
included in molecular marker programmes in 
these countries are sugar cane, rice, cocoa, 
banana, bean and maize. The survey also indi- 
cates that most countries in Asia are undertaking 
a wide spectrum of crop research using molecu- 
lar markers. Molecular marker-related research 
activities in Africa are reported to be underway in 
only a few of the countries, such as Ethiopia, 
Nigeria, South Africa and Zimbabwe; the range 
of Africa crops under study with molecular mark- 
ers, however, 1s very wide: from traditional com- 
modities to tropical fruits. 


6.6 Linkage Maps 


and Association Studies 


Conventional fruit breeding is primarily based on 
phenotypic selection with the help of morpho- 
logical markers, but this takes years in case of 
woody species which have a long generation time 
and are mostly cross-pollinated. Also, the mor- 
phological markers are influenced by epigenetics 
and environmental factors. Traits of economic 
importance like fruit quality, yield, and precocity 
and disease resistance are polygenic and complex 
in nature. Therefore, advanced genetic tools like 


molecular markers (which can be used on any 
tissue at any time of plant growth, overcoming 
limitations of traditional methods) are being 
identified and used in the breeding/varietal iden- 
tification of fruit trees. The most popular markers 
developed from genomic resources like ESTs 
include SSRs, SNPs and conserved ortholog set 
(COS) markers. SNPs are becoming more popu- 
lar than SSRs as genetic markers in linkage anal- 
ysis because these are more abundant and suitable 
for automation (Novelli et al. 2004; Selmer et al. 
2009). In temperate fruit crops like the apple, 
Illumina Infintum SNP assay containing 8,000 
SNPs is being used as a crucial tool for applica- 
tion of a novel selection strategy called genomic 
selection (GS) in_ breeding programmes. 
Availability of this assay has resulted in dramatic 
reduction in genotyping costs for large-scale 
genotyping of apple seedlings (Chagné et al. 
2012). SNPs have been developed only for a few 
tropical fruit species like banana, where a small 
number of 20 SNP markers are available with 
NCBI. At present, the cost of developing and 
testing SNPs 1s still higher than that for develop- 
ing SSRs. Nonetheless, SNP discovery is actively 
being pursued in crops like avocado (Arias et al. 
2012), mango (Kuhn 2014; Sherman et al. 2014) 
and longan, under a USDA-sponsored project on 
genomics and single nucleotide marker discovery 
in horticultural fruit crops. In fruit trees, where 
the reference genome sequence exists, like 
papaya, clementine and sweet orange and banana, 
genotyping- by-resequencing using next-genera- 
tion sequencing will allow rapid and efficient 
identification of SNPs and other types of poly- 
morphism that are related to structural variations 
such as copy-number variations (CNVs), inser- 
tion-deletion polymorphisms (IDPs) and pres- 
ence-absence variants (PAVs) between genomes. 
These can be used for genome-wide and candi- 
date gene-based association mapping studies to 
identify QTL-trait associations (Khan and 
Korban 2012). Such studies have been made in 
cereal crops like maize, where extensive struc- 
tural variation (including hundreds of CNVs and 
thousands of PAVs) has been discovered and 
implicated in important biological phenomena 
like heterosis (Springer et al. 2009). 
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A number of approaches have been developed 
to reduce the complexity of the genomes for cost- 
efficient and simplified discovery of SNP mark- 
ers using NGS, like RNA-seq, complexity 
reduction of polymorphic sequences (CRoPS), 
restriction-site-associated DNA — sequencing 
(RAD-seq) and — genotyping-by-sequencing 
(GBS) (Davey et al. 2011). Among these, RAD- 
seq and GBS are proving to be particularly effi- 
cient in rapid and mass discovery of SNPs in 
highly heterozygous species. This is expected to 
help develop ultrahigh density maps and associa- 
tion mapping. Together with NGS, these tech- 
niques help to discover SNPs by sequencing a 
large set of restriction fragments. DNA fragments 
from one individual are ligated to a modified 
adapter containing a barcode. Fragments from 
many individuals can, therefore, be pooled 
together and sequenced on a single lane. The 
resulting reads can be separated bioinformati- 
cally by identifying the barcode. By sequencing a 
family or other populations of interest this way 
and comparing the tags to phenotypes of individ- 
uals, many biologically relevant SNPs and 
genetic loci can be identified in a single experi- 
ment. If each genotype of a mapping population 
has been characterised phenotypically for traits 
of interest and has an individual barcode, then 
RAD and GBS markers can be used routinely for 
linkage mapping and QTL analysis (Rowe et al. 
2011). Using RAD-seq, Chutimanitsakun et al. 
(2011) recently constructed an RAD linkage map 
in barley and performed QTL analysis; Scaglione 
et al. (2012) discovered about 34,000 SNPs and 
800 indels in globe artichoke; and Stolting et al. 
(2013) revealed the patterns of divergence and 
gene flow between ecologically divergent species 
of poplar. GBS information for hundreds of culti- 
vars and few wild relatives has been used for 
development of a catalogue of segregating SNPs 
in cassava (Prochnik et al. 2012). Among these 
two markers, GBS holds more promise because 
of its ability to allow simultaneous marker dis- 
covery and genotyping with low-cost and simple 
workflow (Kumar et al. 2012). In grape, under 
the project VitisGen, GBS is being employed for 
a coordinated marker discovery and application 
using centralised phenotyping and map 
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development (Davidson and Lance 2012). Under 
this project, over 5,000 breeding lines are being 
genotyped each year to track alleles introgressed 
from 11 Vitis species by mapping 20,000 SNP per 
population, including alleles not present in V. 
vinifera reference genome. It is expected that a 
marker set of at least 30 markers will be gener- 
ated from this project for alleles controlling traits 
of interest for current and future grape breeding 
programme. 

While SNP markers are the markers of choice 
for well-studied plant species, these may not 
address the needs of researchers working on 
crops with limited genomic information avail- 
able. In such crops, microsatellite markers have 
been used in various areas like conservation of 
genetic resources, the establishment of core 
germplasm, population genetics, molecular 
breeding and _ fingerprinting of varieties 
(Ellegren 2004). The wide range of application 
of these markers is due to their codominant, 
multiallelic and highly reproducible nature; 
high resolution, amenability to high-throughput 
and polymerase chain reaction (Oliveira et al. 
2006). These markers have been discovered tra- 
ditionally by constructing genomic libraries 
enriched for a few, targeted SSR motifs and 
sequencing of clones containing SSRs. However, 
such library-based approaches are expensive, 
labour- and time-consuming and can isolate 
only the targeted, enriched SSR motifs (Zalapa 
et al. 2012). The various methods of genome- 
wide marker discovery (SSR and SNPs) and 
genotyping using NGS like reduced-representa- 
tion sequencing using reduced-representation 
libraries (RRLs) or complexity reduction of 
polymorphic sequences (CRoPS), restriction- 
site-associated DNA sequencing (RAD-seq) 
and low-coverage genotyping can be used in 
both model crops with high-quality reference 
genome sequence and in non-model species 
with no existing genomic data. Low-coverage 
genotyping by pyrosequencing has been used to 
develop 171 highly polymorphic markers (60 % 
markers with PIC value >0.5) in pomegranate 
which will help in the breeding and conserva- 
tion of this speciality fruit crop (Ravishankar 
et al. communicated, unpublished as of date). 
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Sequences generated by pyrosequencing were 
also mined to identify five conserved miRNAs 
in pomegranate (Kanupriya et al. 2013). 
Available genomic resources like ESTs are also 
used for isolating SSRs using bioinformatics 
tools. However, EST-derived markers are gener- 
ally less polymorphic; thus, they are not very 
useful for developing linkage maps and for 
DNA fingerprinting (Kalia et al. 2011). 

Linkage maps are used to show the position 
of known genes or genetic markers relative to 
each other in terms of recombination frequency, 
rather than a specific distance along each chro- 
mosome. These maps are important for the 
development of physical maps, mapping QTLs 
and association studies. In high-density maps, 
the markers are placed very close to each other 
throughout the genome (Tanksley et al. 1992). A 
large number of SSR/SNP markers and mapping 
population is required for developing a high- 
density linkage map. Availability of such maps 
along with phenotypic data is useful for predic- 
tive breeding since it helps in identifying the 
QTL regions associated with traits of interest. A 
physical map of a chromosome or a genome on 
the other hand shows the physical locations of 
genes and other DNA sequences of interest. 
These maps facilitate in the identification and 
isolation of genes by positional cloning. In the 
past decade, linkage maps have been prepared 
for a few fruit crops using various types of map- 
ping populations and markers. QTL mapping 
has not been widely used due to the perennial 
nature of most fruit crops, because of which, 
establishing and maintaining biparental crosses 
and progenies of fruit trees in multiple locations 
are difficult and costly (Rikkerink et al. 2007). 
Also, the Fl progeny derived from full-sib 
crosses between two outbred parents is used for 
QTL mapping which gives a low resolution of 
QTLs. The QTL region has been shown to span 
~ 5-10 cM, with likelihood of presence of hun- 
dreds of genes within this region (Gonzalez- 
Martinez et al. 2007). Therefore, utilising QTLs 
in MAB without a fine mapping is not advocated 
due to genetic drag effects and/or rapid decay of 
linkage disequilibrium in most fruit trees (Khan 
and Korban 2012). 


Association mapping (AM) has been pro- 
posed as a viable alternative to overcome the 
above limitations. This mapping technique has 
the potential to identify a single nucleotide 
polymorphism within a gene that is responsible 
for phenotypic differences. It involves search- 
ing for genotype-phenotype — correlations 
between unrelated individuals. It has high reso- 
lution due to historical recombination accumu- 
lated in natural populations and collections of 
landraces, breeding materials and _ varieties 
(Flint-Garcia et al. 2003). Although AM pres- 
ents clear advantages over linkage mapping, 
they are often applied in conjugation. The low- 
resolution linkage analysis is used to identify 
large candidate regions (1-10 Mb) within 
which fine mapping is used for identification of 
causal locus. Linkage maps therefore must be 
developed in fruit crops to validate the associa- 
tions identified by AM, thus reducing spurious 
associations/false positives (Kover et al. 2009). 
An example where three maps have been used 
in conjugation is found in peach where classical 
linkage, physical map and association genetics 
analyses were combined to find alleles of SSR 
markers suitable for marker-assisted selection 
(MAS) of the flat peach character at both the 
parent and progeny levels (Picanol et al. 2013). 
Phenotypic characterisation of germplasm is a 
prerequisite for association mapping analyses. 
However, it is cumbersome and expensive for 
perennial plant species. Therefore, characteri- 
sation could be more efficient if focused on a 
reasonably sized subset of accessions or so- 
called core collection, reflecting the geographic 
origin and variability of the germplasm 
(El-Bakkali et al. 2013). Hence, there is an 
urgent need for identification of core collec- 
tions in many of the horticulturally important 
fruit crops using markers. 


6.7 Comparative Genomics 

Comparative genomics is the study of the rela- 
tion between structural and functional attributes 
of genomes across species generally within a 
family. Sequence information on several fruit 
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crops deposited in public databases can be uti- 
lised for both basic and applied research in 
related genera. This approach has been used 
mainly across families’ like Rosaceae, 
Brassicaceae and Myrtaceae. Comparative 
genomic studies have shown that genome evolu- 
tion within family results from chromosomal 
rearrangements, leading to syntenic colinearity 
among large chromosomal regions. As a result, 
chromosomal regions where the marker order is 
highly conserved have been utilised in Malus, 
where primer pairs have been designed from 
conserved exon sequences flanking predicted 
intron-exon junctions in Malus cDNA sequences. 
Transferability of these primers to other rosa- 
ceous genera was found to be high, with primer 
pairs representing 85 % of genes amplifying 
products from Fragaria and primer pairs repre- 
senting 85 % of genes, amplifying products from 
Prunus genomic DNA. These primers were 
screened in Fragaria and Prunus mapping bin 
sets, and 38 % of the genes were successfully 
located on both maps (Sargent et al. 2009). This 
approach has also proved useful for construction 
of first genetic linkage map of loquat (Eriobotrya 
Japonica) by using SSR markers derived from 
Malus (Gisbert et al. 2009). Whole genome of 
Eucalyptus grandis, which is a member of 
Myrtaceae family, has been recently sequenced 
(Myburg et al. 2011). This sequence 1s currently 
in use as a reference to identify homologies in 
species belonging to the same family like guava 
(Ritter 2012). Development of various genomic 
tools such as genetic map, physical map, tran- 
script map and map-based genome sequence, 
markers, in model/important crop of one family 
can be exploited in improvement of related 
crops, for example, mango, cashew and pista- 
chio belong to Anacardiaceae, while in Moraceae 
family breadfruit, jackfruit, mulberry and fig are 
found. Whole genome sequencing of one of the 
members of these families will help in develop- 
ing genomic resources for others (Ravishankar 
et al. 2013) making this as one of the most cost- 
effective methods in the development of genomic 
resources. 
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6.8 Database Resources 

Large-scale genotype and phenotype projects 
along with the advent of next-generation sequenc- 
ing technology have increased the amount of data 
generated and the scale at which analysis can be 
performed. There is many a software available 
for sequence analysis, creation of database and 
curation of data for plant genome. These data- 
bases have become an essential resource for 
experimental and computational biologists allow- 
ing users to take advantage of diverse data sets 
and provide a platform for comparative genom- 
ics. These resources are organised around a spe- 
cific crop or related genera. Table 6.1 shows 
resources generated in fruit crops and provides 
curated records that have detailed and updated 
information. Such records provide researchers 
the essential genomic resources for molecular 
understanding of biological properties and for 
application of this knowledge. Such database 
resources are widely used in many horticultural 
crops like tomato (Iquebal et al. 2013; Suresh 
et al. 2014). The large resources of SNPs/SSR 
markers, especially in the genic region, QTLs, 
genes and miRNAs, would immensely benefit 
both breeders and the molecular biologist to uti- 
lise the natural trait reservoir and functional 
aspects of genes for crop improvement. It is 
therefore the development of an integrated data- 
base of SSRs, genes, miRNAs, known QTLs, 
SNPs and their location on chromosome which 
would greatly help crop biologists in understand- 
ing a crop and their efforts at crop improvement. 


6.9 Phenotyping 

The plant phenotype is the set of structural, phys- 
iological and performance-related traits of a gen- 
otype in a given environment. Phenotyping is the 
act of determining the quantitative or qualitative 
values of these traits. In recent years, with 
increasing availability of high-density genotypic 
information, our understanding of the components 
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of phenotypic variation that are due to genetics 
has improved considerably. However, the geno- 
type-phenotype relationship is expected to be 
more dependent on the availability of high- 
quality phenotypic and environmental informa- 
tion. Unlike genotyping, which is now highly 
mechanised, automated and uniform across 
organisms, phenotyping 1s still a species-specific, 
labour-intensive and environmentally sensitive 
methodology which requires integrated efforts of 
specialists from disciplines such as functional, 
quantitative and computational genetics/genom- 
ics, bioinformatics, modelling, physiology and 
computer science (reviewed by Cobb et al. 2013). 

Several approaches are available for pheno- 
typing, destructive or noninvasive measurements, 
in or ex situ, at a spatial resolution stretching 
from the subcellular level (nm-scale precision) to 
canopy stands (metre-scale precision) and tem- 
poral resolutions ranging from seconds to entire 
growing seasons. At the macroscopic level, shoot 
structure, growth, physiological status and photo- 
synthetic performance can be measured in a 
range of crop species using optical methods (2D 
imaging and 3D reconstructions in the visible 
and invisible regions of the electromagnetic spec- 
trum) and non-optical methods (e.g. micro- 
waves). Phenotyping for complex traits is aided 
by analytical chemistry such as mass spectros- 
copy, high performance liquid chromatography, 
inductively coupled plasma spectroscopy, etc. 
Several ‘omics’ technologies (transcriptomics, 
metabolomics, ionomics, proteomics, methylo- 
mics, etc.) are also being used for this purpose. 
These are all highly automated, and useful, due to 
their high-throughput rates and accuracy (Cobb 
et al. 2013). Choice of the phenotyping approach 
depends on several factors like the biological 
question the researcher is trying to answer, the 
size of the population in question [e.g. less than 
ten individuals for precise physiological experi- 
ments to a moderate number of plants (200—400) 
for mapping studies or GS training populations 
or a large number of plants (400-—1,000?) for 
association studies] and the heritability of the 
phenotype, controlled environment testing, trac- 
tability of the phenotype and resource availability 
(Cobb et al. 2013). 
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Examples of high-throughput phenotyping in 
tree species can be found in eucalyptus, poplar 
and apple. In eucalyptus and poplar, phenotypic 
assessment methods of physical and chemical 
properties of wood using near-infrared reflec- 
tance (NIR) together with the classical measure- 
ments of growth and wood quality tree properties 
have been reported by Poltri et al. (2011). In 
apple, fruit quality is an important trait, and 
large-scale standardised phenotyping using sen- 
sory and instrumental methods has been devel- 
oped under RosBREED to enable data pooling 
across locations/institutes (Evans et al. 2011). 
Texture analysis in an apple progeny through 
instrumental, sensory and histological phenotyp- 
ing has been reported by Galvez-Lopez et al. 
(2012) and high-throughput metabolic phenotyp- 
ing by McGhie and Rowan (2012). From these 
experiences, we need to evolve methodologies 
for traits of interest in each tropical fruit tree crop 
through institutional collaborations. 


6.10 Conclusions 


Technologies available in the present day, like 
next-generation sequencing, are helping biolo- 
gists decipher genome information in a short 
span of time and apply it to generate genomic 
information for non-model horticultural crops. 
NGS has led to sequencing of whole genomes 
and transcriptomes providing opportunity for dis- 
covery of new and useful information on the 
genetic control of important traits. Converting 
these into genetic tools like markers would 
enhance the efficiency and accelerate the breed- 
ing of these crops compared to conventional 
phenotype-based approaches. Currently, the 
application of these technologies is much slower 
in horticultural crops compared to others like 
cereals. 

First, we need to employ these technologies in 
the discovery of markers like SSR and SNPs in 
large numbers in non-model crops. Also, gener- 
ally, the laboratories involved in researching 
second-tier crops and non-model species have 
limited technical and financial resources; there- 
fore, it is desirable that marker assays developed 
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should be easy enough to be implemented in labo- 
ratories with basic equipments. Such resources will 
aid in the development of linkage maps provided 
that large population with a known family structure 
is available for traits of interest. Another useful 
application of markers will be in identification and 
conservation of core group of individuals or geno- 
types which can be used for association studies. 
The NGS technologies will also help in sequence- 
enabled allele mining and exploration and exploita- 
tion of naturally occurring allelic variations in 
candidate genes controlling key traits for pre- 
breeding. This approach is expected to help 
identify new and useful haplotypes and guide 
development of allele-specific markers to be used 
in marker-assisted selection. Secondly, the NGS 
technology is also proving to be an important tool 
in de novo transcriptome assembly providing rapid, 
robust, inexpensive and informative assessment of 
transcriptome content, transcript abundance and 
diversity. A transcriptome can serve as a proxy for 
the genome and can be generated at significantly 
lower cost by whole transcriptome shotgun 
sequencing (RNA-seq method). High-throughput 
sequencing methods can also help identify known/ 
novel small RNAs involved in the regulation of 
gene expression. Thirdly, comparative genomics, 
using genomic data sets from phylogenetically 
related species, can help assemble and interpret 
genomes and transcriptomes of desired plant spe- 
cies. Finally, there is a need to develop strong 
international collaborations and industrial and 
academic interactions to ensure adequate public 
and private funding of projects. This has been 
demonstrated in other crops like tomato, apple, 
maize and rice where major achievements have 
been made with regard to crop improvement. 


References 


Arias De Ares RS, Borrone JW, Tondo CL, Kuhn DN, 
Schnell RJ (2012) Genomics of tropical fruit tree 
crops. In: Schnell RJ, Priyadarshan PM (eds) Genomics 
of tree crops. Springer, Heidelberg, pp 209-239 

Ashikari M, Sakakibara H, Lin S, Yamamoto T, Takashi T, 
Nishimura A, Matsuoka M (2005) Cytokinin oxidase 
regulates rice grain production. Science 309(5735): 
741-745 


K. Chaturvedi and L. Sahijram 


Azim MK, Khan JA, Zhang Y (2014) Characterization of 
mango (Mangifera indica L.) transcriptome and chlo- 
roplast genome. Plant Mol Biol 7:1—16 

Barabaschi D, Guerra D, Lacrima K, Laino P, Michelotti 
V, Urso S, Vale G, Cattivelli L (2012) Emerging 
knowledge from genome sequencing of crop species. 
Mol Biotechnol 50:250—266 

Bonaldo MF, Lennon G, Soares MB (1996) Normalization 
and subtraction: two approaches to facilitate gene dis- 
covery. Genome Res 6:791—806 

Chagné D, Crowhurst RN, Troggio M, Davey MW, 
Gilmore B (2012) Genome-wide SNP detection, vali- 
dation, and development of an 8K SNP array for apple. 
PLoS One 7(2):e31745 

Chanderbali AS, Albert VA, Leebens-Mack J, Altman NS, 
Soltis DE, Soltis PS (2009) Transcriptional signatures 
of ancient floral developmental genetics in avocado 
(Persea americana; Lauraceae). Proc Natl Acad Sci U 
S A 106(22):8929-8934 

Chen TW, Chang-Chai NG, Wang CY, Shyu YT (2007) 
Molecular identification and analysis of Psidium gua- 
java L. from indigenous tribes of Taiwan. J Food Drug 
Anal 15:82-88 

Chutimanitsakun Y, Nipper RW, Cuesta-Marcos A, Cistué 
L, Corey A, Filichkina T et al (2011) Construction and 
application for QTL analysis of a restriction site asso- 
ciated DNA (RAD) linkage map in barley. BMC 
Genomics 12(1):4 

Cobb JN, DeClerck G, Greenberg A, Clark R, 
McCouch S (2013) Next-generation phenotyping: 
requirements and strategies for enhancing our 
understanding of genotype—phenotype relationships 
and its relevance to crop improvement. Theor Appl 
Genet 6:1-21 

D’Hont A, Denoeud F, Aury JM, Baurens FC, Carreel F, 
Garsmeur O et al (2012) The banana (Musa acumi- 
nata) genome and the evolution of monocotyledonous 
plants. Nature 50(3):250—266 

Davey JW, Hohenlohe PA, Etter PD, Boone JQ, 
Catchen JM, Blaxter ML (2011) Genome-wide 
genetic marker discovery and genotyping using 
next-generation sequencing. Nat Rev Genet 12(7): 
499-510 

Davidson C, Lance E (2012) VitisGen: a coordinated effort 
in grape genotyping, phenotyping, and marker assisted 
breeding. Plant and Animal Genome XX Conference. 
San Diego, CA, USA 

Djami-Tchatchou AT, Straker CJ (2012) The isolation of 
high quality RNA from the fruit of avocado (Persea 
americana. Mill.). South Afr J Bot 78:44-46 

Dong Q, Kroiss L, Oakley FD, Wang BB, Brendel V 
(2005) Comparative EST analyses in plant systems. 
Methods Enzymol 395:400—408 

El Bakkali A, Haouane H, Moukhli A, Costes E, 
Van Damme P, Khadari B (2013) Construction of core 
collections suitable for association mapping to 
optimize use of Mediterranean olive (Olea europaea 
L.) genetic resources. PLoS ONE 8(5):e61265 

Ellegren H (2004) Microsatellites: simple sequences with 
complex evolution. Nat Rev Genet 5:435-445 


6 Plant Molecular Biology Applications in Horticulture: An Overview 


Evans K, Luby J, Brown S, Clark M, Guan Y, Orcheski B, 
Schmitz C, Peace C, van de Weg E, Iezzoni A (2012) 
Large-scale standardized phenotyping of apple in 
RosBREED. ISHS postharvest unlimited. Acta Hortic 
945:233-238 

Fabi JP, Seymour GB, Graham NS, Broadley MR, May 
ST, Lajolo FM, do Nascimento JRO (2012) Analysis 
of ripening-related gene expression in papaya using an 
Arabidopsis-based microarray. BMC Plant Biol 
12(1):242 

FAO (2003) _ http://www.fao.org/docrep/006/j0083e/ 
j0083e00.htm 

Feria-Romero IA, Astudillo-Dela HV, Chavez-Soto MA, 
Rivera-Arce E, Lopez M, Serrano H, Lozoya X (2009) 
RAPD markers associated with quercetin accumula- 
tion in Psidium guajava. Biol Plant 53:125-128 

Flint-Garcia SA, Thornsberry JM, Buckler ES IV (2003) 
Structure of linkage disequilibrium in plants. Annual 
Rev Plant Biol 54(1):357—374 

Galvez-Lépez D, Laurens F, Devaux MF, Lahaye M 
(2012) Texture analysis in an apple progeny through 
instrumental, sensory and histological phenotyping. 
Euphytica 185(2):171-183 

Gisbert AD, Romero C, Martinez-Calvo J, Leida C, Llacer 
G, Badenes ML (2009) Genetic diversity evaluation of 
a loquat (Eriobotrya japonica (Thunb) Lindl) germ- 
plasm collection by SSRs and S-allele fragments. 
Euphytica 168(1):121—134 

Gonzalez—Martinez SC, Wheeler NC, Ersoz E, Nelson 
CD, Neale DB (2007) Association genetics in Pinus 
taeda L. I. Wood property traits. Genetics 
175:399-409 

Gore MA, Chia JM, Elshire RJ, Sun Q, Ersoz ES, Hurwitz 
BL et al (2009) A first-generation haplotype map of 
maize. Science 326(5956):1115—1117 

Goulet C, Mageroy MH, Lam NB, Floystad A, Tieman 
DM, Klee HJ (2012) Role of an esterase in flavor vola- 
tile variation within the tomato clade. Proc Natl Acad 
Sci USA 109(46):19009-19014 

Hernandez-Delgado S, Padilla-Ramirez JS, Nava-Cedillo 
A, Mayek-Pérez N (2007) Morphological and genetic 
diversity of Mexican guava germplasm. Plant Genet 
Res: characterization and utilization 5(03):131—-141 

Hirsch CN, Buell CR (2013) Tapping the promise of 
genomics in species with complex, non-model 
genomes. Annu Rev Plant Biol 64(1):67—73 

Holton T (2010) Mango genomics. Abs PAG XVIII, 2010, 
San Diego 

Iquebal MA, Arora V, Verma N, Rai A, Kumar D (2013) 
First whole genome based microsatellite DNA marker 
database of tomato for mapping and variety identifica- 
tion. BMC Plant Biol 13(1):197 

Jongeneel CV (2000) Searching the expressed sequence 
tag (EST) databases: panning for genes. Brief 
Bioinform 1:76—92 

Kalia RK, Rai MK, Kalia S, Singh R, Dhawan AK (2011) 
Microsatellite markers: an overview of the recent 
progress in plants. Euphytica 177(3):309-334 

Kanupriya, Radhika V, Ravishankar KV (2013) Mining of 
mirnas in pomegranate (Punica granatum L.) by pyro- 


127 


sequencing of part of the genome. J Hortic Sci 
Biotechnol 88(6):735—742 

Kawaura K, Mochida K, Yamazaki Y, Ogihara Y (2006) 
Transcriptome analysis of salinity stress responses in 
common wheat using a 22k oligo-DNA microarray. 
Func Integr Genomics 6(2):132—142 

Khan MA, Korban SS (2012) Association mapping in for- 
est trees and fruit crops. J Exp Bot 63(11):4045—4060 

Kloosterman B, Abelenda JA, Gomez MDC, Oortwijn M, 
de Boer JM, Kowitwanich K, Bachem CW et al (2013) 
Naturally occurring allele diversity allows potato 
cultivation in northern latitudes. Nature 
495(7440):246—250 

Kover PX, Valdar W, Trakalo J, Scarcelli N, Ehrenreich 
IM, Purugganan MD, Mott RA et al (2009) Multiparent 
advanced generation inter-cross to fine-map quantita- 
tive traits in Arabidopsis thaliana. PLoS Genet 
5(7):e 1000551 

Kuhn DN (2014) Development of SNP markers from the 
mango transcriptome for mapping and estimation of 
genetic diversity. In: Plant and Animal Genome XXII 
Conference. Plant and Animal Genome 10-14 Jan, 
San Diego 

Kumar S, Banks TW, Cloutier S (2012) SNP discovery 
through next-generation sequencing and its applica- 
tions. Int J Plant Genomics 2012, 831460 

Li WY, Peng ZP, Yang SH, Yu JH, Huang JC, Wu XN, 
Yang LX (2012) Effects of plant growth-promoting 
rhizobacteria on growth and controlling fusarium-wilt 
disease of banana seedlings. Acta Hort Sinica 
39:234-242 

Li S, Zhao B, Yuan D (2013) Rice zinc finger protein DST 
enhances grain production through controlling 
GnlalOsCKX2 expression. Proc Natl Acad Sci USA 
110(8):3167—3172. doi: 10.1073/pnas. 1300359110 

Mahomed W, Van den Berg N (2011) EST sequencing and 
gene expression profiling of defence-related genes 
from Persea americana infected with Phytophthora 
cinnamomi. BMC Plant Biol 11(1):167 

Manoj KR, Pooja A, Jaiswal VS, Jaiswal U (2010) 
Biotechnological advances in guava (Psidium guajava 
L.): recent developments and prospects for further 
research. Trees 24:1—12 

Matsuba Y, Nguyen TT, Wiegert K, Falara V, Gonzales- 
Vigil E, Leong B, Pichersky E et al (2013) Evolution 
of a complex locus for terpene biosynthesis in 
Solanum. Plant Cell 25:2022—2036 

Matsumura H, Kentaro Y, Shujun L, Eiji K, Takahiro F, 
Zayed A et al (2010) High-throughput SuperSAGE for 
digital gene expression analysis of multiple samples 
using next generation sequencing. PLoS One 
5(8):e12010 

McGhie TK, Rowan DD (2012) Metabolomics for mea- 
suring phytochemicals, and assessing human and ani- 
mal responses to phytochemicals, in food science. Mol 
Nutr Food Res 56(1):147—158 

Ming R, Hou S, Feng Y, Yu Q, Dionne-Laporte A, Saw JH 
et al (2008) The draft genome of the transgenic tropi- 
cal fruit tree papaya (Carica papaya L.). Nature 
452:99 1-996 


128 


Myburg A, Grattapaglia D, Tuskan G, Jenkins J, Schmutz 
J, Mizrachi E et al (2011) The Eucalyptus grandis 
genome project: genome and transcriptome resources 
for comparative analysis of woody plant biology. 
BMC Proc 5(7):20 

Novelli MM, Takita MA, Machado MA _ (2004) 
Identification and analysis of single nucleotide poly- 
morphisms (SNPs) in citrus. Euphytica 138:227—237 

Oliveira EJ, Padua JG, Zucchi MI, Vencovsky R, Vieira 
MLC (2006) Origin, evolution and genome distribu- 
tion of microsatellites. Genet Mol Biol 29:294—307 

Ono NN, Britton MT, Fass JN, Nicolet CM, Lin D, Tian L 
(2011) Exploring the transcriptome landscape of 
pomegranate fruit peel for natural product biosyn- 
thetic gene and SSR marker discovery. J Integr Plant 
Biol 53(10):800-8 13 

Picafol R, Eduardo I, Aranzana MJ, Howad W, 
Batlle I, Iglesias I, Arts P et al (2013) Combining link- 
age and association mapping to search for markers 
linked to the flat fruit character in peach. Euphytica 
190(2):279-288 

Pin PA, Benlloch R, Bonnet D, Wremerth-Weich E, Kraft 
T, Gielen JJ, Nilsson O (2010) An antagonistic pair of 
FT homologs mediates the control of flowering time in 
sugar beet. Science 330(6009): 1397-1400 

Poltri SNM, Borralho N, Rodrigues JC, Elizaul J, de Ortiz 
MV, Cardozo C, Grattapaglia D et al (2011) Biotech 
MERCOSUR project: an integrated genotyping and 
phenotyping platform of Eucalyptus germplasm for 
mapping purposes. BMC Proc 5(7):33 

Powell AL, Nguyen CV, Hill T, Cheng KL, Figueroa- 
Balderas R, Aktas H, Bennett AB (2012) Uniform rip- 
ening encodes a Golden 2-like transcription factor 
regulating tomato fruit chloroplast development. 
Science 336(6089):1711-1715 

Prochnik S, Marri PR, Desany B, Rabinowicz PD, Kodira 
C, Mohiuddin M, Rounsley S (2012) The cassava 
genome: current progress, future directions. Trop 
Plant Biol 5(1):88—94 

Ravishankar KV, Kanupriya, Rekha A, Upadhyay A, 
Vasugi C, Vijayakumari N, Kishnani P, Dinesh MR 
(2013) Omics approaches in tropical fruit crops. In: 
Barh D (ed) OMICS applications in crop science. 
CRC Press, Boca Raton, pp 285-324 

Riedelsheimer C, Lisec J, Czedik-Eysenberg A, Sulpice 
R, Flis A, Grieder C, Melchinger AE et al (2012) 
Genome-wide association mapping of leaf metabolic 
profiles for dissecting complex traits in maize. Proc 
Natl Acad Sci U S A 109(23):8872-8877 

Rikkerink EHA, Oraguzie NC, Gardiner SE (2007) 
Prospects of association mapping in perennial horti- 
cultural crops. In: Oraguzie NC, Rikkerink EHA, 
Gardiner SE, Silva HN (eds) Association mapping in 
plants. Springer, New York, pp 249-269 

Risterucci AM, Duval MF, Rohde W, Billotte N (2005) 
Isolation and characterization of microsatellite loci 
from Psidium guajava L. Mol Ecol Notes 5:745-—748 

Ritter E (2012) Guava biotechnologies, genomic 
achievements and future needs. Acta Horticult 
959:131-140 


K. Chaturvedi and L. Sahijram 


Rowe HC, Renaut S, Guggisberg A (2011) RAD in the 
realm of next-generation sequencing technologies. 
Mol Ecol 20(17):3499—3502 

Rudd S (2003) Expressed sequence tags: alternative or 
complement to whole genome sequences? Trends 
Plant Sci 8:321—329 

Sargent DJ, Fernandéz-Fernandéz F, Ruiz-Roja JJ, 
Sutherland BG, Passey A, Whitehouse AB et al (2009) 
A genetic linkage map of the cultivated strawberry 
(Fragaria X ananassa) and its comparison to the diploid 
Fragaria reference map. Mol Breed 24(3):293—303 

Scaglione D, Acquadro A, Portis E, Tirone M, Knapp SJ, 
Lanteri S (2012) RAD tag sequencing as a source of 
SNP markers in Cynara cardunculus L. BMC 
Genomics 13(1):3 

Schnable PS, Ware D, Fulton RS, Stein JC, Wei EF 
Pasternak S, Liang C et al (2009) The B73 maize 
genome: complexity, diversity, and dynamics. Science 
326(5956):1112—-1115 

Selmer KK, Brandal K, Olstad OK, Birkenes B, Undlien 
DE, Egeland T (2009) Genome-wide linkage analysis 
with clustered SNP markers. J Biomol Screen 
14:92—96 

Sherman A (2014) Creating genomic infrastructure for 
mango breeding. Plant and Animal Genome XXII 
Conference 

Springer NM, Ying K, Fu Y, Ji T, Yeh CT, Jia Y et al (2009) 
Maize inbreds exhibit high levels of copy number 
variation (CNV) and _ presence/absence variation 
(PAV) in genome content. PLoS Genet 5(11):e1000734 

Stélting KN, Nipper R, Lindtke D, Caseys C, Waeber S, 
Castiglione S, Lexer C (2013) Genomic scan for single 
nucleotide polymorphisms reveals patterns of diver- 
gence and gene flow between ecologically divergent 
species. Mol Ecol 22(3):842-855 

Suresh BV, Roy R, Sahu K, Misra G, Chattopadhyay D 
(2014) Tomato genomic resources database: an inte- 
grated repository of useful tomato genomic informa- 
tion for basic and applied research. PLoS One 
9(1):e86387. doi:10.1371/journal.pone.0086387 

Swarupa V, Ravishankar KV, Rekha A_ (2013) 
Characterization of tolerance to Fusarium oxysporum 
f. sp. cubense infection in banana using suppression 
subtractive hybridization and gene expression analy- 
sis. Physiol Mol Plant Pathol 83:1—7 

Swarupa V, Ravishankar KV, Rekha A (2014) Plant 
defense response against Fusarium oxysporum and 
strategies to develop tolerant genotypes in banana. 
Planta 239:735-751 

Tanksley SD, Ganal MW, Prince JP et al (1992) High den- 
sity molecular linkage maps of the tomato and potato 
genomes. Genetics 132(4):1141-1160 

Van K, Rastogi K, Kim K-H, Lee SH (2013) Next- 
generation sequencing technology for crop improve- 
ment. J Breed Genet 45:84—99 

Wang Z, Zhang J, Jia C, Liu J, Li Y, Yin X, Jin Z et al 
(2012) De novo characterization of the banana root 
transcriptome and analysis of gene expression under 
Fusarium oxysporum f. sp. cubense tropical race 4 
infection. BMC Genomics 13(1):650 


6 Plant Molecular Biology Applications in Horticulture: An Overview 129 


Wikstr6m N, Savolainen V, Chase MW (2001) Evolution sequencing approaches to isolate simple sequence 
of the angiosperms: calibrating the family tree. Proc R repeat (SSR) loci in the plant sciences. Am J Bot 
Soc Lond 268:2211—2220 99(2):193—208 

Xu Q, Ling Chen L, Ruan X, Chen D, Zhu A, ChenC etal Zhang H, Sreenivasulu N, Weschke W, Stein N, 
(2013) The draft genome of sweet orange (Citrus Rudd S, Radchuk V, Graner A et al (2004) 
sinensis). Nat Genet 45(1):59-68 Large-scale analysis of the barley transcriptome 

Zalapa JE, Cuevas H, Zhu H, Steffan S, Senalik D, based on expressed sequence tags. Plant J 40(2): 


Zeldin E, Simon P (2012) Using next-generation 276-290 


Nicolas Carels 


Abstract 

We present a comprehensive compilation of genomic structure evolution 
that should help the reader who is not familiar with genomics to under- 
stand the mechanisms that are shaping its structures over time. We believe 
that this understanding is essential to work with genomics in the sense that 
it should help to formulate productive hypothesis for new original works. 
We believe that the mechanism by which the extant genomic structures 
arose 1s more important than the shape of these structures since evolution 
is continuously at work. In addition, taking genomics under the evolution 
perspective gives the possibility to release a unified picture of its unlimited 
natural complexity to the reader without fair to be incomplete. It 1s amaz- 
ing to realize how fast complex biological structures arose in the early 
time of Earth, and it has been our aim to try to give a constructive view of 
the life journey to the reader. 
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Start? Understanding 
the Origin of Modern 
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Life’s major transitions were classified as the (1) 
compartmentalization of replicating molecules in 
the first cells, (11) coalescence of replicating mol- 
ecules to form chromosomes, (111) use of DNA 
and proteins as the fundamental elements of 
genetic code and replication, (iv) consolidation 
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of symbiotic cells to generate the first eukaryotic 
cells containing chloroplasts and mitochondria, 
(v) sexual reproduction involving the production 
(by meiosis) and fusion of haploid gametes, (v1) 
evolution of multicellular organisms from unicel- 
lular ancestors, and (vil) establishment of social 
groups composed of discrete multicellular indi- 
viduals. All these transitions had consequences 
for the organization of genomes and led to the 
development of specific strategies among lin- 
eages. In every major transition, selection favor- 
ing increased levels of biological complexity has 
been opposed by genetic conflicts acting within 
and across levels of biological organization (see 
in Grosberg and Strathmann 2007). It is the man- 
agement of these conflicts across successive gen- 
erations that led cells to display the genome 
structure diversity observed in extant organisms. 
This diversity is so large and displays such intri- 
cated functions that it would not make sense to 
describe it without reference to its evolutionary 
context. Here, we aimed to review in a compre- 
hensive way the challenges that drove the emer- 
gence of genomic structures during their 
evolution from prebiotic chemistry to complex 
multicellular organisms, with particular focus on 
plants. 


7.1.1 Prebiotic Chemistry 


and the RNA World 


The hypothetical emergence of RNA might have 
occurred during the period referred to as prebiotic 
chemistry, approximately 0.5 billon years after 
the formation of the solar system, 1.e., ca. 4.5 bil- 
lion years ago (Gya), by which time Earth’s ocean 
had condensed (ca. 4.4 Gya). Many chemical and 
biochemical studies pertaining to the chemical 
origin of life, which were performed under simu- 
lated prebiotic conditions (Miller 1987), showed 
that cyclic autocatalytic systems mediated by 
small organic molecules might have undergone 
Darwinian evolution in agreement with the local 
conditions prevailing under prebiotic chemistry 
(Davies et al. 2009). This chemical activity is 
expected to have set the foundations for the emer- 
gence of a molecule that was able to encode infor- 
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mation and catalyze functions in a similar way to 
that used by modern ribonucleotide acid (RNA). 
This model offers a solution to the difficulties 
linked to the synthesis of RNA (Bernhardt 2012). 
Dissection of fundamental innovations in the 
molecular machinery of translation revealed that 
translation appeared after the discovery of a large 
number of metabolic functions but before the 
enzymes that are necessary for DNA synthesis. 
The catalysis of a glycosidic linkage between 
nitrogenous bases and ribose according to a puta- 
tive prebiotic chemistry has been recognized as 
problematic (Ricardo et al. 2004). If some prebi- 
otic routes for the synthesis of both pyrimidine 
and purine nucleotides could be found (Glaser 
et al. 2007; Powner et al. 2009), then the abiotic 
synthesis of RNA polymers via oligomer conden- 
sation might have been sufficient to trigger the 
emergence of a replicating system based on RNA 
(Cheng and Unrau 2012). It is this inference that 
is actually expected from the RNA world hypoth- 
esis (coined by Gilbert 1986). According to this 
hypothesis, “the most critical event in life emer- 
gence is the apparition of a self-replicating mole- 
cule with catalytic function that can both copy 
itself and evolve by mutation. It is more parsimo- 
nious to conceive of a single type of molecule rep- 
licating itself than to posit that two different 
molecules (a nucleic acid and a protein capable of 
replicating that nucleic acid) were synthesized by 
random chemical reactions in the same place at 
the same time”’ (Gilbert 1986). 


7.1.2 Peptides First? 


However, peptides might also have preceded 
RNA emergence, or, at least, RNA and proteins 
might have coevolved. This hypothesis is known 
as the proteins (or peptides) first hypothesis 
(Andras and Andras 2005). As shown by Oba 
et al. (2005), peptides with catalytic activity 
could accumulate and participate in the multipli- 
cation of primeval proteins by pseudo-replication 
in a process of repeated drying-heating cycles; 
thus, without the need of an RNA-based 
translation system. Primeval proteins might have 
possessed the catalytic activity to promote the 
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formation of cyanide and purines from amino 
acids (McGlynn et al. 2010). The route of nucleo- 
tide formation from HCN in aqueous solution by 
physico-chemical means is relatively simple 
(Glaser et al. 2007; Roy et al. 2007; Powner et al. 
2009), and it is thought to have contributed to the 
abiotic synthesis of RNA-like polymers via 
oligomer condensation to trigger the emergence 
of a replicating system based on RNA (Cheng 
and Unrau 2012). Interestingly, peptides are syn- 
thesized today by a complex and apparently uni- 
versal protein machinery termed non-ribosomal 
peptide synthetase (NRPS) (Strieker et al. 2010) 
that does not involve RNA molecules and NRPS 
proteins are more ancient than the ribosomal pro- 
teins (Bernhardt 2012). It is unlikely, however, 
that protein synthesis occurred before the advent 
of the ribosome (Bernhardt and Tate 2010). 
However, after coding emerged, the sequences of 
noncoded proteins might have needed to be reca- 
pitulated by coded proteins. Thus, the phyloge- 
netic signal can only go back to the point of 
recapitulation, which draws a veil over the his- 
tory of the protein before the recapitulation point. 
The situation is different for noncoding RNAs 
such as ribosomal RNA and tRNA because these 
molecules were able to replicate prior to the evo- 
lution of ribosomal protein synthesis (Bernhardt 
2012). With phylogenetic trees in hand, it is now 
possible to infer the sequences of ancestral pro- 
teins as they might have been 3 Gya and to study 
their enzymatic activity in the laboratory using 
recombinant DNA technology. Such reconstruc- 
tion makes sense because successful adaptations 
tend to persist on large time scales. Evolutionary 
descent follows a path involving finite steps, but 
each descendant is continuously linked to its 
antecedents by individual practical changes in a 
genetic text (Yarus 2011). 


7.1.3 The Primordial Replicator 


In the context of prebiotic chemistry, it is inter- 
esting to note that a mineral replicator might have 
preceded the rise of RNA (WAachtershauser 1988; 
Orgel 2000; Martin and Russell 2003). 
Montmorillonite clay might have participated to 
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vesicle formation, thereby creating a pathway for 
the prebiotic encapsulation of catalytically active 
surfaces within membrane vesicles 
(Wachtershauser 1988; Cavalier-Smith 2001). 
These vesicles can grow and divide upon agita- 
tion to give daughter protocells carrying newly 
replicated nucleic acids (Hanczyc and Szostak 
2004). Some form of encapsulation is likely to 
have been a key early step in life. Encapsulation 
can protect a genome from degradation and pre- 
dation and allows the cellular concentration of 
useful small molecules, thus enabling natural 
selection (Cech 2012). 

Even if one considers that RNAs were co- 
opted by proteins, it is likely that at least some of 
the cofactors now used by proteins came directly 
from the RNA world, in which they played a sim- 
ilar role in assisting catalytic functions. Actually, 
many naturally occurring RNA riboswitches are 
able to bind protein enzymes as_ cofactors 
(Bernhardt 2012). An appeal of the RNA world 
hypothesis is that it shows that the roles of geno- 
type/replicator and phenotype/catalyst might 
have been played by the same molecule. The pri- 
mordial replicator is expected to have given way 
to RNA ~4 Gya; therefore, the immediate ances- 
tor must have been chemically related to RNA by 
a plausible transformation involving a few atoms 
(Yarus 2011). The earliest replicating polymers 
should have satisfied three primary conditions: (1) 
initial polymers should have had an intrinsic 
mechanism to facilitate their replication either 
abiotically or biotically, (41) monomers should 
have been easily synthesized and polymerized 
abiotically, and (111) the resulting polymers should 
have been able to promote a wide range of chem1- 
cal reactions (Fig. 7.1). Systematic investigation 
of naturally occurring nucleic acid analogs led to 
the identification of threose nucleic acid (TNA) 
(Wachtershauser 1988; Eschenmoser 2004). 
TNA pairs with itself and with RNA and is much 
more stable in water than RNA. Therefore, TNA 
might potentially serve as a template in nonenzy- 
matic template-directed RNA oligomerization. 
Thus, a transition from a TNA world (based on a 
primitive replicating system, with relatively sim- 
ple chemistry involving a stable phosphodiester 
backbone) to an RNA world is theoretically 
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Fig. 7.1 Model of the putative primordial replicator. (a) 
Cross-replication of RNA ligase ribozymes: a ligase ribo- 
zyme (gray and black) catalyzes the ligation of white and 
crossed oligonucleotides (Rz’—1 and Rz’—2) to generate a 
ligase ribozyme that catalyzes the ligation of black and 
gray oligonucleotides (Rz—1 and Rz—2) to regenerate the 
first ligase ribozyme. (b) Ribozyme-catalyzed RNA 
polymerization. The complete extension of an RNA 
primer (black) according to the sequence of a template 


possible (Srivatsan 2004). Glycerol-derived 
nucleic acid analogs, peptide nucleic acids 
(PNAs), and other nonsugar compounds were 
also proposed as ancestral polymer backbones 
that could have acted as molecular supports for 
information (Benner et al. 2012). PNA forms 
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(crossed) by an RNA polymerase produces a RNA duplex. 
(c) The genome replication, 1.e., short double-stranded 
elements of RNA (left and right), is guided by mixed 
priming (center). The asymmetrical transcription of either 
strand of the duplex genomic element results in the syn- 
thesis of multiple genomic element copies together with 
an excess of one strand (Modified from Cheng and Unrau 
(2012)) 


Stable double helices with RNA or DNA, and 
information can be transferred from PNA to RNA 
in a template-directed fashion. Polymers of N-(2- 
aminoethyl)-glycine (AEG) have also been 
hypothesized as possible backbones of PNAs 
because they form in spark-tube experiments and 
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can spontaneously polymerize at 100 °C. In 
addition, AEG polymers are synthesized by 
Cyanobacteria (Banack et al. 2012). 
Cyanobacteria are photosynthetic cosmopolitan 
bacteria that are supposed to have strongly con- 
tributed to the oxygenation of the atmosphere and 
occupy extreme habitats such as geothermal 
pools or hypersaline waters that arose early dur- 
ing Earth’s history. The earliest evidence of 
Cyanobacteria, in Australian fossils, date from 
~3.5 Gya (Schopf 1993). However, the catalytic 
potential of PNA, TNA, and other possible RNA 
analogs has not yet been explored in detail 
(Srivatsan 2004). 


7.1.4 What Does RNA Do Today? 


In contemporary biology, RNA acts as (i) a 
primer in DNA replication; (11) a messenger that 
Carries genetic information to the translation 
machinery (mRNA); (iii) a small noncoding 
RNA (snRNA) in spliceosomal intron splicing, 
which requires the collaboration of ~200 proteins 
and five snRNAs; (iv) self-splicing introns 
(groups I and II); (v) a primer for the telomerase 
in telomere extension; (vi) metabolite signaling 
molecules (riboswitches bind small metabolites; 
they use the binding energy to switch from one 
RNA structure to another and might affect gene 
expression in bacteria and eukaryotes); (vii) 
defenses against pathogens (RNAI); (vili) an 
endonuclease RNA (RNase P) that specifically 
promotes tRNAs maturation; (1x) a transcription 
modulator by switching off gene expression and 
attracting proteins that modify chromatin struc- 
ture (the effect may affect one or few genes or 
can even spread over an entire chromosome as 
with Xist RNA); (x) a component (7SL RNA) of 
the signal recognition particle (SRP) responsible 
for the delivery of proteins to cellular mem- 
branes; (x1) a ribozyme (virtually all redox reac- 
tions in modern organisms can be catalyzed by 
RNA molecules by directed evolution, as with a 
NAD+-dependent redox-active ribozyme mim- 
icking alcohol dehydrogenase activity. 
Ribozymes are also involved in functions of 
information processing, such as editing, ligation, 
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polymerization, and_ replication); (xii) a 
modulator of gene expression (small double- 
stranded RNAs, e.g., 21-bp small interfering 
RNAs and microRNAs regulate the stability or 
translatability of mRNAs); (xii1) an active factor 
in the highly condensed state of heterochromatin 
at chromosome centromeres; (xiv) weapons 
(siRNA) to circumvent host defense by viruses or 
to manipulate host cellular machinery; (xv) a 
peptidyl transferase; (xvi) a mRNA start site; 
(xvii) codon—anticodon interaction agents; and 
(xviii) decoding centers in ribosomal operation. 


7.1.5  SELEX: A Synthetic RNA World 


It is generally considered that to be an acceptable 
model, the RNA world must support Darwinian 
evolution in the laboratory (Benner et al. 2012). 
One of the questions regarding the RNA world 
asks which mechanisms might have acted as a 
foundation for a primeval replication system suit- 
able for Darwinian evolution. Guide RNAs, hav- 
ing the reverse complement of some target 
sequence, could have been very useful early in 
evolution and might have directed RNA ligase 
ribozymes to specifically ligate short RNA frag- 
ments in a sequence-specific fashion. Precisely, 
RNA genetic material might have been formed of 
RNA exons held together by self-splicing RNA 
introns. Intron out-splicing would have tied the 
exons together to make functional ribozymes 
(Fig. 7.2). This view (1) provides a way of distin- 
guishing information storage (genome) from 
chemical reaction catalysis (ribozyme), (11) pro- 
vides a mechanism for enhanced illegitimate 
recombination by the RNA synthetase occasion- 
ally jumping from one molecule to another dur- 
ing replication and thus creating recombinant 
products (Darwinian evolution) (in addition, two 
introns surrounding an RNA exon can cut across 
the exon and construct a transposon that can then 
enter an intron in some other molecule — Gilbert 
and de Souza 1999), and (111) allows the reduction 
of combinatorial space to be screened by looking 
for complex RNA functions among sequences 
constructed from the association of short RNA 
exons (30-40 bases long). 
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Fig. 7.2 Model of putative 
ancestral RNA gene with 
intron—exon structure. 
Type I introns might splice 
out to produce a ribozyme 
or might splice together to 
carry an intervening exon 
to a new position in a novel 
gene, 1.e., transpose 
(Modified from Gilbert and 
de Souza (1999)) 


Transposition 


By analogy with proteins, a RNA enzyme 
(ribozyme) might need to be larger than 300 
nucleotides to carry out a complex function. 
Considering 300-base ribozymes, a selection 
process for a given catalytic activity would 
require more than 4°° molecules to be screened. 
One way to circumvent this problem is to hypoth- 
esize that the enzymatic activity is the result of 
the combinatorial effect of short RNA pieces fol- 
lowing an intron—exon structure (Gilbert and de 
Souza 1999). Currently, it is considered that if the 
best molecules in a library are present at a fre- 
quency of 10°°-10-" or less, a starting random 
library must be of the order of 10'*-10" mole- 
cules (Gold et al. 2012). In vitro RNA selection 
provides the opportunity to explore the repertoire 
of RNA-catalyzed reactions and has been suc- 
cessfully used to isolate many new ribozymes. 
These ribozymes include catalysts for glycosidic 
bond formation in nucleotide synthesis, RNA 
polymerization, the aminoacylation of transfer 
RNA, peptide bonds, the Diels—Alder reaction, 
hydroxyl phosphorylation, and the Michael reac- 
tion (see in Srivatsan 2004). Laboratory-induced 
ribozyme evolution using the systematic evolu- 
tion of ligands by exponential enrichment 
(SELEX) technique (Wright and Joyce 1997) led 
to the isolation of both pyrimidine nucleotide and 
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purine nucleotide synthase ribozymes (Lau et al. 
2004). Catalytic RNA activities that might have 
been needed in an RNA world, but have not been 
found in the RNA pool of contemporary organ- 
isms, are generally already present in large com- 
binatorial libraries of RNA sequences and can be 
discovered using SELEX (Cheng and Unrau 
2012). Ribozymes are comprised of sensing 
ligands and a catalytic platform. The sensing 
ligands are termed “aptamers” (derived from the 
Greek word aptus, “to fit”) and are the output of 
the SELEX process (Gold et al. 2012). 

Due to the establishment of the RNA world, 
improvements in RNA polymerization might 
have enabled the gradual concatenation of RNA 
genomic elements into longer fragments. This 
process only requires two replication activities: 
RNA ligation and RNA polymerization to tran- 
scribe and replicate short RNA genomic ele- 
ments. Because the phosphodiester chemistry 
mechanisms of RNA ligation and polymerization 
are related, their emergence from an abiotic sys- 
tem might be evolutionarily linked and thus pro- 
vide a simple route to biological replication 
(Cheng and Unrau 2012). Initial efforts during 
the 1990s led to the isolation and structural 
characterization of the highly efficient class 
I ligase ribozyme (Bartel and Szostak 1993) and, 
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subsequently, DSL ligase (Voytek and Joyce 
2007). A synthetic ribozyme polymerase (B6.61) 
that is able to extend a primer-template duplex by 
20 nucleotides (Zaher and Unrau 2007) was then 
developed in the laboratory. A ribozyme repli- 
case that is able to replicate RNA stretches of 95 
nucleotides has also been created in the labora- 
tory (Wochner et al. 2011). Replication of nucleic 
acid polymers always involves the synthesis of 
sense and antisense strands following canonical 
base-pairing rules. 

Fundamentally, replication is a copying pro- 
cess whose symmetry is broken during transcrip- 
tion where a single strand is over-copied in 
several fragments to allow gene expression. A 
significant advantage of dsRNA over ssRNA or 
folded RNAs is that the dsRNA duplex has a uni- 
form and predictable double helix that makes it 
easily recognizable as a genetic component 
(genome) by replicative enzymes, thus forming a 
basis for the evolution of a repairing system. 
Thus, a synthetic genomic system would involve 
the combination of transcriptional asymmetry 
and the replication and ligation of short RNA 
genomic elements that would produce a self- 
consistent system of RNA replication capable of 
simultaneous expression of metabolic ribozymes 
(Cheng and Unrau 2012). One of the aims of the 
investigations described above was to create cel- 
lular cooperative systems that are capable of arti- 
ficial RNA evolution (Bartel and Unrau 1999) for 
use in validating the RNA world hypothesis. If 
such a system was constructed in the laboratory, 
it would be much easier to understand the chal- 
lenges associated with the very earliest steps of 
the complex metabolic systems that now domi- 
nate the Earth (Cheng and Unrau 2010). For now, 
RNA molecular interactions can be inferred from 
in silico simulations (Tinoco et al. 2010) because 
the prediction of RNA secondary and tertiary 
(3D) structures has improved dramatically in 
recent years (Mathews et al. 2010). 


7.1.6 Protocells 


Ancestral to the ast universal common ancestor 
(LUCA), a primeval organism based on RNA 
would already contain proteinaceous material in 


137 


the form of oligopeptides to support RNA 
enzymes. It is now clear that the formation of 
peptide bonds in modern cells is catalyzed by 
RNA moieties of the ribosome (Moore and Steitz 
2002). Therefore, the emergence of the ribozyme 
ancestor of today’s ribosomes was a crucial tran- 
sitional step in the history of the RNA world, 
together with the establishment of an earliest ver- 
sion of the present genetic code (Forterre 2005). 
Peptide synthesis would have begun with a few 
amino acids and a single transfer/activating 
RNA. A_ simple hairpin duplication—ligation 
mechanism has been proposed for the ancestor of 
tRNA by Di Giulio (1992). The number of amino 
acids would have extended to 20 during the tran- 
sition from an RNA world to an RNA-protein 
world (Gilbert and de Souza 1999). Gradually, 
protein chains would have increased in complex- 
ity and replaced ribozymes in a number of bio- 
chemical processes. Even if amino acids and 
short peptides were already present in prebiotic 
times, large self-folding proteins could only have 
appeared after the emergence of a primeval ribo- 
somal system. 

RNA clearly preceded DNA because multiple 
enzymes are dedicated to the biosynthesis of the 
ribonucleotide precursors of RNA, whereas 
deoxyribonucleotide biosynthesis is a derivative 
of ribonucleotide synthesis and requires only two 
additional enzymatic activities (thymidylate syn- 
thase and ribonucleotide reductase). DNA took 
over the role of genome approximately | Gya. 
The LUCA already contained a DNA genome 
and carried out biocatalysis using both protein- 
based and ribonucleoproteic (RNP) enzymes 
(such as the ribosome) and ribozymes (Cech 
2012). Life radiation would have been impossi- 
ble without the cellular confinement of metabo- 
lism. Several membrane-related proteins were 
already present in the LUCA, and this can be 
considered a definitive argument that the LUCA 
was formed by cellular organisms. The RNA 
world should be seen as a biosphere containing 
cells with RNA genomes (RNA cells); this world 
ended when all of their cellular descendants were 
eliminated by Darwinian competition from newly 
emergent cells with DNA genomes (DNA cells). 
Selective pressures would have induced organ- 
isms to modify their genomes to protect them 
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against RNAses from hostile competitors. The 
higher stability of DNA (compared to RNA) 
would have allowed the genome size to increase 
during evolution, in turn allowing the evolution 
of new metabolic functions in DNA cells beyond 
those that existed in RNA cells (Forterre 2005). 


7.2 The Genetic Code and Genes 


7.2.1 Origin of Codons 

Proteins are chains of amino acids that constitute 
most parts of the cell machinery. Together, 20 
amino acid types are present in proteins, and a 
typical protein comprises a string of several hun- 
dred amino acids. Each amino acid is encoded by 
a three-base nucleotide sequence termed a codon. 
With four different letters in the DNA alphabet, 
4x4x4=64 different codons are possible; 61 of 
these specify amino acids, and three represent 
stop. The question of how this genetic code origi- 
nated in a primordial world remains open. 

A curious observation made during the appli- 
cation of SELEX is that aptamers selected to bind 
arginine (Arg) in different laboratories using dif- 
ferent protocols are predominantly composed of 
Arg codons (Knight and Landweber 1998). This 
finding, together with the finding that experi- 
ments simulating the early Earth’s atmosphere 
(Miller 1987) yielded as many as ten different 
natural amino acids, lays the foundation for a 
theory that the genetic code evolved under pri- 
mordial conditions (Wachtershauser 1988; Orgel 
2000) still dominant in modern proteins (Carels 
and Ponce de Leon 2015). 

Alanine (Ala) and glycine (Gly), the most 
abundant amino acids resulting from Miller’s 
experiments, are encoded today by the comple- 
mentary triplets GCC and GGC, respectively; 
this finding suggested the hypothesis that com- 
plementarity and thermostability were important 
for the evolution of the triplet code (Eigen and 
Schuster 1977, 1978a, b). Presumably, the earli- 
est minigenes carried messages encoding mini- 
proteins based on two independent alphabets: a 
Gly alphabet encoding Gly strands and an Ala 
alphabet encoding Ala strands. At a later stage, 
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minigenes fused together in longer sequences, 
and the resulting sequences had a mosaic struc- 
ture comprising short patches of the two alpha- 
bets (Trifonov 2004). AGE aptamers recognizing 
the amino acids contained in the prebiotic soup, 
as defined by Miller’s experiment, and a primeval 
code of 3 AGE letters might have been the start- 
ing point of a pre-RNA world genetic system. As 
seen above, the enzymatic activity of AGE or any 
other type of polymer easily catalyzed under pri- 
meval Earth conditions remains to be demon- 
strated. If such polymers do, like ribozymes, have 
enzymatic activity, they might have been ances- 
tral candidates for a pre-RNA world genetic code. 
Riboswitch-like molecules would progressively 
have captured more and more amino acids from 
the prebiotic soup and evolved together with their 
substrate: the primeval genetic code. Ribozyme- 
like molecules could have carried out the peptide 
synthesis of these primeval proteins. A GNC 
code (G for guanine, N for any of the four nucle- 
otides, and C for cytosine) is sufficient to encode 
the four amino acids Gly, Ala, aspartate (Asp), 
and valine (Val) able to form globular, water-sol- 
uble proteins with o-helix, B-sheet, and 6-turn 
secondary structures (Ikehara 2009) that are 
capable of catalytic activities (Oba et al. 2005). 
According to I[kehara (2009), this primitive code 
would have evolved first to a code linking 16 
codons and 10 amino acids, the so-called SNS (S 
for strong: G or C), and then the RNY (R for 
purines, Y for pyrimidines) ancestral codons 
(Carels and Frias 2013). 

The chronology of amino acid—codon pair 
association would have followed the order: Gly, 
Ala, Asp, Val, proline (Pro), serine (Ser), gluta- 
mate (Glu) [leucine (Leu), threonine (Thr)], Arg 
[isoleucine (Ile), glutamine (Gln), asparagine 
(Asn)], histidine (His), lysine (Lys), cysteine 
(Cys), phenylalanine (Phe), tyrosine (Tyr), 
methionine (Met), and tryptophan (Trp) (“[’? and 
‘*/’ identifies the amino acids that appeared at the 
same time). The genetic code would have evolved 
toward higher complexity by successive amino 
acid capture according to successive transition 
steps in Gly-strand and Ala-strand modes. A 
purine transition at the 2nd position of the Gly 
codon GGC would have successively led to GAC 
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and GAG to capture Asp and Glu, respectively. 
Arg would have been captured by the CGC 
codon, which is complementary to the Ala codon 
GCG on the Ala strand. Up to this point, the Gly 
strand would have encoded Gly, Asp, Glu, and 
Arg, whereas the Ala strand would have encoded 
Ala and Val. The most frequent transitions 
(purine to purine and pyrimidine to pyrimidine) 
would have left the N-purine-N and 
N-pyrimidine-N triplet structures unchanged. All 
later stages in codon evolution would have 
required changes at the third position and, conse- 
quently, in the first positions of the codons on the 
complementary strand. The two alphabets Gly, 
Asp, Glu, Arg, Ser, Gln, Asn, His, Lys, Cys, Tyr, 
Trp (the Gly alphabet) and Ala, Val, Pro, Ser, 
Leu, Thr, Ile, Phe, Met (the Ala alphabet) would 
have successively appeared in descending order 
of thermostability (Trifonov 2004). Tryptophan 
and cysteine (as well as selenocysteine and pyr- 
rolysine) codons were likely borrowed from a 
putative UGN repertoire of terminators 
(Trifonov 2004). 


Fig. 7.3 Physicochemical 
properties of proteins that 
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Modern proteins follow an amino acid bias 
that is led by physicochemical constraints 
(hydropathy and secondary structures) on pro- 
teins, highly correlated with a linear regression 
with thymidine in the 2nd codon position (T2). 
The choice of secondary protein structures for a 
given catalytic activity depends on the energy 
available in the biological system because the 
energy cost of secondary protein structures fol- 
lows the series: strand > helix > turn. In contrast, 
the amino acid choice is optimized in such a way 
that the minimal energy cost (the number of 
chemical bonds in the amino acid residue) of 
amino acid synthesis is associated with a larger 
purine content in the Ist position (R1) than in 
codon positions 2 or 3 (Ponce de Leon et al. 
2014, Fig. 7.3). Thus, one can conclude that the 
RNY (or Rrr, Carels et al. 2009) ancestral codon 
reflects selective pressures on protein functional- 
ity, energy costs and thermodynamic constraints 
on the translation machinery (Ponce de Leon 
et al. 2014). Protein-coding DNA sequences can 
be distinguished from nonprotein-coding DNA 
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based on the purine bias (Rrr) across codons 
(Carels and Frias 2009, 2013). Because the codon 
structure of CDSs is a universal pattern across the 
biosphere, CDSs are the easiest DNA motifs to 
classify; approximately 95 % of folding proteins 
are larger than 100 amino acids (300 bp), corre- 
sponding to the minimal size required to detect a 
coding frame among the six possible frames with 
a Statistical consistency of approximately 95 % 
(Carels and Frias 2009, 2013). 


7.2.2 A Definition of Genes 


The current understanding of a gene is that it rep- 
resents a basic unit of genetic information that 
controls a specific aspect of a phenotype. 
According to this definition, which is operational 
in the field of selective breeding, one must con- 
sider the 5’ and 3’ noncoding regions as part of a 
gene at the molecular level because these regions 
are involved in the regulation of CDS transcrip- 
tion and translation (see Gerstein et al. 2007). 
Given the complexity of the gene regulation and 
expression processes, a full explanation of the 
molecular consequences of the gene concept can 
be eventually confusing; Fig. 7.4 that depicts how 
genes are translated into phenotype effects. 

In many cases, the multiple alignments of pro- 
tein sequences sharing a given function across the 
three superkingdoms (Bacteria, Archaea, and 
Eukarya) show a patchwork of similar and dissimi- 
lar stretches suggesting that similar stretches are 
homologous, but dissimilar stretches are not. This 
observation suggests that modern genes considered 
homologous today might have been assembled 
from shorter primeval polyphyletic elements 
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during the formation of the LUCA by a process of 
mini-gene_ trans-splicing (Di Giulio 2008). 
According to this hypothesis, the translation of 
mRNAs from mini-gene patchworks resulted in 
proteins whose genes (DNA) would have evolved 
much later, 1.e., after the divergence of the three 
superkingdoms (Di Giulio 2006). In this way, one 
can explain why multiple alignments of the same 
protein from organisms belonging to the three 
domains of life can be obtained and why they 
apparently define a monophyletic origin of genes. 
The hypothesis of a polyphyletic origin of genes 1s 
consistent with Gilbert’s proposition (1986), pre- 
sented above, that genes evolved as a former patch- 
work of exon/introns; thus, the appearance of large 
coding sequences (CDS) was only enabled by 
intron excision, a phenomenon still largely 
exploited by eukaryotes. It would not be surprising 
that trans-splicing (still in operation in protozoa) 
would have been a major player in the RNA world 
and that CDSs from exon shuffling might have 
been easily exchanged among protocells (Hanczyc 
and Szostak 2004) as plasmids are still exchanged 
among bacteria through conjugation. 


7.2.3 Introns and Spliceosome 


Machinery 


Introns are noncoding intervening sequences that 
are not part of the coded protein message and 
must be removed from RNA transcripts to pro- 
duce functional mRNAs. Active mRNAs are 
comprised of exons concatenated into CDSs by 
splicing out introns from pre-mRNAs. 

Group I and II introns are found in some bac- 
terial and organelle genomes, and group I introns 
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are also found in the ribosomal RNAs (rRNAs) of 
protist and fungal nuclei. These two groups have 
distinct RNA structures that facilitate their self- 
splicing activity. These introns may also contain 
internal ORFs, facilitating both their splicing 
from RNA transcripts and propagation to intron- 
less sites by reverse transcription. In_ total, 
approximately 1,500 group I and 200 group II 
introns have been identified. The original inva- 
sion of self-splicing Group II introns, presumably 
originating from the mitochondrial endosymbi- 
ont, into the genome of the emerging eukaryote 
might have led to alternative splicing, a major 
contribution to the biological complexity of mul- 
ticellular eukaryotes (Rogozin et al. 2012). 
Spliceosomal introns are present in the nuclear 
genomes of all characterized eukaryotes but 
absent in prokaryotes. Their numbers and lengths 
vary widely between species, from tens of bases 
in some protists to hundreds of kilobases in mam- 
mals. The spliceosome is a complex (Ritchie 
et al. 2009) that comprises five small nuclear 
RNAs (snRNAs) and hundreds of proteins. Each 
snRNA is associated to proteins to form a small 
nuclear ribonucleic particule (snRNPs). 
Surprisingly, two  spliceosomal complexes 
emerged before the origin of multicellularity. The 
major spliceosome, a complex RNA-—protein 
machine containing the U1, U2, U4/U6, and U5 
snRNPs, splices the classic set of introns that 
usually start with GT and end with AG (Fig. 7.5). 
By contrast, the minor spliceosome involves four 
specific snRNPs and the U5 snRNP, which is 
shared by both spliceosomes. The minor spliceo- 
some processes a different set of introns, many of 
which start with AT and end with AC, that is, 
introns that are ~300 times less frequent than 
those processed by the major spliceosome (Steitz 
et al. 2008). 

Prokaryotes experienced a massive intron loss 
since the LUCA; it has been proposed that this 
loss is one of the many outcomes of a determinis- 
tic trend toward the reduction of the number of 
rate-limiting factors in gene expression that is 
determined by a strong selection pressure for 
rapid reproduction and/or survival at higher tem- 
peratures (Rodriguez-Trelles et al. 2006). In 
eukaryotes, 20-68 % of introns are specific to 
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each species; 25 % of human introns are found at 
the same positions as those in orthologous genes 
from Arabidopsis. It 1s also clear from the homol- 
ogous comparison of genes in maize and 
Arabidopsis that GC-poor and GC-rich genes are 
under different selective constraints (Carels and 
Bernardi 2000a). The bias of GC-poor genes 
toward high intron number has been attributed to 
the fact that the regulatory functions of intron 
sequences depend on specific AT constraints for 
correct splicing efficiency (Carels 2005a). Such 
constraints would justify why position, but not 
size (correlated to genome size), 1s conserved 
between distant species. The classical view 1s that 
intron sequences are subject to weak purifying 
selection at best or evolve in a regime that is 
indistinguishable from neutral evolution. 
However, as noted by Rogozin et al. (2012), “this 
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Fig. 7.5 Splicing of pre-mRNA by the major spliceo- 
some. Conserved motifs are found at the 5’ and 3’ intron 
extremities as well as at the branching region (A). 
Stepwise assembly of Ul, U2, U4/U6-U5 snRNPs with 
pre-mRNA introns at conserved regions forms the spli- 
ceosome, which catalyzes sequential transesterification 
reactions leading to the intron splicing out as a Lariat 
product and the assembly of contiguous exons in a coding 
sequence 
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(nearly) neutral background of intron evolution 
does not rule out the possibilities that, first, the 
very presence of introns affects the regulation of 
expression of the respective genes (presumably 
through the interaction with the splicing machin- 
ery) and hence their function, and second, that 
many introns harbor specific functional elements. 
Indeed, there is abundant evidence that introns 
are often functional at both levels.’ These authors 
reviewed the functions associated with (1) splic- 
ing (splicing and mRNA export are directly cou- 
pled), (41) alternative splicing and exon shuffling, 
and (111) genes nested within introns (for instance, 
miRNA or IncRNA genes — Yang et al. 2012; 
Geisler et al. 2012). 


7.2.4 Regulatory Sequences 


Eukaryotes employ diverse mechanisms to regu- 
late gene expression, including chromatin con- 
densation, DNA _ methylation, transcriptional 
initiation, alternative RNA splicing, difference in 
mRNA stability, translational controls, several 
forms of post-translational modification, intracel- 
lular trafficking, and protein degradation. Genes 
encoding regulatory proteins possess some of the 
most complex expression profiles. In metazoans 
and metaphytes, most such genes are expressed 
in several distinct domains (see in Wray et al. 
2003). Although the transcription profiles of 
housekeeping genes are generally much simpler, 
most genes are transcribed at different levels 
among different cell types. 


7.2.4.1 Transcription Factors and Their 
DNA Targets 

The function of a promoter is to integrate infor- 
mation about the status of the cell and to tune the 
rate of transcription of a single gene or operon 
accordingly. The core promoter is a ~100 bp 
region whose function is to provide a docking 
site for the transcription complex and to position 
the start of transcription relative to CDSs (Lee 
and Young 2000). The organization of promoters 
is much less regular than that of CDSs, and no 
consistent sequence motifs exist for the promot- 
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ers of protein-coding genes in eukaryotes. 
However, two functional features are always 
present: a core promoter and a collection of 
diverse transcription factor binding sites that tune 
specificity and the level of transcription. 

Eukaryotic genes that encode proteins are 
transcribed by the RNA polymerase II holoen- 
zyme complex, which is composed of 10—12 pro- 
teins (Lee and Young 2000). Core promoter 
sequences differ among genes. For many genes, 
the critical binding site is a TATA box, usually 
located approximately 25-30 bp 5’ of the tran- 
scription start site. However, most genes lack a 
TATA box, although some of these may contain 
an initiator element spanning the transcription 
start site or neither a TATA box nor an initiator 
element. Some core promoters may also contain 
additional protein binding sites for general tran- 
scription factors (Fig. 7.6). A gene may have 
more than one basal promoter, each of which in1- 
tiates transcription at a distinct position, and both 
TATA and TATA-less basal promoters can be 
associated with alternate start sites of the same 
gene (Goodyer et al. 2001). 

The complement of active transcription fac- 
tors within the nucleus differs in response to 
environmental conditions during development 
across regions of the organism and among cell 
types. This changing array of transcription fac- 
tors provides nearly all of the control over when, 
where, at what level, and under what circum- 
stances a particular gene is transcribed (Wray 
et al. 2003). 

Most transcription factor binding sites span 
3-8 bp. Given that there are many different tran- 
scription factors and that each may bind many 
different short sites, every kilobase of genomic 
DNA contains many potential transcription factor 
binding sites on the basis of similarity (Stone and 
Wray 2001). However, many of these consensus 
matches do not bind protein in vivo and have no 
influence on transcription. Due to DNA looping 
between proteins that are associated with DNA 
sequences at distant binding sites, the physical 
extent of cis-regulatory regions varies by a few 
hundred base pairs to ~100 kb. The position of 
transcription factor binding sites relative to the 
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Fig. 7.6 Promoter structure and function in eukaryotic 
genes. (a) Relative position of the transcription unit, basal 
promoter region (black box), and transcription factor 
binding sites (bars). (b) Model of a promoter in operation. 
The initiation of transcription requires several dozens of 


transcription unit also differs enormously among 
genes. Transcription factor binding sites often lie 
within a few kb 5’ of the core promoter. 

All proteins that regulate transcription directly 
or indirectly influence the frequency with which 
the polymerase I] complex assembles onto the 
basal promoter. This influence occurs through a 
wide variety of protein-protein interactions. 
Some transcription factors maintain local chro- 
matin in a decondensed state whereas others con- 
dense it (Chen and Courey 2000). Chromatin 
remodeling is highly dynamic and is apparently 
regulated on spatial scales as small as promoters 
or even regions within a promoter. Condensed 
chromatin impairs the docking of the RNA poly- 
merase II complex and blocks transcription in the 
absence of specific transcription factors. Because 
transcription is inactivated by default, all promot- 
ers contain binding sites for transcription activa- 
tors, but only some contain binding sites for 
negative regulators (see in Wray et al. 2003). 

Clusters of approximately 6—15 binding sites 
for 4-8 different transcription factors may oper- 
ate as functionally coherent segments termed 
enhancers (Arnone and Davidson 1997). The 
terms enhancer, booster, activator, insulator, 
repressor, locus control region, upstream activat- 
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interacting proteins, including the RNA polymerase II 
holoenzyme complex, TATA-binding protein, general 
transcription factor proteins (TAF), transcription factors, 
transcription cofactors, and chromatin remodeling com- 
plexes (Modified from Wray et al. (2003)) 


ing sequence, and upstream repressing sequence 
all refer to various types of regulatory sequences 
with specific features. Activators and repressors 
can function as Boolean (off/on) or scalar (quan- 
titative) elements whose interactions have pre- 
dictable, additive effects on transcription (Tuch 
et al. 2008). Deleting one of these regulatory 
sequences often eliminates a specific aspect of 
the expression profile without disrupting the 
remainder. In addition, predictable artificial 
expression profiles can be constructed by com- 
bining modules from different promoters. 
Promoters integrate multiple diverse inputs and 
produce a single scalar output, 1.e., the rate of 
transcriptional initiation. In many promoters, sig- 
nal integration occurs at the basal promoter 
through specific interactions between bound 
transcription factors and components of the RNA 
polymerase II enzyme complex (Lee and Young 
2000). 

Most transcription factors directly regulate a 
few percent of the genes. Genetic networks are 
therefore highly connected, and each node that is 
represented by a transcription factor is linked to 
many other nodes. This high degree of connectiv- 
ity may be responsible in large part for the classi- 
cal genetic phenomena of epistasis, polygeny, 
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and pleiotropy. The expression profile of a gene 
is a System property; thus, even if a mutation in a 
promoter region alters transcription, the network 
of functionally interacting genes and gene prod- 
ucts may modulate this effect. Feedback loops 
are rather common components of gene networks 
(Lee et al. 2002) and may mask some function- 
ally significant mutations in promoters. The num- 
ber of functional noncoding nucleotides is 
approximately equal to the number of protein- 
coding nucleotides, and approximately half of all 
functional or phenotypical molecular evolution 
involves non-CDSs. Thus, a substantial fraction 
of a eukaryotic genome is devoted to extracting 
information from itself. Because transcriptional 
regulation conditions the way in which genotype 
is converted into phenotype, functional genetic 
variations in promoter sequences within popula- 
tions are sorted by selection. Many mutants that 
have emerged from genetic screens involve quan- 
titative effects on transcriptional regulation and 
are constituents of quantitative trait loci (QTL). 

The gradual modification of transcription cir- 
cuits over evolutionary timescales is an important 
source of the diversity of life. Natural selection 
may operate on CDS expression and phenotype 
morphology by promoting sequence variations at 
promoter sites through (1) negative (purifying) 
selection; (11) positive selection, where some pro- 
moter alleles appear to be under directional selec- 
tion; and (111) overdominant selection, where 
similar clusters of transcription factor binding 
sites are sometimes present in the promoters of 
orthologous genes of species that diverged up to 
10’-10® years ago (Tiimpel et al. 2002). In con- 
trast, promoter sequences can also diverge exten- 
sively among even relatively closely related 
species by including gains or losses of multiple 
binding sites and changes in the position of regu- 
latory sequences relative to the transcription start 
site. Regulatory sequences inserted into promot- 
ers through transposition may exert an influence 
on transcription. Through exonization, retroposi- 
tion can create novel genes that are subsequently 
expressed (Wang et al. 2002). Genes that arise 
through retroposition are often expressed in 
tissue-specific patterns similar to those of a par- 
ent locus (Bétran et al. 2002). 
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7.2.4.2 Noncoding RNAs 

Noncoding RNA (ncRNAs) genes produce tran- 
scripts that function as RNA, such as ribosomal 
RNA (tRNA) and transfer RNA (tRNA), rather 
than encoding proteins. On the basis of their 
length and genomic locations, ncRNAs can be 
further classified as (1) small ncRNAs including 
miRNAs and small interfering RNAs (siRNAs), 
(11) natural antisense transcripts (NATs), (111) long 
intronic noncoding RNAs (incRNAs), and (iv) 
long intergenic noncoding RNAs (lincRNAs). 
RNAs in the last three categories are at least 200 
nucleotides (nt) or longer, and they are referred to 
as long noncoding RNAs (IncRNAs). Several 
IncRNAs have biological function as regulators 
of gene expression both transcriptionally and 
posttranscriptionally. Similar to most mRNAs 
transcribed by RNA polymerase II, IncRNAs are 
both capped and polyadenylated. It has been pro- 
posed that IncRNAs are used as a means to tightly 
maintain repression at inducible genes and that 
efficient clearance of the IncRNA by decapping 
through DCP2 (decapping proteins) is vital for 
robust gene activation (Geisler et al. 2012). In 
plants, thousands of IncRNAs were identified in 
Arabidopsis. A large number of these IncRNAs 
are responsive to abiotic stresses. Genomic loci 
of many IncRNAs are associated with histone 
modifications and DNA methylations suggesting 
an epigenetic regulation of these loci. In addition, 
some sense and antisense double-stranded RNAs 
involving IncRNA partners are processed by the 
RNA interference machinery into siRNAs (Jin 
et al. 2013). 

The category of small ncRNAs (smRNA) 
encompass ~20—30 nt RNA molecules that have 
emerged as critical regulators in the expression 
and function of eukaryotic genomes including 
chromatin structure, chromosome segregation, 
transcription, RNA processing, RNA stability, 
and translation. Short interfering RNAs (siR- 
NAs) and microRNAs (miRNAs) act to regulate 
endogenous genes and to defend the genome 
from invasive nucleic acids (virus, mobile ele- 
ments). The effects of small RNAs on gene 
expression and control are generally inhibitory, 
and the corresponding regulatory mechanisms 
are referred to as RNA silencing. smRNAs serve 
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as specificity factors that direct bound effector 
proteins to target nucleic acid molecules via 
base-pairing interactions. The core component 
of the effector machinery is a member of the 
ARGONAUTE protein superfamily that mainly 
functions in eukaryotes. Single-cell organisms 
such as the green algae Chlamydomonas rein- 
hardtii also produce miRNAs (Molnar et al. 
2007). The common features of miRNAs and 
siRNAs in algae and protists indicate that com- 
plex RNA-silencing systems evolved before 
multicellularity and were a feature of the LUCA 
(Makarova et al. 2009). From the three main cat- 
egories of smRNA — siRNAs, miRNAs, and 
piwi-interacting RNAs (piRNAs) — only siRNAs 
and miRNAs are present in plants. These 
smRNAs are only known to be present in eukary- 
otes, although the ARGONAUTE proteins (or 
their CRISPR protein homologues) can also be 
found scattered in prokaryote species. Thus, the 
boundaries between the various small RNA 
classes are becoming increasingly difficult to 
discern. siRNAs and miRNAs are the most 
broadly distributed in both phylogenetic and 
physiological terms and are characterized by the 
double-stranded nature of their RNA precursors 
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Fig. 7.7 Biogenesis of miRNAs and assembly into 
miRISC in plants. Nuclear transcription leads to capped 
and polyadenylated pri-miRNAs. Dcll processes the 
RNA in succession. The terminal loop may first be excised 
or it might be the flanking segments that are cleaved first. 
The second processing step by Dcll yields a mature 
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(dsRNA). They trigger the silencing of genes 
with homologous sequences through RNAi. 

A transcript may encode clusters of distinct 
miRNAs, or it may encode a miRNA and a protein. 
The latter type of transcript is organized in such a 
way that the miRNA sequence is located within an 
intron (see in Carthew and Sontheimer 2009). 
Given the small size of intronic sequences in plants, 
intronic miRNAs in rice and Arabidopsis are coex- 
pressed with their host genes. Exonic miRNAs are 
mostly found in retrotransposon exonic sequences 
and participate to their silencing (Yang et al. 2012). 
A transcript is then processed into 20- to 25-nt siR- 
NAs or microRNAs (miRNAs) by RNase III-like 
enzymes called Dicers. These siRNAs bind to 
effector protein complexes, termed RNA-induced 
silencing complexes (RISC), to mediate degrada- 
tion of cognate mRNA, translational repression, or 
transcriptional silencing (Fig. 7.7). RISCs contain 
several proteins, including a member of the AGO 
clade of ARGONAUTE proteins, which has a 
smRNA binding domain (PAZ) that provides slic- 
ing activity to RISCs and programs it to guide 
sequence-specific inactivation of complementary 
RNA or DNA (see the pathway in more details in 
Voinnet 2009). 
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miRNA/miRNA* duplex that becomes methylated and 
exported from the nucleus to the cytoplasm and assem- 
bled into miRISC. Only one strand of the duplex is stably 
associated with a miRISC complex, and the miRISC* is 
degradated (Modified from Carthew and Sontheimer 
(2009)) 
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7.2.5 Ancestral Metabolism 


Protein networks are commonly used in biology 
to represent molecular mechanisms occurring in 
the cell, from gene regulation (regulatory net- 
works) to enzymatic reactions (metabolic net- 
works) and _ protein-protein interactions 
(signaling networks) (Mazurie et al. 2010). 

The description of regulatory networks 
remains elementary because the annotation of 
regulatory elements in genomes has lagged 
behind the analysis of coding regions. Coding 
regions can be readily assessed by comparing 
genome sequences with cDNA sequences; in 
contrast, regulatory regions are still identified 
primarily on a gene-by-gene basis. Even with the 
use of powerful tools, such as whole-genome 
ChIP-seq and the Encode project, the functional 
significance of binding sites found by large-scale 
screening can only be definitively tested by 
measuring the effect of the loss of binding due 
to mutation on gene expression (Gronostajski 
et al. 2011). 

Currently available metabolic pathway data- 
bases contain hundreds of separate metabolic 
maps describing different parts of general metab- 
olism (Kanehisa and Goto 2000; Maltsev et al. 
2006). Metabolism started in the prebiotic era 
and was optimized during the biotic era to exploit 
a wide variety of energy sources. The metabolism 
processed by extant enzyme networks results 
from the evolutionary optimization of an enor- 
mous variety of modules of similar chemistry. 
Major transitions during evolution occur only 
when energy or some other growth-determining 
resource is abundant because the extra costs 
involved in developing something new are low, 
and pioneering variants have a chance to develop 
their fitness-improving functions under low 
selective pressure (Schuster 1996). Pathways 
processing similar enzymatic functions are likely 
to have similar topologies. This observation has 
motivated the construction of a phylogenetic tree 
based on metabolic pathways to represent the 
evolutionary changes in both genetic content and 
metabolism (Mithani et al. 2009, 2010). The 
organization of metabolism is affected by the 
species’ lifestyle and phenotype. Comparison of 
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metabolic networks across species 1s a key meth- 
odology in understanding how evolutionary pres- 
sures shape these networks. Metabolic networks 
differ among the Archaea, Bacteria, and Eukarya 
due to natural selection (Wagner 2009). Species 
that evolved to accommodate lifestyles that are 
more complex developed metabolic networks 
that are not only bigger but also more structured. 
Communication between pathways is typically 
increased by the addition of hubs (highly con- 
nected pathways that convert metabolites to 
obtain a broader range of input and output path- 
ways) and switchboards (centrally connected 
pathways that capture a large fraction of all 
metabolite traffic). However, specialization 
niches and extreme environments tend to decrease 
the modularity of the metabolic networks of taxa 
(Mazurie et al. 2010). An example of pathway 
that has been acquired during angiosperm evolu- 
tion is that of terpenoids. Terpenoids constitute 
the largest class of plant secondary metabolites 
and play important roles in plant ecological inter- 
actions. Sesquiterpene synthases are involved in 
the production of floral scents used to attract pol- 
linators and are present in dicots, monocots, and 
Magnoliaceae but absent in non-seed plants, 
gymnosperms, and Amborella (Amborella 
Genome Project 2013). 

Signaling pathways are the primary means of 
regulating cell growth, metabolism, differentia- 
tion, and apoptosis of cells to manage their activi- 
ties. The sensing and processing of extracellular 
stimuli are mediated by signal transduction cas- 
cades: molecular circuits that seek to detect, 
amplify, and integrate information to generate 
responses such as changes in enzyme activity, the 
activation/deactivation of transcription factors, 
gene expression, or ion-channel activity. An 
extracellular signal is transmitted through a series 
of molecular modifications (e.g., phosphoryla- 
tion, dephosphorylation, acetylation, and meth- 
ylation) and interactions (e.g., protein—protein 
interactions and protein-DNA interactions). A 
signaling pathway can be defined as a specified 
group of genes that have a coordinated associa- 
tion with a phenotype of interest. Signal trans- 
duction pathways have the potential to branch at 
many steps of a cascade. The specificity of cel- 
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lular responses can be achieved by a small num- 
ber of mechanisms, which may act in combination: 
(1) the same receptor can activate different intra- 
cellular transducers; (11) differences in the kinet- 
ics of the ligand or receptor might generate 
distinct cellular outcomes; (111) combinatorial 
activation by signaling pathways might result in 
the regulation of specific genes; and (iv) cells that 
express distinct transcription factors might 
respond differently when exposed to the same 
signals. Two types of general network architec- 
ture might account for robustness: positive- and 
negative-feedback loops. In _positive-feedback 
loops, the formation of the ligand is often 
enhanced, thereby amplifying, stabilizing, or 
prolonging signaling. Negative-feedback loops 
are used to inhibit and/or limit signaling (Pires- 
daSilva and Sommer 2003). 


7.3 The LUCA 
The last universal common ancestor (LUCA) of 
all extant life forms is regarded as the most basal 
node of the tree of life and is believed to have 
been a cellular entity. However, the LUCA can- 
not be part of a formal lineage because its cellular 
status was most likely attained progressively 
from a gradually evolving community of primor- 
dial organisms. The LUCA’s complexity arose by 
the stepwise elaboration of genetic and cellular 
structures. A large number of simpler intermedi- 
ate forms must have existed before the LUCA, 
but these forms have not left any descendants. 
Life is thought to have arisen on Earth by 
~3.8 Gya based on evidence provided by carbon 
isotope data relating to biological CO, fixation in 
sedimentary rocks of that age (Ueno et al. 2002). 
Stromatolites are the oldest (~3.5 Gya) fossils 
indicative of biological material deposition by 
photosynthetic prokaryotes. By ~1.5 Gya, micro- 
fossils of unicellular organisms from eukaryotic 
algae became reasonably abundant, and excep- 
tionally well-preserved fossils from multicellular 
organisms, most likely those of red algae, were 
laid down approximately 1.2 Gya in sedimentary 
rocks (Butterfield 2000). 
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The LUCA is thought to have been a hyper- 
thermophilic prokaryote because of (i) the 
GC-rich composition of primordial codons 
(Trifonov 2004), (11) the shape of the rRNA tree 
(Woese 1998), (111) the small genomes and sparse 
growth requirements of primeval organisms, and 
(iv) the hot conditions prevailing on the early 
Earth (Di Giulio 2001). However, the hyperther- 
mophilic nature of the LUCA has not been con- 
firmed (Forterre 2001), and many 
hyperthermophiles have genomes with relatively 
low guanine plus cytosine (GC) contents. Salts 
and compatible solutes are present at high con- 
centrations in the cytoplasm of hyperthermo- 
philes, stabilizing DNA at high temperatures 
(Santos and da Costa 2002). 


The LUCA and the Three 
Superkingdoms of Life 


7.3.1 


As just commented above, which of the three 
superkingdoms (Archaea, Bacteria, and Eukarya) 
evolved from the LUCA first is the subject of 
heated debate. Phylogenetic artifacts can be 
responsible for the varied depictions of the tree of 
life (Fig. 7.8). When evolution is understood not 
only as a biological process but also as a general 
thermodynamic process, it becomes evident that 
the quest for the LUCA is unattainable by 
sequence comparison alone. Ambiguities in 
alignments are unavoidable because the driving 
forces and paths of evolution cannot be separated 
from each other. Thus tracking down life’s origin 
is by nature a non-computable task (Koskela and 
Annila 2012). One can list factors promoting 
phylogeny artifacts as (1) a high pace of sequence 
change, (11) differential evolutionary rates among 
organismal lineages, (111) non-orthologous gene 
displacement, (iv) difficulty in identifying homol- 
ogy by sequence alignment, and (v) horizontal 
gene transfer (Kim and Caetano-Anollés 2011). 
Although comparative genomics has revealed 
that gene transfer occurs frequently during 
genome evolution, a core of informational genes 
(those involved in information processing and 
expression) appears more resistant to transfer. 
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Fig. 7.8 Model of putative formation of the three super- 
kingdoms of life by independent viral DNA transfers to 
RNA cells. Many lineages of RNA cells (square crossed) 
with various types of membranes (bold, thin for archaeal- 
like and bacterial-like lipids, respectively) and translation 
apparatus with different canonical patterns (light gray, 
dark gray) diverged from the RNA LUCA during the sec- 
ond age of the RNA world. In three of them, the RNA 
genomes were replaced independently by DNA genomes 


These genes testify to the existence of a close 
relationship between archaeal and eukaryal infor- 
mational processes (Forterre 2001). By contrast, 
the operational genes of eukaryotes (those 
involved in metabolic and biosynthetic pathways) 
reflect eubacterial ancestry (Walsh and Doolittle 
2005). 

A phylogenomic study of protein domain 
structures based on highly conserved fold super- 
families (FSF) and an iterative approach was 
used to reconstruct upper and lower bounds for 
the LUCA’s proteome of approximately 220 
FSFs. The functional composition of the putative 
LUCA’s proteome represents only 5—11 % of the 
1,420 FSFs of extant proteomes. Trees of pro- 
teomes reconstructed directly from FSFs showed 
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from different large double-stranded DNA viruses (a—c). 
The DNA viruses and the RNA cells at the origin of 
Archaea and Eukarya shared more similar features in their 
informational proteins, but the RNA cells at the origin of 
Bacteria and Eukarya shared similar lipids. The descen- 
dents of the three ancestors of the present-day domains 
eliminated all RNA cell lineages by Darwinian competi- 
tion. The dashed line indicates the time of genome transi- 
tion from RNA to DNA (Modified from Forterre 2005) 


that the LUCA’s proteome is always placed at 
their base and have shown that the tree of life is 
rooted in Archaea (Kim and Caetano-Anollés 
2011). Rooting the tree of life in Archaea implies 
that bacterial and archaeal lineages would have 
evolved from a primordial eukaryotic-like lin- 
eage by reductive loss, as confirmed by phyloge- 
nomic analysis (Wang et al. 2007). The relatively 
small number of FSFs (SCOP — http://scop.mrc- 
Imb.cam.ac.uk/scop/ — currently defines approxi- 
mately 2,000 FSFs) present in nature indicates 
that protein structures are much more conserved 
(three to ten times) than their primary sequences 
(Illergard et al. 2009). Protein structures are 
therefore good phylogenetic markers for deep 
events in evolutionary history; however, these 
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markers are limited in use to periods after the 
LUCA because complex folded proteins are not 
expected to have existed before it. 

According to a taxonomic system based on the 
three superkingdoms as outlined above, an indi- 
vidual FSF can be unique to one of the super- 
kingdoms Archaea (A), Bacteria (B), or Eukarya 
(E) or common to all three superkingdoms (ABE) 
if it was present in the LUCA. The putative 
LUCA’s proteome was defined in such a way that 
it only contained ABE domains. Domains com- 
mon to ABE were found ubiquitously and are 
highly represented in modern proteomes. The 
case where a FSF is shared by two phyla (AB, 
BE, or AE) implies horizontal transfer of the 
character considered during the formation of 
those phyla. 

The informational functions encompassed by 
the putative LUCA’s proteome include DNA rep- 
lication/repair, transcription, RNA _ processing, 
and translation, i.e., ribosomal proteins that are 
crucial for protein synthesis act as landmarks of 
ribosomal evolution. Ribosome — evolution 
occurred in two steps before 3 Gya, suggesting 
that the earliest start of organismal diversification 
occurred sometime after 2.9 Gya (Kim and 
Caetano-Anollés 2011), during planetary oxy- 
genation. The potential for the emission of oxy- 
gen by green plant photosynthesis existed at least 
by 2.7—2.1 Gya, and atmospheric oxygen first 
began to be available in relatively large quantities 
most likely by approximately 1.2 Gya before 
present. Functions related to nuclear structure 
and chromatin structure that are essential for 
developing the eukaryotic cell were absent in the 
putative LUCA’s proteome and only developed 
as life diversified into the three superkingdoms. 

Considering operational functions, FSFs 
related to metabolism were commonly distrib- 
uted in coenzyme metabolism and transport (m/ 
tr), amino acid m/tr, other enzymes, carbohydrate 
m/tr, transferases, polysaccharide m/tr, redox, 
secondary metabolism, energy, and storage. FSFs 
for photosynthesis, cell envelope m/tr, electron 
transfer, nitrogen m/tr, and lipid m/tr were com- 
pletely absent from the putative LUCA’s pro- 
teome. The subcategories of intracellular 
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processes including protein modification, trans- 
port, proteases, and ion m/tr were present in the 
putative LUCA’s proteome, whereas cell cycle 
functions related to apoptosis, phospholipid m/tr, 
cell motility, and _ trafficking/secretion were 
absent. Regarding regulatory functions, FSFs 
present 1n the lower bound of the putative LUCA’s 
proteome were related to kinase/phosphatase 
activities and DNA binding, whereas additional 
FSFs involving receptor activity and other regu- 
latory functions only appeared later (Kim and 
Caetano-Anollés 2011). 


7.3.2 The LUCA’s Emergence 


Presumably, molecular oxygen from the photo- 
dissociation of water vapor might have threat- 
ened the highly reduced nucleic acids of early 
self-replicating cellular systems. To protect 
themselves from oxidation, cells eventually 
evolved toward mechanisms to produce ATP that 
utilized visible solar energy absorbed by 
chlorophyll-like porphyrins, replacing earlier 
mechanisms in which ATP was synthesized by 
vacuolar proton translocating pyrophosphatase 
(V-H+PPase) proteins that are localized in the 
acidocalcisomes of extant organisms from bacte- 
ria to higher eukaryotes (Motta et al. 2009). The 
energy demands of the LUCA in the environ- 
ments of early Earth might have been met by a 
combination of polyP hydrolysis, the use of a H+ 
gradient, and the regeneration of ATP and PPi. 
The multifunctionality of polyP makes it a unique 
link between living organisms and the inorganic 
world (Seufferheld et al. 2011). With the energy 
released from light-produced ATP, atmospheric 
hydrogen might have been used to reduce CQ. 
The production of increasing amounts of oxygen 
by photosynthesis must have been deleterious for 
organic compounds, and obligate anaerobes 
would have found progressively fewer niches. 
Ultimately, all life might have become dependent 
on visible light photosynthesis. Genes in the 
respiratory pathway might have been selected to 
transfer hydrogen to oxygen rather than nitrogen 
or sulfate (Sagan 1967). 
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7.3.3. The Birth of Eukaryotes 
and Green Plant Emergence 


In the Margulis’ hypothesis referred to as serial 
endosymbiosis, mitochondria and photosynthetic 
plastids were once free-living prokaryotic cells 
(Sagan 1967). Classical mitosis evolved in 
protozoan-type cells millions of years after the 
evolution of photosynthesis. During the course of 
the evolution of mitosis, photosynthetic plastids 
would have been acquired symbiotically more 
than 1.5 Gya by some of these protozoans engulf- 
ing a cyanobacterium (Martin and Russell 2003) 
to form eukaryotic algae and green plants. It is 
now widely accepted that an o-proteobacterium 
was the ancestor of mitochondria (Dyall et al. 
2004). 

Terrestrial conquest by simple rootless plants 
lacking vascular tissue and producing spores 
rather than seeds such as bryophytes (liverworts, 
hornworts, mosses) occurred at the Ordovician 
by ~450 million years ago (Mya) and was fol- 
lowed by vascular plants with one or few cells in 
the surface layer of the apical meristem such as 
lycopods by ~415 Mya during the Silurian. New 
vegetal groups (ferns and horsetails) appeared at 
around 385 Mya (Devonian) together with first 
arthropods (scorpions). Terrestrial vertebrates 
appeared at ~360 Mya _ successively with 
dipneusts, amphibians, and reptiles at ~340 Mya 
(Carboniferous). The first gymnosperms 
appeared at ~320 Mya and the first angiosperms 
between 180 and 140 Mya (Lower Jurassic) with 
their fast radiation (<5 million years) during the 
Lower Cretaceous into 97 % of extant species, 
1.e., more than 250,000 (Bell et al. 2010; Soltis 
et al. 2009). Orthologs of developmental genes 
most likely existed long before the establishment 
of their specific role in seed plant (Graham et al. 
2000; Becker et al. 2003; Dolan 2009; Huang 
et al. 2014). The gene set of the last angiosperm 
common ancestor (LACA) was inferred to con- 
tain at least 14,000 protein-coding genes with 
1,179 (including genes for flowering, MADS 
box, wood formation, responses to environmental 
stress) being more specifically associated to 
angiosperm divergence. Notable examples of 
genes involved in response to external stimuli 
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include those that drove floral and insect coevolu- 
tion in angiosperms. Herbivory has also been 
proposed as a significant environmental con- 
straint in angiosperm evolution, particularly in 
Poaceae (Amborella Genome Project 2013). 
Most of small interfering RNAs (siRNAs) and 
microRNA (miRNA) families and their homolo- 
gous targets were also likely present in the 
LACA. 


7.3.3.1 The Origin of the Nucleus 

In contrast to mitochondria and plastids, the ori- 
gin of the cell nucleus has not yet been fully elu- 
cidated. The nucleus is a complex structure, and 
therefore, it is expected to have arisen only once 
during evolution; the underlying reasons cannot 
be simple, and most models take this into account. 
The most widespread and familiar model for the 
origin of nucleus is the concept that the endo- 
membrane system of eukaryotes, to which the 
nucleus and endoplasmic reticulum belong, is 
derived from invagination of the plasma mem- 
brane of a prokaryote (Cavalier-Smith 2002, 
2004). The invagination model postulates that a 
prokaryote lost its cell wall and evolved phagocy- 
tosis. An example of natural endocytosis still 
exists in prokaryotes, 1.e., endospore formation 
by extant bacteria. Another striking example of 
prokaryotic endomembranes that superficially 
resemble those of the eukaryotic nucleus is found 
in the planctomycetes (Lindsay et al. 2001). 
Discrete membrane-bound compartments have 
been described in two planctomycetes of the gen- 
era Gemmata and Pirellula (Fuerst and Sagulenko 
2013). 

Another attractive hypothesis is that the cell 
nucleus could have evolved from viruses. Viruses 
could have evolved by parasitic reduction from 
ancient cellular lineages that were out-competed 
in the Darwinian selection process before the 
LUCA and thus could only survive as parasites in 
the winners of this competition (Forterre 2005). 
Nuclei and viruses have several features in 
common: 

(1) The cell nucleus is functionally equivalent 
to a selfish DNA virus; it replicates its DNA 
using cellular metabolism. 
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(ii) Cell nuclei and viruses lack protein- and 


lipid-producing pathways within _ their 
boundaries. 

Gil) Both contain linear chromosomes, 
whereas most bacterial chromosomes are 
circular. 

(iv) Both disassemble their boundaries during 
replication. 


(v) Both transcribe DNA but do not translate 
mRNA within their boundaries. As they rep- 
licate within a cell, some poxviruses even 
form a membrane around their DNA using 
the endoplasmic reticulum of the infected 
cell. The eukaryotic cell uses the same mate- 
rial to surround its nucleus. 

Some viral DNA strands have primitive 

telomeres, which act as protective DNA 

sequences that are found at the ends of 
eukaryotic chromosomes. 

An astonishing virus that largely overlaps 
with the world of cellular organisms, both in 
terms of size and genome complexity, is the giant 
Acanthamoeba polyphaga, a eukaryotic nucleo- 
cytoplasmic large dsDNA virus (NCLDV). This 
NCLDV has a genome of 1.2 Mb representing 
911 genes with little evidence of horizontal trans- 
fer (Raoult et al. 2004). 

According to the viral hypothesis, the viral 
transfer of DNA replication apparatus to the 
LUCA’s cells would have occurred indepen- 
dently three times (Fig. 7.8), thus promoting the 
foundations of the three ancestral cell lineages 
for Archaea, Bacteria, and Eukarya (Forterre 
2005). With this viral hypothesis, it would be 
possible to explain (Forterre 2006): 

(1) Why there are three superkingdoms instead 
of a single continuum between the LUCA 
and extant organisms. 

(41) The existence of three canonical ribosomal 
patterns. 

(1) The critical differences exhibited by the oth- 
erwise similar eukaryotic and archaeal repli- 
cation machineries. Cellular DNA 
informational proteins are often found in 
only one or two (not three) versions and 
exist in different analogous families. For 
instance, there are six known analogous 
families of cellular DNA polymerases: one 


(vi) 
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version in Bacteria, one in Archaea, and sev- 
eral in Eukarya (Filée et al. 2002). 

(iv) Why eukaryote lipids are bacterial-like, but 
ribosomes are archaeal-like. 

The type of combinatorial evolution just out- 
lined is integral to many current views of evolu- 
tion, and it is obvious that horizontal transfers 
have occurred continuously during genome evo- 
lution. The relationship between Eukaryotes and 
Archaea/Bacteria bears witness to a long com- 
mon history of multiple interchanges between 
phyla that is difficult to decipher (Walsh and 
Doolittle 2005), as is the issue concerning the 
origin of the nucleus. However, it is now thought 
that the molecular machinery involved in infor- 
mation storage and retrieval in eukaryotes shares 
much greater similarity in terms of overall design 
and sequence conservation to archaeal counter- 
parts than to bacterial ones (Anantharaman et al. 
2002). Comparative genome analyses have 
shown that the last eukaryote common ancestor 
(LECA) possessed much of the complexity of 
extant eukaryotes now seen in the replisome, the 
spliceosome, the nucleoli, and the endocytic sys- 
tem, as well as the machineries necessary for 
meiosis and phagotrophy. 

Multinuclearity and duplicated cistrons or 
entire genomes (polyploidy) may have been early 
mechanisms to distribute greater amounts of 
DNA evenly to daughter cells because the use of 
multiple copies greatly increases the probability 
that each daughter cell will contain at least one 
copy of the genome. However, the inefficiency of 
polyploidy and the selective advantage of linkage 
groups over polyploidy have been demonstrated 
(Gabriel 1960). Many lower eukaryotes that still 
follow poliploidy (premitosis) as a mean of 
genetic material transmission to their progeny 
have the general features of eukaryotic cells 
(mitochondria, nuclear membranes, etc.), but 
their aberrant division figures often lack the spin- 
dles and centrioles of typical eumitosis (as seen 
in Amoeba, Euglena, Tetramitus, etc.). This sys- 
tem is ancestral to meiosis and fertilization. The 
evolution of mitosis must have taken millions of 
years. Based on the abundance of fossil eukary- 
otes, this event must have occurred before the 
dawn of the Cambrian. Hence, the most likely 
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period for the evolution of mitosis is between 1.2 
and 0.6 Gya (Sagan 1967). 


7.3.3.2 Chromosomes 

Eukaryotic chromosomes may differ in size, 
shape, number, and DNA composition. The size 
of metaphase chromosomes varies from less than 
1 um to more than 10 pm, but upper and lower 
tolerance limits for chromosome size are appar- 
ently determined by the genome size, chromo- 
some number, and karyotype structure of a given 
species. The shape of a chromosome is deter- 
mined by the position of the primary constriction 
(the centromere), and its size can be altered by 
reciprocal translocation, sister chromatid 
exchange, crossover, and replication slipping 
(Schubert 2007). Telomeric sequences may move 
to interstitial positions by translocation or inver- 
sion and become hot spots of chromosome break- 
age. According to the minimum interaction 
hypothesis (Imai et al. 1986), karyotype evolu- 
tion generally tends toward an increasing number 
of acrocentric chromosomes, thereby minimizing 
the risk of deleterious rearrangements. The 
reduction of chromosome number and the forma- 
tion of metacentric chromosomes are considered 


Fig.7.9 A karyotype of 
maize chromosomes 
showing primary (1st) and 
secondary (2nd) 
constrictions 


N. Carels 


rare and randomly generated events that are 
tolerated or favored when they provide a short- 
term advantage. The nucleolus-organizing region 
(NOR), which is the site of 45S repetitive DNA 
(rDNA), may mark a chromosome either at a ter- 
minal or an interstitial position by giving rise to a 
secondary constriction (Fig. 7.9). The positions 
of NOR and 5S rDNA are highly polymorphic 
and potentially mobile within a genome. Most 
chromosome’ rearrangements occur within 
regions of extended repeats and are the result of 
preferential mis-repair by ectopic recombination 
between nonallelic repeats (Schubert et al. 2004). 

Linear chromosomes are a prerequisite for the 
accumulation of DNA because dispersed, repeti- 
tive sequences tend to recombine ectopically dur- 
ing DNA _ repair, often with deleterious 
consequences for circular, but less so for linear, 
DNA molecules (Schubert 2007). Chromosomes 
evolve by rearrangements and ploidy alterations 
subsequent to interspecific hybridization and/or 
mitotic or meiotic errors. Chromosome rear- 
rangements are due to the erroneous ligation of 
DNA _ strands during’ repair processes. 


Transposition, errors during replication, unequal 
recombination, and insertions and/or deletions 
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during double-strand break repair via 
recombination might also contribute to chromo- 
some evolution by promoting their shrinkage or 
expansion. 


7.3.3.3 Centromere 

Centromeric repeats comprise the most rapidly 
evolving DNA sequences in eukaryotic genomes. 
Satellite changes occur through a variety of 
mutational processes including (i) replication 
slippage, (41) unequal exchange, (111) transposi- 
tion, and (iv) excision. The fast sequence changes 
observed in the centromere appear paradoxical in 
comparison to the stable inheritance of this struc- 
ture. Indeed, the chromosome segregation 
machinery is highly conserved across all eukary- 
otes. Despite the lack of universal sequence 
motifs, repeat unit length can be remarkably sim- 
ilar between organisms and is typically between 
100 and 200 bp. It is thought that incompatibili- 
ties between rapidly evolving centromeric com- 
ponents may be responsible for the reproductive 
isolation of emerging species. Proteins from the 
kinetochore are only found at centromeres. The 
kinetochore is a proteinaceous structure that 
assembles on centromeric chromatin and con- 
nects the centromere to spindle microtubules. 
However, only constitutive proteins are candi- 
dates for directly maintaining centromeres. The 
model states that a H3-like histone exists that is 
specific to the centromere region. The sequence 
of the variable core of this histone coevolves with 
that of centromeres across eukaryote species. 
This H3-like histone is only present at native cen- 
tromeres and neocentromeres, is absent from 
mutated or inactivated centromeres, and is 
expected to maintain favorable interactions with 
centromeric satellites. 

The asymmetry of the meiotic tetrad provides 
an opportunity for chromosomes to compete for 
inclusion into the oocyte nucleus by attaining a 
preferable orientation at meiosis. This competi- 
tive process might drive adaptive changes, thus 
providing an explanation for fast centromere 
evolution, which is critical for such an essential 
function. Deleterious mutation by transposon 
insertion might promote satellite accumulation 
representing centromeric relics surrounding 


153 


functional centromeres. Parents in hybrids are 
compatible when the two sets of centromeric sat- 
ellites and the two alleles encoding the centro- 
meric histone are balanced. Thus, infertility and 
distortion of the sex ratio are expected symptoms 
displayed by hybrids bringing together incompat- 
ible combinations of satellite vs. H3-like histones 
and eventually reproductive isolation or specia- 
tion (Henikoff et al. 2001). 


7.4 DNA Repair Machinery 

In response to continuous hydrolytic and oxida- 
tive DNA damage, cells must have an effective 
network of repair systems that recognize, remove, 
and rebuild the injured sites. DNA repair 1s criti- 
cal for the maintenance of genome integrity and 
replication fidelity in all cells and therefore was 
arguably of major importance in the LUCA as 
well. Like Bacteria, Archaea have minimal sets 
of genes involved in all of the major cellular 
information transfer processes compared with 
Eukarya, which have highly paralogous and 
redundant sets of genes for DNA replication, 
repair, and recombination. 

Repair mechanisms can be classified into sev- 
eral distinct major pathways: (1) direct damage 
reversal (DDR), (41) base excision repair (BER), 
(111) nucleotide excision repair (NER), (iv) mis- 
match repair (MMR), and (v) recombination 
repair (RER, Fig. 7.10). The main protein func- 
tions involved in these repair systems are (1) 
endo- and exonucleases and glycosidases, (11) 
DNA helicases, (ii) ATPases (other than heli- 
cases) that are involved in events such as strand 
migration and the loading of multi-protein repair 
complexes onto DNA, (iv) DNA ligases, (v) 
DNA polymerases and nucleotidyltransferases, 
(vi) DNA-binding domains, (vii) adaptors (pro- 
tein—protein interaction domains that glue 
together diverse proteins in repair complexes and 
link to other cellular components, such as eukary- 
otic chromatin), and (viii) ATPases (Jensen et al. 
2011). 

Repair systems are modeled by selective pres- 
sures on the fidelity of transmission of genetic 
information and the need for evolvability. Repair 
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Fig.7.10 Repair of double-strand breaks by homologous 
recombination in eukaryotes 


systems are striking for their poor representation 
of universal components, which contrasts with 
the case of translation machinery (Farnell 2011). 
The repair machinery shows considerable vari- 
ability, in terms of the genes that are present or 
absent, even in closely related bacteria. Error- 
prone repair or lesion bypass synthesis refers, 
here, to specialized DNA polymerases that are 
able to progress through DNA lesions. This path- 
way, termed translesion repair, is inherently 
mutagenic because of the miscoding nature of 
most damaged nucleotides (DiRuggiero and 
Robb 2004). A highly efficient repair is manda- 
tory in free-living organisms, which are subject 
to rapid changes in the environment and thus 
employ more sophisticated repair regulation at 
the transcription level by specialized regulators. 
In free-living organisms, genomic variation 1s 
obtained via error-prone repair mechanisms, such 
as those induced by translesion repair. Eukaryotes 
have a higher-order chromatin structure that is 
more complex than that of their prokaryotic 
counterparts. The evolution of these structures 
placed additional barriers to the interaction of 
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repair enzymes with damaged DNA and led to 
the concomitant evolution of specific adaptor 
proteins that connect the repair machinery and 
chromatin (Aravind et al. 2000). 

The LUCA is expected to have encoded a 
recombinase and several helicases and nucleases, 
including approximately ten types of repair pro- 
tein domains (universal families). Most of these 
nuclease and helicase families have members 
with RNA substrates, and the most common 
nucleic acid-binding module in repair proteins, 
HhH, exists in both RNA-binding and DNA- 
binding versions, suggesting that DNA repair 
evolved from systems that already provided func- 
tions in the RNA world. The number of ortholo- 
gous or even clearly functionally equivalent 
repair proteins that are shared by all three super- 
kingdoms is very small. In contrast, a much larger 
number of repair proteins are conserved in one or 
two superkingdoms. There appear to be only a 
few known repair proteins, such as RecA/RadA 
recombinase, that have an identical domain 
arrangement that is conserved among Bacteria, 
Archaea, and Eukarya (Aravind et al. 2000). 


The Mutation Bias 
in Bacteria as a Model 
for Eukaryotes 
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The genomic GC content of bacteria varies dra- 
matically, from less than 15 % (McCutcheon and 
Moran 2010) to more than 70 % (Hildebrand 
et al. 2010). It has been shown that genomic GC 
content is correlated with a number of factors 
including genome size, environmental features, 
aerobiosis, nitrogen utilization, and temperature, 
which may be subject to selective pressure for 
tuning GC content (see in Hildebrand et al. 2010). 
Genomic GC variation is generally ascribed to 
differences in the pattern of mutation toward ade- 
nine plus thymine (AT) or GC. There are two pat- 
terns of mutation bias, i.e., global or 
transcript-specific, each of which is derived from 
a different mechanism. The former is attributable 
to DNA replication and global repair (see above), 
and the latter mainly results from transcription- 
coupled repair (Tornaletti 2009). Deficiencies in 
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mutator genes can dramatically increase the 
mutation rate. For example, in the absence of 
both mutY and mutM, thousandfold increases in 
GC-to-AT mutation bias were observed; the same 
magnitude of mutations is evident in mutT- 
deficient strains, but resulting in a bias toward 
GC (Horst et al. 1999). 

It has been demonstrated that the DNA poly- 
merase III « subunit and its isoforms participate 
in either replication (polC) or SOS mutagenesis/ 
translesion synthesis (dnaE2) and play dominant 
roles in determining GC variability (Wu et al. 
2012). The early phase of the SOS response is 
mostly dominated by accurate DNA repair, 
whereas the later phase is characterized by ele- 
vated mutation levels caused by error-prone DNA 
replication. SOS mutagenesis is largely the result 
of the action of DNA polymerase V (pol V), 
which has the ability to insert nucleotides oppo- 
site to various DNA lesions via translesion. Pol 
V, encoded by the umuDC operon, is largely 
responsible for the approximately 100-fold 
increase in chromosomal mutation induced by 
DNA damages. The mutations that occur may kill 
many cells; however, if replication is successfully 
restarted, the lucky cells will survive (Patel et al. 
2010). 

A correlation exists between variation in GC 
content and the dimeric combinations of DNA 
polymerase III « subunits, showing that eubacte- 
ria can be grouped into different GC variable 
groups: (1) the full GC spectrum or dnaE1 group, 
(ii) the high GC content or dnaE2—dnaE1 group, 
and (iii) the low GC content or polC—dnaE3 
group (Zhao et al. 2007). The existence of dnaE2 
and polC is associated with higher GC (>50 %) 
and lower GC (<50 %) contents, respectively. 
The DNA polymerase III a subunit and its iso- 
forms participate either in replication (polC) or in 
SOS mutagenesis/translesion (dnaE2), playing a 
dominant role in producing GC variations that 
can be classified into three basic spectra: GC 
variable, high-GC, and low-GC groups. For 
example, the presence of dnaE2 1s an indication 
of a higher GC content. On average, aerobic bac- 
teria exhibit higher GC contents than anaerobic 
bacteria because aerobes tend to carry the 
GC-enriching polymerase whereas anaerobes 
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tend to carry the AT-enriching polymerase. 
Concerning the relationship between genomic 
GC content and optimal growth temperature, 
thermophiles most frequently use _ the 
AT-enriching polymerase, whereas mesophiles or 
psychrophiles most frequently use the 
GC-enriching polymerase (Wu et al. 2012). Thus, 
the loss or gain of specific mutator genes is 
expected to alter the GC content of bacterial 
genomes (Akashi and Yoshikawa 2013). A simi- 
lar process is expected to occur in eukaryotes 
(Touchon et al. 2004). Experimental evidence is 
given in the following section to show that selec- 
tive processes also affect GC content in eukary- 
otic genomes. 


Correlations Between GC 
Content and Genomic 
Features in Eukaryotes 
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The universal inter-genomic correlation shows 
that CDSs from prokaryote or eukaryotic 
genomes that are homogeneous regarding GC 
content follow a regression line (D’ Onofrio et al. 
1999). This relationship also holds when consid- 
ering the intra-genomic correlation of GC in 2nd 
position of codons (GC2) vs. GC in 3rd position 
of codons (GC3) within genomes of heteroge- 
neous eukaryotes such as humans and rice (Carels 
and Frias 2013). In uncharacterized species, GC2 
and GC3 can be easily obtained from transcrip- 
tome data (Carels and Frias 2013). Genome size 
and GC content are easily obtained using flux 
cytometry. CsCl sedimentation remains the only 
technique able to faithfully report the whole- 
genome statistics of large DNA _ fragments 
>100 kb (Bernardi 2000; Clay et al. 2001; Carels 
2005a). However, the DNA sequencing of large 
DNA fragments has become more consistent due 
to the technology developed by Pacific Bioscience 
(Eid et al. 2009). Next-generation sequencing of 
large DNA fragments is expected to increase in 
the future due to the development of emerging 
sequencing technologies such as that developed 
by Oxford Nanopore (Clarke et al. 2009). 

CsCl sedimentation of large DNA fragments 
has allowed Bernardi’s group to investigate 


156 


genome evolution in vertebrates (Clay et al. 
2003) and plants (Carels 2005a). By analyzing 
the DNA fragment (>100 kb) distribution of large 
eukaryotic genomes, it has been shown that GC 
transitions occurred independently in birds and 
mammals (two warm-blooded vertebrate lin- 
eages that evolved independently from cold- 
blooded vertebrates) and in plants (Poaceae, 
Fig. 7.11). Interestingly, the platypus is also sur- 
prisingly GC-rich (Costantini et al. 2009) as is C. 
reinhardtii, indicating that compositional transi- 
tions toward GC have occurred independently 
several times in eukaryotes. Transitions toward 
AT also exist, for example, in unicellular para- 
sitic eukaryotes (Costantini et al. 2013); however, 
these transitions were investigated to a much 
lesser extent. The compositional characterization 
of unicellular eukaryotes also shows a mosaic of 
DNA fragments of different GC compositions, 
indicating that such a compositional patchwork 
already existed in the LUCA, which would not be 
surprising given that modular genetic structures 
were inherited from the RNA world. Correlations 


Fig.7.11 Taxonomic 
associations for eight plant 
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between the GC content of genomes and selec- 
tive pressures associated with species’ living 
conditions are difficult to investigate due to the 
inherent large variability associated with ecologi- 
cal variables and independent cellular strategies 
to cope with the environmental selective pres- 
sure. The search for correlations relating to GC 
content among genomic structures (especially in 
eukaryotes) or even nuclear compartments was 
much more productive (see the pioneering inves- 
tigations conducted by Saccone et al. 2002 in 
warm-blooded vertebrates). 

The first concept investigated was that of i1so- 
chores (typically found in human Pavlicek et al. 
2002; Costantini et al. 2006), 1.e., large genomic 
regions (>>300 kb) that are somewhat homoge- 
neous in GC (GC variation <4 %, Bernardi 2000; 
Costantini et al. 2009) and are present in large 
heterogeneous genomes such as those of mam- 
mals (ranging from 30 to 60 % GC in humans, 
Zoubak et al. 1996) and birds. In further develop- 
ments, correlations between the CDS distribution 
and the GC level of isochores were sought. Genes 


Solanum 





melongena 
species with their Solanum i 
corresponding GC3 Heliantus — 
distributions. Sorghum ns sree a 
(Poaceae) show a clear GC Arabidopsis ee 
transition toward high GC3 Carica 
contents compared to Eudicots Eucalyptus 
Arabidopsis, taken as an Eucalyptus globulus 
example. Another partial | 
compositional transition 2 Sorghum 
can be observed in Sorghum oe bicolor 
Eucalyptus (Modified from o 
Serres-Giardi et al. ® Elaeis 
(2012)). Other transitions . c guineensis 
should appear as sequenc- Monocots —Elaeiss & 
ing efforts proceed 
Allium 
Allium cepa 
Basal | 
angiospem Amborella 
Amborella trichopoda 
G — Ginkgo 
yINHOSHS Gingko : biloba 


0 20 40 60 8010 
GC3, “Yo 


7 AHistory of Genomic Structures: The Big Picture 


were found to be concentrated in a gene space 
whose GC level is larger than the mode of DNA 
fragment distribution according to GC content 
(Fig. 7.12). Such a CDS distribution bias accord- 
ing to GC level led to the introduction of the con- 
cept of gene deserts, which are associated with 
GC-poor genomic fragments in which genes are 
rare (Bernardi 2012). Sets of genes can also be 
classified as GC-poor or GC-rich according to 
their intron number and size (Carels and Bernardi 
2000b). A similar correlation has been found in 
humans by Duret et al. (1995), but we verified 
that this is much less striking than the correlation 
in plants. Interestingly, the GC level of exons is 
generally larger than that of introns and inter- 
genic sequences in vertebrates (Costantini and 
Bernardi 2008) and in plants (Carels et al. 1998; 
Carels and Bernardi 2000a, b; Carels 2005b). 


Fig.7.12 Relationship 
between genome size, gene 
expression, physical 
distribution of genes and 
repeated sequences in 
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According to the classification of genes as 
GC-poor or GC-rich, we also verified that pro- 
moters of GC-rich genes in Poaceae match the 
definition of CpG islands, which is not the case 
for GC-poor genes, suggesting that different 
types of gene regulation apply to either GC class 
(Carels 2005a). In humans, a positive correlation 
is observed between GC and CpG (Varriale and 
Bernardi 2010); however, even in the GC-rich 
compartment, the ratio of CpG observed over the 
expected level is generally lower than 1 
(Costantini et al. 2009), although it is larger or 
equal to 1 in rice (Carels 2005a), showing that 
CpG targets remain unmethylated in the GC-rich 
compartment of Poaceae despite the more com- 
plex methylation pattern observed in plants. The 
resistance of plant CpG islands against GC ero- 
sion in GC-rich genes of plants demonstrates an 
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active maintenance against the CpG erosion into 
TpG due to the mutation of 5’-*CpG-3’ (SmC) to 
5’'-TpG-3’. In humans, genes in the gene desert 
compartment that are essentially formed by 
AT-rich isochores appear to be subject to a much 
more sophisticated regulation machinery 
(Nobrega et al. 2003) than genes from GC-rich 
isochores; this is somewhat analogous with plant 
organization with respect to the difference in 
gene regulation between GC-poor and GC-rich 
compartments, as outlined above. The gene des- 
ert typically matches Giemsa+ bands in verte- 
brates whereas the gene space liesin Giemsa- bands 
(Costantini et al. 2007a) where chromatin is more 
open and the recombination rate is higher (see in 
Carels 2005a; Tian et al. 2009; Nam and Ellegren 
2012; Flowers et al. 2012). In maize (Carels 
2005b), the situation is somewhat different 
because all genes (except those encoding storage 
proteins) that cover the same GC3 interval as all 
human genes (30 % < GC3 <90 %) are grouped 
into a gene space of 4 % GC, 1.e., an interval that 
is equivalent to the GC variability of DNA frag- 
ments belonging to a single isochore family (the 
human genome includes five isochore families, 
Bernardi 1989). Although it occurs to a lesser 
extent, the difference in gene density between 
GC-poor and GC-rich chromosomal compart- 
ments is observed even in such a compact genome 
as that of Arabidopsis (Carels and Bernardi 
2000a). In wheat, the gene distribution is orga- 
nized as islands of higher density (Sandhu and 
Gill 2002; Erayman et al. 2004). 

In humans, the isochore GC level has been 
found to correlate with the 3D organization of the 
nucleus in interphase in such a way that GC-poor 
sequences are on the nuclear periphery and stack 
onto the nuclear membrane. In contrast, GC-rich 
isochores loop toward the center of the nucleus 
(Saccone et al. 2002). This 3D organization sug- 
gests that GC-rich and GC-poor isochores are 
involved in very different functions. In plants, 
GC-rich genes include genes for carbohydrate 
transport and metabolism, cell wall synthesis, 
extracellular structures, and secondary metabo- 
lite synthesis; these genes are essential for energy 
management, physical structure, and defense, 
1.e., for the relationship between a cell and its 
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environment. In contrast, GC-poor genes mainly 
address information functions, such as cell 
mechanics, and general functions such as cell 
identity (Carels 2005a). 

Isochore families are also found in cold- 
blooded vertebrates, although usually within a 
more homogeneous context (Costantini et al. 
2009) with a similar preference for the gene 
space to be associated with GC-rich isochores 
(Bucciarelli et al. 2009); this shows that the 
GC-rich compartments experienced the composi- 
tional transition as seen in its most expressive 
form in birds and mammals (Costantini et al. 
2007b). Thus, the gene landscape (Cruveiller 
et al. 2004) is a sophisticated feature that might 
have emerged with compositional heterogeneity 
in large genomes. A selective advantage associ- 
ated with such a sophisticated organization of 
chromosomal territories might be found in the 
regulation system, allowing genes to be regulated 
spatially and temporally under different compo- 
sitional contexts (different promoter and enhancer 
motifs) (Bernardi 2012). Differential levels of 
gene expression between the GC-poor and 
GC-rich compartments have been observed in 
humans (Arhondakis et al. 2004, 2011) and plants 
(Carels and Bernardi 2000a, b). 


7.7 Genome Size 

The regional structure of heterogeneous genomes 
may have been promoted by the interplay of (1) 
an increase in genome size by polyploidization 
(Fig. 7.13), which is rather common in plants 
(Matassi et al. 1991; Tate et al. 2005); (41) iso- 
pycnic retrotransposon integration (Rynditch 
et al. 1998); and/or (111) a mutational bias toward 
GC under selective pressure, possibly on the 
regulation of gene expression. In animals, poly- 
ploidization events also exist in metazoa 
(Gregory and Mable 2005), such as fish (the tet- 
raploidization event in the stem lineage of ray- 
finned fish resulted in >20,000 living species), 
amphibians, and reptiles (lizards). Diploid—trip- 
loid mosaics are known in humans, cats, mice, 
rabbits, cattle, pigs, mink, and chickens, but 
these are usually malformed or at least sterile 
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Fig. 7.13 Gene and DNA fragment distributions in the 
amphidiploid genome of tobacco. Genes (NAR = nitrate 
reductase; POD = lignin-forming peroxidase) located in 
the GC-poor and GC-rich compositional fractions of N. 
sylvestris are shown as black boxes while the distribution 
of the same genes in N. tomentosiformis are shown as 
white boxes. The compositional distribution of Tnt-1 
transposon is shown by gray rectangles. This experiment 
shows how interspecific hybridization may affect the 
regional organization in plant nuclear genomes. DNA 
fragment distribution was obtained by CsCl sedimentation 
and gene distribution by Southern hybridization with 
Cs,SO, sedimentation fractions (Modified from Matassi 
et al. (1991)) 


(Fechheimer et al. 1983). The South American 
red viscacha rat Tympanoctomys barrerae has 
twice the DNA content and number of chromo- 
somes as its closest relatives, providing the first 
example of a naturally occurring tetraploid 
mammal (Gallardo et al. 2004). Two rounds of 
entire genome duplications would have occurred 
early during vertebrate evolution, the evidence 
of which can still be observed as duplicated 
blocks in the human genome (Van de Peer and 
Meyer 2005). 

In plants, polyploidy seems to have been deci- 
sive for the success of angiosperm diversification 
and radiation among many habitats, life forms, 
and biotic interactions; it has been estimated that 
70 % of all angiosperms had experienced one or 
more episodes of polyploidy in their ancestry. 
Subfunctionalization of gene duplicates has been 
shown to be a prominent molecular evolutionary 
force for coping with whole-genome duplication 
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in both flowering plants (Jiao et al. 2011) and 
ray-finned fish. The Hawaiian flora has the high- 
est incidence of polyploidy known, and most 
Hawaiian species are paleopolyploid most likely 
because dispersal into volcanic archipelagoes 
involves the colonization of habitats that change 
dramatically over relatively short periods of time. 
Molecular investigations of polyploid genome 
have indicated that intra-genomic rearrangement 
and altered gene regulatory relationships can 
contribute to evolutionary flexibility (see in Soltis 
et al. 2009). Several polyploidization events were 
documented in plants, 1.e., ¢ in the seed plant 
common ancestor, € in the LACA, o and p in 
monocotyledons, y in the eudicot ancestor, and « 
and 6 in Brassicaceae (Soltis et al. 2009; Jaio 
et al. 2011; Amborella Genome Project 2013, 
Fig. 7.14). A compilation of 2,802 angiosperm 
species has shown that haploid genome sizes 
range from ca. 120 Mb to over 148 Gb (Michael 
and Jackson 2013) with ~50 % of the flowering 
plants analyzed to date having genome sizes in 
the interval 120 Mb and 3.5 Gb (Bennett and 
Smith 1976; Leitch et al. 1998). Familiar exam- 
ples are rice, sorghum, potato, tomato, maize, 
barley, and wheat whose genome size are 450, 
750, 850, 950, 2,500, 5,000, and 16,000 Mb, 
respectively (Arumuganathan and Earle 1991; 
Lukas et al. 2009; The Potato Genome Consortium 
2011; The Tomato Genome Consortium 2012; 
Michael and Jackson 2013). Gymnosperms have 
highly repetitive genome of large (1-40 Gb, Ohri 
and Khoshoo 1986; Kovach et al. 2010; Burleigh 
et al. 2012), but rather homogeneous size (~40- 
fold variation compared to ~1,000-fold variation 
for angiosperms). 

Interestingly, genome reduction appears to be 
the dominant mode of evolution. Most lineages 
were apparently dominated by extensive loss of 
genes (Sankoff et al. 2010) and introns, and two 
distinct evolutionary phases can be distinguished: 
a short, explosive, innovation phase that led to an 
abrupt increase in genome complexity, followed 
by a much longer reduction phase (Fig. 7.15), 
which encompassed either a neutral ratchet of 
genetic material loss or adaptive genome stream- 
lining (Wolf and Koonin 2013). An example of 
plant genome reduction is sorghum whose 
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Fig.7.14 Overview of 
land plant phylogeny 
showing the relationships 
among major lineages of 
angiosperms. Putative 
polyploidy events in land 
plant evolution are 
indicated with red ellipses. 
The first whole-genome 
duplication (WGD) most 
likely occurred in the 
common ancestor of seed 
plants and is known as C. 
The next polyploidization 
events are known as € and 
y and occurred in the 
LACA and common 
ancestors of eudicots, 
respectively. Several other 
polyploidization events 
occurred more recently in 
Brassicaceae (a, 6) and 
Poaceae (6, p) indepen- 
dently as well as in other 
lineages (Modified from 
Amborella Genome Project 
(2013)) 


Fig.7.15 Model of 
punctuated evolution of 
genomes. Jop: alternance 
of species radiation burst 
over short time periods and 
long phases of diversifica- 
tion stasis in the history of 
a particular lineage. 
Bottom: the complexity 
profile of punctuated 
evolution shows sudden 
increased in genome 
complexity matching 
species radiation alternat- 
ing with long period of 
slow genome reduction 
(Modified from Wolf and 
Koonin (2013)) 
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chromosome number has been reduced from 10 
to 5. Similarly, the ancestor of A. thaliana 
(~120 Mb) went through three phases of genome 
reduction through chromosomal condensation to 
reach the final chromosome number of five. The 
ancestor of rice as well went from allotetraploid 
to diploid through genome shuffling and reduc- 
tion. Considering extant birds and lizards, their 
most recent common ancestor is expected to have 
lost nearly 20 % of its DNA content. Indeed, 
most of the correlations between recombination 
rate and genome contraction are also seen in the 
human genome (Nam and Ellegren 2012). 
Concerning isopycnic integration (Rynditch 
et al. 1998), nested retrotransposon families of 
different GC composition were remarkably well 
described in maize (SanMiguel et al. 1996; 
Meyers et al. 2001; Martienssen et al. 2004). Like 
maize, humans also contain 70 % _ repeat 
sequences (Pavlicek et al. 2001; Costantini et al. 
2012). Based on their structure, retrotransposons 
are divided into two groups: those that are flanked 
by long terminal repeats (LTRs) and those that 
are not flanked by LTRs (non-LTIR retrotranspo- 
sons) such as the long interspersed nuclear ele- 
ments (LINEs). LTR retrotransposons are further 
divided into two groups: Ty1 or copia (Peterson- 
Burch and Voytas 2002) and Ty3 or gypsy 
(Kidwell 2005). The major structural difference 
between the copia and gypsy groups is the order 
of the reverse transcriptase (RT) and integrase 
domains in their pol genes. In plants, Tyl/copia 
elements were first identified as insertions near 
maize genes, whereas the highly repetitive Ty3/ 
gypsy elements exhibit a preference for insertion 
into or near other repetitive elements (Bennetzen 
1996). Gypsy group elements exhibit similarities 
to retroviruses. Copia group sequences have been 
found in diverse species, including single-cell 
algae, bryophytes, gymnosperms, and angio- 
sperms. Gypsy-like elements have been reported 
in major taxonomic plant groups (pine, lily, 
maize, tomato, pineapple, rice, several angio- 
sperms and gymnosperms; see refs. in Friesen 
et al. 2001). In addition to these elements, short 
interspersed elements (SINEs) are nonautono- 
mous retroposons that also have successfully 
spread within the genome of almost all eukary- 
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otes. SINEs are ancestrally related to functionally 
important RNAs, such as tRNA, 5S rRNA, and 
7SL RNA; possess an internal promoter that can 
be recognized by the RNA polymerase III (pol 
II) enzyme complex; and are usually organized 
in a Monomeric or dimeric structure. Monomeric 
tRNA-related SINE families are present in the 
genomes of species from all major eukaryote lin- 
eages, and this organization is by far the most 
frequent. These elements comprise a 5’ tRNA- 
related region and a central region of unknown 
origin, followed by a stretch of homopolymeric 
adenosine residues or other simple repeats with 
the opposite organization. SINEs likely retropose 
by target site-primed reverse transcription 
(TPRT) using the enzymatic machinery of 
LINEs. Pol II SINE transcription is induced by 
cellular stresses such as heat shock, treatment 
with cycloheximide, DNA-damaging agents, and 
viral infections (see Pélissier et al. 2004 and refs. 
therein). The dispersion of retrotransposons fol- 
lows the master sequence model (Deragon et al. 
1994); 1.e., only a small number of loci that match 
the master sequences are responsible for the 
amplification of the family, whereas the majority 
of the members are inactive on the evolutionary 
timescale. The consensus sequence of each fam- 
ily represents the starting point of the process and 
is copied in the genome in a great number of 
inactive retroposons. Each member of the sub- 
family diverges according to compositional con- 
straints (Paces et al. 2004) after its dispersion 
from the master sequence. In contrast to ret- 
rotransposons, transposons (class II), such as 
miniature inverted-repeats (MITEs) (Jiang et al. 
2004), Touristes, Ac, Dc, and Mu, have a low rate 
of amplification (approximately 100 copies per 
genome). The amplification of class II transpo- 
sons relies on DNA, and they are associated with 
genes. 


7.8 Genome Collinearity 

Despite tremendous variability in genome size 
and chromosome number in angiosperm, com- 
parative genetic mapping has revealed genome 
collinearity (gene order conservation) between 
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species of the same family. Thus, the map-based 
cloning of genes of a crop plants with a large 
genome size can be made easier by comparing it 
with a species of the same family but having a 
small genome. Comparative genetic mapping 
between species of the Poaceae revealed exten- 
sive genome collinearity (Fig. 7.16), but collin- 
earity has also been observed in Brassicaceae, 
Solanaceae, pines, legumes (see in Gale and 
Devos 1998), and Euphorbiaceae (Chauhan and 
Sood 2013). If gene order tends to be conserved 
among species of the same family, it is not true 
for intergenic sequences, and micro-collinearity 
(collinearity at DNA level) can be hard to find. 
Numerous small rearrangements, insertions/dele- 
tions, duplications, inversions, and translocations 
have been detected. Duplications and subsequent 
gene loss might have occurred during the evolu- 
tion of the genomes being compared. The proba- 
bility of these event increases with the phylogenic 
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Fig. 7.16 Genome collinearity in Poaceae. The ancestral 
genome of Poaceae is represented by 19 linkage groups 
from rice aligned with the wheat and maize collinear 
regions (circles). The genome segments corresponding to 
chromosomes are themselves divided by dashed lines to 
indicate the linkage groups that are homologous among 
the three species. The numbers are for chromosomal loca- 





N. Carels 


distance but may also be consistent within the 
same family. Only five chromosomal inversions 
are needed to explain differences in marker orga- 
nization between the 12 tomato and potato chro- 
mosomes. In contrast, collinear regions span only 
few centimorgans in the comparison of A. thali- 
ana and Brassica nigra because of about 90 chro- 
mosomal rearrangements that occurred since the 
divergence of these two species. In distantly 
related species, it can be difficult to detect orthol- 
ogous genes, and noncollinear positioning of 
transposon sequences can increase the difficulty 
of collinear segment detection. Size differences 
in intergenic regions are not always explained by 
retroelement sequences, and lack of collinearity 
can be due to different processes such as chromo- 
somal rearrangements, unequal crossing-over, 
DNA slippage, point mutations, or gene conver- 
sions (Schmidt 2002). The rate of chromosome 
rearrangements detected by comparative map- 


tions. The alignment is based on the genetic map of the D 
genome of wheat. The linkage groups that form a part of 
wheat chromosome 5 are referred to by “pt” and linked by 
bridges. The chromosomes where a linkage group is 
inserted within another are indicated by arrows showing 
the insertion point (Modified from Bennett (1996)) 
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ping of species pairs belonging to different fami- 
lies has been calculated to be 0.14 (+0.06) 
structural mutations per chromosome and per 
million years of divergence on average. By 
extension, it was estimated that 43-58 % of chro- 
mosomal tracts of <3 cM should remain collinear 
over a period of 130-200 million § years. 
According to these predictions, distantly related 
species, such as those of monocotyledon and 
dicotyledon that diverged about 130-200 Mya, 
should share small collinear chromosome seg- 
ments, which is actually the case (Paterson et al. 
1996). Collinearity has also been shown between 
the genomes of grape, peach, and cacao 
(Amborella Genome Project 2013). 

Comparing sea anemone (Nematostella vec- 
tensis) and human genomes, which evolved inde- 
pendently during approximately the last 0.7 Gya, 
Putnam et al. (2007) found that more than 30 % 
of conserved linkage groups accounted for a 
large fraction of the putative ancestral eumeta- 
zoan set. Typically, 40-50 % of genes from large- 
scale conserved collinearity have anemone or 
human counterparts and vice versa. Based on the 
genes conserved among anemones and humans, 
the LECA genome was not very different from 
ours. The putative LECA genome was intron-rich 
and contained the nearly complete toolkits for 
animal biochemistry and development (which 
can now be recognized as pan-eumetazoan), as 
well as the core gene set required to perform 
sophisticated neural and muscular functions. The 
common ancestor had blocks of linked genes that 
remain together in the modern human and anem- 
one genomes. An ancestral linkage group of par- 
ticular interest includes the human Hox clusters 
of homeobox transcription factors that regulate 
anterior—posterior identity in bilaterians (see in 
Putnam et al. 2007). 


7.9  Multicellularity 

Many requirements for multicellular organiza- 
tion (cell adhesion, cell-cell communication and 
coordination, programmed cell death) likely 
evolved in ancestral unicellular organisms. 
However, the evolution of multicellular organ- 
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isms from unicellular ancestors may be opposed 
by genetic conflicts that arise when mutant cell 
lineages promote their own increase at the 
expense of the integrity of the multicellular 
organism. There are only three lineages that pro- 
duced complex organisms, all from the eukaryote 
superkingdom: plants, animals, and _ fungi. 
Although the transition to multicellularity may 
be relatively easy, obstacles to the evolution of 
multicellular complexity are less well under- 
stood. Size initially favored can still promote the 
persistence of multicellularity. Increased size 
may have been favored to (i) provide storage 
reserves when nutrients are limiting, (11) expand 
feeding opportunities, (111) generate an internal 
environment protected by an external layer of 
cells, (iv) allow novel metabolic opportunities, or 
(v) enhance motility for dispersal or foraging 
(see in Grosberg and Strathmann 2007). In addi- 
tion, some key metabolic processes cannot con- 
currently take place within a single cell. Division 
into germ and soma in Volvocacean green algae 
indicates benefits from size increasing in the 
course of evolution. The advantages of multicel- 
lularity depend upon cooperation among cells, 
but cooperation also leads to conflicts between 
genetically distinct cell lineages, some of which 
may be devoted to efficacy and others to selfish 
proliferation (for instance, in cancers). The price 
to pay for multicellularity is a sophisticated 
molecular machinery to manage complexity that 
relies on homeobox (Hox) genes, signaling path- 
ways, and DNA methylation. 


7.9.1 Homeobox Genes and Body 


Development 


In animals, tetrapods have four Hox clusters that 
arose by duplication on the vertebrate stem 
(HoxA, HoxB, HoxC, and HoxD). Hox genes are 
typically found together in a single complex on a 
chromosome. Hox genes promote the identity of 
body parts or segments along the anterior—poste- 
rior axis of the embryo in the same order in which 
they lie on the chromosome. Most Hox genes 
interact with each other to maintain proper 
expression domain boundaries by both positive 
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and negative regulation (Miller et al. 2001). In 
general, a more posterior Hox gene will suppress 
the expression or the function of a more anterior 
one (Garcia-Fernandez 2005). Genes within Hox 
clusters show temporal collinearity in addition to 
spatial collinearity (anterior genes are expressed 
earlier than posterior genes) and are regulated by 
long-range controlling enhancer elements 
grouped into islands acting on several neighbor- 
ing Hox genes in a coordinated manner. The vari- 
ous islands contact each other and form 
megabase-scale regions of chromatin interactions 
(topological domains). As outlined above, such 
regulatory landscape was identified within gene 
deserts and displays intricate expression patterns 
(Montavon and Duboule 2013). From an evolu- 
tionary perspective, Hox genes are important 
because Mendelian inheritance of mutations in 
regulatory genes and nonlethal mutations in the 
recessive state may explain the punctuated evolu- 
tion observed in fossil records (Schwartz 1999). 
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Fig. 7.17 Land plant phylogeny showing predicted 
developmental trait acquisition at each node (green “S” 
and “G” for sporophyte and gametophyte, respectively). 
The approximate time of divergence is given at each node 
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In plants, Hox genes range from 6 to 148 in 
number according to the species considered 
among flowering plants, Selaginella, and mosses 
(Figs. 7.17 and 7.18). Plant Hox genes can be 
unambiguously classified by sequence evolution- 
ary analysis into 14 distinct classes that are also 
characterized by their conserved intron—exon 
structure and by unique co-domain architectures 
that were already differentiated in the last com- 
mon ancestor of mosses and vascular plants 
420 Mya (Dolan 2009) and evolved subsequently 
with tunned function in ferns (Huang et al. 2014), 
gymnosperms (Becker et al. 2003), and angio- 
sperms (Kater et al. 2006). The total number of 
Hox genes found among plant genomes reflects 
the complex patterns of genome duplication and 
gene duplication or loss that have characterized 
the evolutionary histories of these species and the 
higher developmental and organizational com- 
plexity in angiosperms than in mosses and uni- 
cellular algae (Mukherjee et al. 2009). 
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in blue. The bar on the right indicates class divisions, 1.e., 
A angiosperms, G gymnosperms, M monilophytes, L 
lycophytes, B bryophytes (Modified from Langdale and 
Harrison (2008)) 
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Fig.7.18 Model of the 
putative evolutionary 
history of plant homeobox 
gene classes and co- 
domains. A class or motif 
represented in a parental 
branch indicates that the 
same class/motif is also 
present in all of its child 
branches unless otherwise 
indicated. The HD-ZIP I to 
IV classes are represented 
by the single domain 
architecture HD-LZ- 
CPSC-START-HD-SAD- 
MEKHLA. Loss of the 
START domain in the 
genomes of unicellular 
green algae and red algae 
is represented by the 
domain crossed in red. The 
arrow indicates the 
MEKHLA domain 
acquisition through 
Cyanobacteria and 
chloroplasts (Modified 
from Mukherjee et al. 
(2009)) 
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In contrast to animals, plants entail the 
continuous development of new organs as time 
progresses (Geuten and Coenen 2013), which has 
implications for the type of regulatory elements 
used by Hox plant genes compared to animals. A 
number of genes that encode proteins involved in 
determining and maintaining the identity of 
shoot, root, vascular, and flower meristem cells 
have been identified in plant systems: 

(1) Plant apical growth. Angiosperm shoot api- 
cal meristems (SAM) are layered; slowly 
dividing stem cells are surrounded by a 
peripheral morphogenetic zone of more rap- 
idly dividing cells that are recruited for the 
differentiation of stem and leaf tissues. 
Growth is sustained through the activity of 
the WUSCHEL (WUS)/CLAVATA (CLV) 
pathway. The apical dominance is under a 
complex gene balance. In addition to the 
KNOX inhibitory effect and regulation by 


| HD |(@GPSG] START CDESC) 


(11) 


165 


selaginella 








Moss 


Unicellular 
green 
Algae 


HD 
HD |SAWADEE 
NDXA| HD |NDXB 
Td 


PLING] Ho 


S) sRT 


= Cyanobacteria 
TALE HD 


BEL AS 


miRNAs, AtMBD9 controls gene expression 
by chromatin structure modification through 
histone acetylation and by decreasing whole 
DNA methylation level. AAMBD9Y inactiva- 
tion leads to altered expression of a number 
of genes related to the axillary branching 
and flowering pathways (Yaish et al. 2009). 
Leaves. In angiosperms, leaf growth is finite 
and usually exhibits an adaxial/abaxial 
asymmetry whose opposing developmental 
programs coexist at the shoot apex. Class I 
KNOX genes are downregulated in the leaf 
primordia at the periphery of the SAM by 
members of the myb-domain protein family 
including ASYMMETRIC LEAVES! (AS/) 
in A. thaliana. Leaf blade polarity is estab- 
lished by the coordinated expression of a 
number of different transcriptional regula- 
tors. In Arabidopsis, factors such as AS] and 
AS2 myb-factors, zipperlII (HD-ZIP I) 
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transcription factors, and PHABULOSA 
(PHAB), PHAVOLUTA  (PHAV), and 
REVOLUTA (REV) genes confer adaxial 
identity while YABBY and KANADI genes 
confer abaxial identity (Friedman et al. 
2004). 

Vascular system. The activity of the two 
transcription factors VASCULAR 
RELATED NAC-DOMAIN6 (VND6) and 
VND7 is sufficient to induce transdifferen- 
tiation of various cells into protoxylem or 
metaxylem vessel elements in Arabidopsis. 
Roots. The developmental program associ- 
ated with root apical meristems (RAM) is 
similar to that of SAM. There is a reiteration 
of the receptor-like kinase pathway in RAM, 
and ligands of the CLV3/ESR-related (CLE) 
gene family are involved. A homologous 
WUS-like homeodomain gene, WUSCHEL- 
related homeobox5 (WOXS5), is_ also 
expressed in A. thaliana roots. Two mem- 
bers of the GRAS-transcription factor fam- 
ily, SCARECROW (SCR) and 
SHORTROOT (SHR), are required for the 
establishment of the endodermal layer in 
both roots and shoots (Friedman et al. 2004). 
Flowers. The growth of the flower’s indi- 
vidual organs has been modeled using the 
ABC model with the aim to assign homeotic 
genes as determinant of organ’s identity in 
order to describe the biological basis of the 
developmental process. This model was 
then extended with D and E. Each letter 
depicts the pattern of floral organ develop- 
ment. Floral primordia are arranged into 
successive whorls of sepals, petals, stamens, 
and carpels, all these organs being homolo- 
gous to leaves. The A genes act as determi- 
nants of sepal—petal identity and contribute 
to the repression (APETALAI/ and 
APETALA2) of B, C, and E genes. The B 
(APETALAS and PISTILLATA) and C 
(AGAMOUS) genes establish the identity of 
stamens and carpels. The D (FLORAL 
BINDING PROTEIN7 and FLORAL 
BINDING PROTEINIL) genes specify the 
identity of ovules as a separate reproductive 
feature compared to carpels. The E genes 
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function as a physiological requirement nec- 
essary for the development of the three 
innermost whorls. When D function is lost, 
the structure of ovules becomes similar to 
that of leaves, and when E function is lost, 

all verticil structures are similar to leaves. 
The A, B, C, and D floral genes are MADS- 
box genes, which encode transcription factors 
involved in controlling all major aspects of devel- 
opment. Animal and fungal genomes generally 
possess only around one to five MADS-box 
genes while angiosperm genomes have around 
100 MADS-box genes. The MADS-box family 
has been divided into two main groups. Type I 
includes ARGSO/SRF-like genes of animals and 
fungi, also designated as M type genes in plants, 
while type II includes MEF2-like genes of ani- 
mals and yeast as well as MIKC-type genes of 
plants. It is proposed that an ancestral duplication 
before the divergence of plants and animals gave 
rise to these groups. The MIKC-type genes are 
also characterized by the presence of the K 
domain that could have evolved after the diver- 
gence of these lineages. Type II genes have been 
categorized into MIKCC and MIKC* based on 
structural features. MIKCC genes have been fur- 
ther classified into 14 clades based on phylogeny. 
Type I genes have also been categorized into M 
(a, B, y, 6) and N types based on their protein 
motifs as well as on their phylogenetic relation- 
ships (Arora et al. 2007). Members of the plant 
class I MADS-box domain subfamily from 
Arabidopsis includes 61 genes from which only 
PHERESI, AGAMOUSLIKES80 — (AGLSO), 
DIANA, AGL62, and AGL23 were functionally 
characterized and revealed important roles for 
these genes during female gametophyte and early 
seed development. A complete atlas of their 
expression patterns in Arabidopsis revealed that 
the expression of class I MADS-box genes is pre- 
dominantly observed in the central cell, antipodal 
cells, and chalazal endosperm (Bemer et al. 
2010). Seventy-five rice MADS-box genes were 
also characterized for expression profiling by 
microarray and were mapped along rice chromo- 
somes showing some regional segregation 
between MIKC and M types (Arora et al. 2007). 
In seed plants, the MIKCC shows considerable 
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diversity, but the MIKC* has only two subgroups, 
P and S clades, which show conserved expression 
in the gametophyte. All three MIKC* genes are 
expressed late in pollen development (Liu et al. 
2013). The absence of class I MADS-box genes 
in gymnosperms suggests that they were recruited 
in basal angiosperms for the evolution of the 
double fertilization process (Bemer et al. 2010). 
In contrast to A, B, C, and D genes, E genes 
belong to miRNAs. miR/56 and miR172 act as 
the main players in the regulation of develop- 
mental timing in flowering plants. In early stages 
of development, miR/56 levels are high and, 
then, decrease during plant development, while 
miR172 shows the opposite pattern. These two 
microRNAs contribute to both the juvenile—adult 
phase transition and the transition to flowering 
through their sequential and antagonistic actions. 
miR156 represses targets of the SQUAMOSA 
PROMOTOR BINDING PROTEIN-LIKE (SPL) 
gene family and maintains juvenile features. 
When miR156 levels decline, the level of SPL 
proteins increases, miR/72 is activated, flowering 
genes are activated, and adult leaf features are 
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induced. When miR/72 levels are increasing, the 
APETALA2-like (AP2-like) genes are progres- 
sively silenced and adult leaf traits as well as 
flowering are induced (see in Geuten and Coenen 
2013). More miRNAs are being described as 
important players in plant development as 
expected from their early evolution (Fig. 7.19). 
Interestingly, extensive RNA silencing can be 
generated, in plants, through ncRNA diffusion 
from cell to cell through plasmodesmata and 
from cell to organs over long distances through 
the vascular phloem. For example, the trans- 
acting siRNA (ta-siRNA) silencing activity of 
miRNA165/166 over a short distance is crucial 
for the regulation of the HD-ZIP II transcription 
factor in a dosage-dependent manner in roots and 
is also involved in xylem patterning. The extent 
of silencing through plasmodesmata 1s limited to 
as few as 10—15 cells beyond the site of initiation. 
Silencing over greater distances may be achieved 
by an amplification relay. A signal initiated in a 
few cells of a source leaf can be actively spread 
over longer distances through phloem to induce 
silencing in sink leaves as 1s the case for disease 
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Fig.7.19 An overview of the emergence of some important ncRNAs in eukaryotes (Modified from Avesson (201 1)) 
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control by grafting the shoot of one species on 
the root system of another (see in Uddin and Kim 
2011). 

Signaling pathways evolved by gene duplica- 
tion and subsequent protein sequence divergence. 
Plants have unique features that evolved in 
response to their environments and ecosystems. 
The protein-protein interaction (PPI) patterns in 
Amborella are generally consistent with those in 
other angiosperms and show clear differences 
from those in gymnosperms. Evolutionary shifts 
in PPI patterns after gene duplications along with 
changes in gene sequence and expression pat- 
terns have likely been crucial for functional inno- 
vations in the regulatory network for reproductive 
organ development and the flower origin as well 
as for functional diversification of the many floral 
forms among lineages of angiosperms (Amborella 
Genome Project 2013). The complete Arabidopsis 
protein—protein interactome, excluding isoforms, 
is estimated to be 299,000+79,000 binary inter- 
actions. Despite the fact that the Arabidopsis 
interactome is estimated to be larger than those of 
yeast, worms, or humans (Brown and Jurisica 
2005), the number of interactions per possible 
protein pair is similar among all four species 
(5-10 per 10,000). The dynamic rewiring of 
interactions following gene duplication events 
appears as an evidence that evolution acted upon 
interactome networks. The difficulty to unveil 
network interactions in plants is because more 
than 60 % of the protein-coding genes of the 
model plant A. thaliana remain functionally 
uncharacterized (Arabidopsis — Interactome 
Mapping Consortium 2011). 


7.9.2 DNA Methylation 


Dynamic DNA methylation is important in main- 
taining the epigenomic plasticity to enable effi- 
cient and coordinated responses to developmental 
signals and environmental stresses. DNA meth- 
ylation refers to the covalent addition of a methyl 
group (—CH3) to the base cytosine (C) in the 
dinucleotide CpG to form 5mC. Most CpG dinu- 
cleotides in the human genome are methylated. 
However, unmethylated CpGs are not randomly 
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distributed but are usually clustered together in 
CpG islands in the promoter region of many 
genes. 

A relatively high degree of nuclear DNA 
methylation is a specific feature of plant genomes. 
Targets for cytosine DNA methylation in plant 
genomes are CG, CHG, and CHH (H is A, T, C) 
sequences. DNA methylation in plants is species, 
tissue, organelle, and age specific; it is involved 
in the control of all genetic functions including 
transcription, replication, DNA repair, gene 
transposition, and cell differentiation. DNA 
methylation is engaged in gene silencing and 
parental imprinting; it controls the expression of 
transgenes and foreign DNA in cells. Plants have 
a much more complicated and sophisticated sys- 
tem of genome methylations compared to ani- 
mals. The level of methylation ranges from 3 to 
8 % of cytosines in vertebrates and from 6 to 
30 % in plants. Considering Arabidopsis, about 
19 % of the entire genome sequence is methyl- 
ated. As expected, extensive DNA methylation is 
found in the heterochromatic centromeric and 
pericentromeric regions, reflecting the dense 
methylation of transposons and other repetitive 
sequences. Consistent DNA methylation is also 
found in euchromatic regions, both in intergenic 
regions and genes, but promoter methylation 
seems to be more important for downregulating 
gene expression than CDS methylation. The rela- 
tive distribution of SmC residues between differ- 
ent sequence contexts is 55 % in CG, 23 % in 
CHG, and 22 % in CHH. Thus, nearly half of all 
cytosine methylated sites throughout Arabidopsis 
genome are non-CG ones. Most CG sites are 
either unmethylated or highly methylated (80- 
100 %), CHG sites are unmethylated or partially 
(between 20 and 100 %) methylated, and CHH 
sites are either unmethylated or lowly methylated 
(about 10 %). All three methylation types are 
found in repeat-rich pericentromeric regions, 
whereas gene bodies contain almost exclusively 
CG methylation. Among all expressed genes, 
more than ~60 % appeared to be entirely unmeth- 
ylated and about ~5 % are methylated at their 
promoter regions (within 200 bp upstream of 
transcription start site). About 33 % of genes are 
methylated in their CDSs and unmethylated in 
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their promoter regions (body methylated). The 5’ 
and 3’ flanking sequences of most body- 
methylated genes tend to be relatively hypometh- 
ylated. Body-methylated genes tend to be highly 
and constitutively expressed; many genes coding 
for enzymes occur in this category. Genes with 
methylated promoters tend to be expressed at a 
relatively lower level, often in a tissue-specific 
fashion (see in Vanyushin and Ashapkin 2011). 
In Arabidopsis, establishing methylation in all 
sequence contexts is entirely dependent on the de 
novo methyltransferase activity by DOMAINS 
REARRANGED METHYLTRANSFERASE 
(DRM) (similar to the mammalian Dnmt3 fam- 
ily), which functions through the RNA-directed 
DNA methylation (RdDM) pathway. The 
Arabidopsis MET1 (a homologue of the Dnmt/ 
that is responsible for maintenance of CG meth- 
ylation in mammals) is controlling CG methyla- 
tion and also likely acts in the maintenance of 
non-CG methylation. METI is responsible for 
60 % of methylation. CHG methylation is mainly 
maintained by the plant-specific 
CHROMOMETHYLASE 3  (CMT3) and 
KRYPTONITE (SUVH4). These enzymes are 
important to warrant the maintenance of preex- 
isting methylation patterns after DNA replication 
(see in Zhang et al. 2010), which is referred to as 
the passive mode of cytosine removal. In the 
active mode of demethylation, the methy! mark is 
removed by DNA glycosylases (DME and ROS1) 
that are directed toward methylated cytosines 
even in the absence of DNA replication. 
Methylation levels change during plant devel- 
opment with the highest methylation levels being 
observed in senescence tissues. The methylation 
pattern is largely stable over multiple generations 
in plants and is consistent with genomic imprint- 
ing. Genomic imprinting refers to the parental 
specific expression of a gene. This process 
evolved independently in flowering plants and 
mammals. In mammals and flowering plants, 
imprinting occurs in the embryo and its nourish- 
ing tissues, 1.e., placenta and endosperm, respec- 
tively. Imprinted genes are differentially marked 
in the gametes before fertilization, which makes 
maternal and paternal chromosomes functionally 
different. Imprinting is mainly accomplished by 
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the active removal of methylated cytosines by 
DNA glycosylases, rather than through sequen- 
tial genome-wide erasure followed by the tar- 
geted de novo methylation of paternal alleles at 
the imprinted loci as is the case in mammals. 
Maternal DME activity is required for develop- 
ment of viable seeds, which reflects its expres- 
sion specificity or imprinting in the central cell of 
the female gametophyte. In plants, imprinting 
might have evolved as a by-product of a defense 
mechanism destined to control transposon activ- 
ity in gametes. A selective advantage of this pro- 
cess might have been that imprinted genes control 
the nutrient flow from the mother to the offspring 
(see in Kohler and Weinhofer-Molisch 2010). 
Mechanistically, methylated cytosines are capa- 
ble of attracting methyl-binding proteins (MBD), 
which in turn recruit histone deacetylases and 
chromatin remodeling proteins to form a com- 
plex, which hinder the binding by transcription 
factors. At least six Arabidopsis MBD genes 
(AtMBDI, 2, 4, 5, 6, and 7) exhibited specific 
binding activity for methylated CG sequence 
in vitro (Zhang et al. 2010). The tendency to be 
inherited through multiple generations seems to 
be one of the major differences between DNA 
methylation and other epigenetic modifications, 
such as histone modifications and chromatin 
remodeling. Adaptive responses to environmen- 
tal factors may be conferred by a local change in 
DNA methylation and be stable for multiple plant 
generations. However, many regions of an artifi- 
cially demethylated genome gradually regain ini- 
tial methylation in few generations. A classical 
example of adaptive epimutation is a naturally 
occurring flower mutant of Linaria vulgaris 
changing flower symmetry from bilateral to 
radial. This mutation was first described by 
Linnaeus but only recently was shown to be a 
result of an epigenetic modification. This epimu- 
tation rarely reverts to wild-type flowers. Thus, 
an epigenetic mutation may be heritable for hun- 
dreds of generations and play a significant role in 
evolution. Other examples relative to disease and 
salt resistance were also given (Vanyushin and 
Ashapkin 2011). 

Comparison of methylated genome sequences 
and known collections of Arabidopsis smRNA 
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sequences showed that the majority of siRNAs 
correspond to heavily methylated DNA 
sequences, which is not surprising since siRNA 
in plants is preferentially associated to intergenic 
sequences such as retrotransposons and other 
repetitive elements. In wild-type plants 85.4 % 
smRNA loci contained, at least, one 5mC. Overall 
half of all cytosines are methylated at these loci, 
whereas about 4 % of all cytosines are methyl- 
ated at non-smRNA loci. However, only a third 
of all methylated cytosines are associated with 
smRNA loci (see in Vanyushin and Ashapkin 
2011). The 24 nt siRNAs cause epigenetic silenc- 
ing by directing de novo cytosine methylation 
through the RdDM pathway, where the siRNAs 
are generated by the action of the putative DNA- 
directed RNA polymerase Pol IV, RDR2 (RNA- 
dependent RNA polymerase 2), and DCL3 
(Dicer-like 3). The siRNAs are then loaded into 
AGOs to direct cytosine methylation by the de 
novo DNA methylase DRM2 (see in Zhang et al. 
2010). 


7.10 Synthetic Biology 


Synthetic biology is an emerging field focused on 
the engineering of biomolecular and cellular sys- 
tems for a variety of medical or industrial appli- 
cations for product development (Ruder et al. 
2011); this field came into reality with genetic 
engineering in the 1970s. After plasmid cloning, 
signals of a newly emerging era were seen in arti- 
ficial chromosomes, which were first constructed 
in budding yeast (YAC, Murray and Szostak 
1983) and Escherichia coli (BAC, O’Connor 
et al. 1989). BACs and YACs have proven invalu- 
able for the sequencing, mapping, and character- 
ization of many large genomes due to their utility 
in cloning large DNA fragments. In plants, 
telomere-mediated truncation coupled with the 
introduction of site-specific recombination cas- 
settes has been used to produce mini- 
chromosomes consisting of little more than a 
centromere. The idea is to use synthetic chromo- 
somes to add multiple transgenes, multigene 
complexes, or entire biochemical pathways to 
plants to change their properties for agricultural 
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applications or to use plants as factories for the 
production of foreign proteins or metabolites 
(Gaeta et al. 2012). 

Synthetic biology is now moving into an era 
of whole-genome synthesis. The genome of 
Mycoplasma mycoides has been synthesized 
in vitro based on in silico data, assembled in 
yeast, and reintroduced into recipient cells, 
thereby giving rise to a self-replicating cell line 
(Gibson et al. 2010). The cell lines behaved nor- 
mally in culture and had a relatively stable phe- 
notype, although they contained some mutations 
introduced during their development. This study 
demonstrated that genetically functional living 
cells can be produced from computer-designed 
genome annotations. 

The applications of synthetic genomics now 
extend beyond the engineering of genes to 
include whole metabolisms, regulatory networks, 
and even ecosystems. Wild cells use genetic cir- 
cuits comprising interacting genes and proteins 
to implement functions including growth and 
division, signaling, and differentiation. A major 
goal of synthetic biology is to develop biological 
design by building circuits that are not able to 
survive in the natural world but that have biotech- 
nological applications through their ability to 
perform processes at low energy cost. Engineered 
organisms are expected to change our lives in the 
near future, leading to cheaper drugs, biofuels, 
biomaterials, and targeted treatments against 
infective diseases and physiological dysregula- 
tion, such as cancer (Khalil and Collins 2010). 
One can already often modify (1.e., rewire) parts 
of wild circuits, thus providing insights into the 
design principles of native architectures. 
Rewiring and building autonomous and _ inte- 
grated circuits de novo will become one of the 
primary tools to understand, control, envision, 
and create new biological systems for biotechno- 
logical or medical purposes (Nandagopal and 
Elowitz 2011; Ruder et al. 2011). Synthetic con- 
struction methods have applied to genetic 
switches, memory elements, and oscillators, as 
well as to other electronics-inspired genetic 
devices, including pulse generators, digital logic 
gates, filters, and communication modules 
(Khalil and Collins 2010). 
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The boot-up and propagation of synthetic 
information involves its transcription, translation, 
replication, and regulation to produce the desired 
effect or phenotype (Shimizu et al. 2001), which 
has ultimately led to the encapsulation and 
expression of DNA in artificial cells (Amidi et al. 
2011; Hosoda et al. 2008; Kuruma et al. 2009; 
Murtas 2009). Artificial cell engineering 
(Noireaux et al. 2011; Noireaux and Libchaber 
2004) has led to the encapsulation of entirely 
reconstituted FE. coli translational machinery (the 
PURE system, Shimizu et al. 2001) inside vesi- 
cles (Luisi et al. 2010; Pereira de Souza et al. 
2009). Consequently, the need to develop meth- 
ods for the propagation of DNA and vesicles has 
led to the production of an advanced protocell 
that links the process of DNA replication to the 
multiplication of encapsulating vesicles (Kurihara 
et al. 2011). However, synthetic genomics may 
involve a variety of DNA templates from a vari- 
ety of different organisms, thus complicating the 
process. This creative endeavor requires the 
assistance of software, which is being progres- 
sively written (Xia et al. 2011), to assist the entire 
genome design process (Khalil and Collins 
2010). The task of synthetic genome design is 
conceptually similar to the task of computer pro- 
gram writing, involving version management, 
tracking of dependencies, metadata management, 
and tools that promote code reusability. 
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Abstract 

Arguably, the two most important organelles of the eukaryotic cell are not its 
original inhabitants. Mitochondria and chloroplasts have evolved by endo 
symbiosis and have a prokaryotic ancestry. This is reflected perfectly in their 
genome organization and the basic functioning of their genetic system. Over 
the long course of evolution, they have given away most of their genes. The 
genes were either lost because they were no longer needed or were relocated 
to the nucleus. Nuclear transfer of their DNA is an ongoing process, and the 
two organellar genomes are still evolving. The DNA sequences are in a con- 
stant state of motion because of interorganellar and horizontal gene transfer 
within and between different plant species. In contrast to the chloroplast 
genomes, which are much conserved and do not generally exhibit any struc- 
tural or functional anomalies, mitochondria are notorious for their lack of 
synteny, frequent DNA rearrangements, constantly varying intergenic regions 
and accumulation of heterologous DNA sequences. Since the nucleus cannot 
work in isolation, all the random and mischievous genetic activities of organ- 
ellar genomes have had a direct or indirect impact on nuclear genome evolu- 
tion. They act as one of the major mechanisms by which genetic novelty is 
brought about in the nuclear genome. As they say, to produce good music, 
both black and white chords have to work in harmony. Similarly, irrespective 
of individual nature of the three genomes in a cell, these three always work in 
harmony for optimum functioning of the plant cell. 
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8.1 Introduction 


From freedom to captivity, 

From securing a home to losing their identity, 
They were destined to live in servitude. 

Yet they fought, fought hard for their genomic 
treasure, 

Came out winners and then stayed on with 
pleasure 


The lines above broadly summarize the origin of 
mitochondria and chloroplast by endo symbiosis 
and their evolution as semi-autonomous organ- 
elles in the modern eukaryotic cell. The mito- 
chondria arrived first, followed by the plastids. 
Their genomes have undergone a sea change 
from their free-living ancestors to the present 
day, highly reduced and tightly regulated organ- 
ellar genomes of the much evolved flowering 
plants. Both the organelles had to lose a major 
share of their genes to the nucleus, yet they man- 
aged to retain some of the most essential ones in 
their genome compliment. They have also 
evolved a complex transcriptional and transla- 
tional machinery to coordinate their gene expres- 
sion with the nuclear genome. The field of 
organellar genome research has accelerated a lot 
in the past two decades with the availability of 
whole genome sequences of various plant spe- 
cies. It has helped in the better understanding of 
their genomic structure, organization, gene con- 
tents, sequence evolution and the _ genetic 
exchanges between the nucleus and organelles 
and also amongst the organelles. 

Comparative analyses of the chloroplast 
genomes suggest that over the course of evolu- 
tion, they have remained much conserved in their 
structure, gene composition and gene order 
(Barbrook et al. 2010). It’s just the opposite in the 
case of plant mitochondrial genomes. They 
exhibit a highly variable genome, show drastic 
size differences, contain many introns and repeat 
elements and experience frequent genomic rear- 
rangements due to recombination at the repeat 
sequence sites (Knoop 2012). Also, the mito- 
chondrial intergenic regions receive promiscuous 
DNA originating from both nuclear and chloro- 
plast genomes. Such anomalies and _ their 
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evolutionary significance continue to baffle the 
geneticists and the evolutionists alike. 

So, in the same chronological order in which 
they had first appeared in their eukaryotic host 
cell, we will discuss in brief various aspects of 
the two organellar genomes, their complexities 
and their motive for still surviving inside the 
organelles. 


8.2 | Endosymbiotic Evolution 
of Organelles and Their 


Genome 


Plastids and mitochondria are semi-autonomous 
organelles in a eukaryotic cell which perform the 
bioenergetic functions of photosynthesis and 
respiration, respectively. Till two billion years 
ago, they lived as free-living prokaryotes. So, 
they have not been together forever. The concept 
of their endosymbiotic origin in eukaryotic cells 
from the prokaryotic ancestors was laid more 
than a hundred years ago by a Russian botanist 
von C. Mereschkowski (1905) in his published 
work Uber Natur und  Ursprung der 
Chromatophoren im Pflanzenreiche (eventually 
translated in English by Martin and Kowallik in 
1999). It went unnoticed and unappreciated for a 
long time. In fact it took almost another 70 years 
for the idea to be rediscovered by Margulis 
(1970) and for the evidences to show up in sup- 
port of the theory. It began when Lynn Margulis 
first saw cytoplasmic incorporation of labelled 
DNA precursors in amoeba (unicellular eukary- 
otes) but could not immediately relate her obser- 
vations with the organelles. As the similarities 
between the organelles and bacteria became 
more apparent, she got convinced that the 
hypothesis was true and that is how the theory of 
endo symbiosis was rediscovered and presented 
with more conviction the second time (Wells 
2005). 

Endosymbiotic theory states that mitochon- 
dria and chloroplasts have originated from 
free-living prokaryotes (endosymbionts) — 
a-Proteobacteria and Cyanobacteria, respec- 
tively — when they entered into a symbiotic rela- 
tionship with a host cell (Gray 1999). This was 
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followed by a drastic reduction in their genome 
size. From harbouring thousands of genes in their 
prokaryotic ancestors, they were reduced to a few 
essential genes in their newly evolved organellar 
genomes. In fact in mitochondria, the number has 
sunk further down to below a hundred (Unseld 
et al. 1997). 

It is now an accepted fact that inception of the 
two organelles had occurred independently of 
each other, with mitochondria preceding chloro- 
plasts. As far as the nature of the host cell is con- 
cerned, it is still controversial. Although the 
classical endosymbiotic theory suggests that it 
was an amitochondriate eukaryotic cell which 
played as a host to our mitochondrial ancestor 
a-Proteobacterium, there is no evidence yet 
which proves the existence of an amitochondriate 
phase in eukaryotic evolution. So, it might be 
possible for them to have a concurrent origin 
(Gray and Archibald 2012). As for the chloro- 
plasts, the host was a eukaryotic phagotroph 
(Douglas 1998). 

Although there is an overwhelming molecular 
data to help understand explicitly even the most 
intricate details of the endosymbiotic theory of 
organellar evolution, some fundamental ques- 
tions pertaining to the earliest events leading to 
the establishment of organelles in the hypotheti- 
cal eukaryotic host cell are still unanswered 
(Gray and Archibald 2012). 


8.3. Organellar Genome 


8.3.1 Mitochondrial Genome 

In spite of the recent advances in next-generation 
sequencing methodologies, sequencing plant 
mitochondrial genomes (also known as plant 
chondromes) has not been as successful as their 
plastid counterparts in terms of the number of 
species whose chondromes have been completely 
sequenced. As per the NCBI organelle genome 
resources database, around 94 plant chondromes 
have been sequenced so far of which flowering 
plants account for just 37 (Shearman et al. 2014). 
Plants have a very large and complex mitochon- 
drial genome that exhibits enormous variation in 
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its size and structure and has long-endured 
frequent genomic rearrangements 1n its DNA. All 
these factors, along with a large number of repeti- 
tive sequences accumulated by the plant chon- 
dromes, made the sequencing of plant 
mitochondrial genomes very difficult (Zhang 
et al. 2011). It is because of these technical diffi- 
culties that it took 12 years for the first angio- 
sperm mitochondrial genome (Arabidopsis) to be 
sequenced (Unseld et al. 1997) after the first 
chloroplast genome (Nicotiana tabacum) was 
sequenced in 1985. 

Plant chondromes are characteristically 
peculiar in many ways. Firstly, in contrast to 
their metazoan counterparts that average just 
about 16 kb in size, plant chondromes are enor- 
mously large double-stranded (ds) DNA mole- 
cules showing dramatic size variation, ranging 
between 200 and 2,900 kb in land plants, though 
their coding capacity (~50O—60 genes) is surpris- 
ingly very low (Unseld et al. 1997; Marienfeld 
et al. 1999; Handa 2003; Sugiyama et al. 2005). 
The enormous genome size is partly due to the 
accumulation and duplication of repetitive 
sequences and partly due to DNA intrusion from 
the intracellular nuclear and plastid genomes 
(Kitazaki and Kubo 2010). It may also receive 
DNA sequences by horizontal gene transfer 
(HGT) (Bergthorsson et al. 2003). Other fea- 
tures include extensive intramolecular recombi- 
nation leading to frequent genomic 
rearrangements and presence of a number of 
interrupted genes that require trans-splicing at 
the RNA level (Knoop 2012). Above all, they 
are still reeling under the ongoing evolutionary 
process of DNA transfer to the nucleus. 
Combined with the high dose of RNA editing, 
trans-splicing, low mutation rates and the fact 
that it has a rapidly evolving genomic structure 
yet the slowest in terms of sequence evolution 
amongst the three plant cell genomes (Palmer 
and Herbon 1988; Knoop 2004), plant chon- 
dromes make for an interesting study subject to 
trace phylogeny of early land plants (Knoop 
2004). These striking features, in short, make 
plant mitochondrial genome alluring for study- 
ing the dynamics and organellar genome evolu- 
tion inside its eukaryotic host cell. 
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8.3.1.1 Structural Organization 

The decades’ long quest for decoding the in vivo 
structure of mitochondrial DNA (mtDNA) paints 
a hazy picture. Akin to chloroplast genome, the 
general opinion about mitochondrial genomes 
was that they were circular DNA molecules, but 
there could be other structural forms present as 
well. The linear mtDNA molecules, which would 
often come up in the isolated mtDNA prepara- 
tions, were dismissed as artefactual degraded 
products of circular mitochondrial genomes. 
Over the years, numerous attempts of recovering 
large populations of intact circular mtDNA have 
failed due to the predominance of linear and 
branched structures which show up in most of the 
electrophoretic and microscopic analyses of 
mitochondrial DNA (Bendich 1993). There is a 
sufficient evidence to support both the circular 
and the linear, multibranched forms of mitochon- 
drial DNA, yet the idea of mtDNA as circular 
DNA molecules has persisted (Mower et al. 
2012; Sloan 2013). Deep sequencing analyses in 
recent years have proven to be the myth busters 
with overwhelming evidences to prove that the 
linear forms not merely exist but exist in good 
percentage. In fact in the case of maize CMS-S 
cytotype, the mitochondrial genome exists 
mainly as multiple linear molecules (Schardl 
et al. 1984; Allen et al. 2007). 

Mower et al. (2012) have precisely defined the 
paradox — ‘Circular maps continue to be pre- 
sented in genome _ sequencing publications 
because they are convenient indicators of genome 
content and sequencing completion’. So, irre- 
spective of what the real structure is, the mito- 
chondrial genome is conveniently described as a 
‘master circle’ which is a single circular DNA 
molecule that carries a complete set of genes 
(Lonsdale et al. 1984). Hence, the name was 
given ‘master’. In spite of having a defined physi- 
cal map of their genome, their structure in vivo 1s 
still way complicated and difficult to compre- 
hend. They may also sometimes appear larger 
than their size (Backert et al. 1997). It is because 
they carry repeat sequences, ranging from 6 bp to 
14 kb, dispersed throughout the genome and 
undergoing active recombination, that they have 
a multipartite genomic organization and exhibit 
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isomeric structural forms (Fauron et al. 2005). 
Frequent homologous recombination events 
within a pair of direct repeat sequences of the 
master circle may lead to ‘looping out’ of smaller 
circles called sub-circles (Kitazaki and Kubo 
2010). Similarly, large inverted repeats in mito- 
chondrial genome may also recombine to give 
rise to isometric genome structures. Both the iso- 
metric and sub-circular forms are subgenomic 
structural isomeric forms of the master circle 
genome (Chang et al. 2013). Sometimes, illegiti- 
mate recombination may occur at short repeat 
sequences that give rise to substoichiometric 
DNA molecules called sublimons. They differ in 
sequences, and hence only one of the forms sur- 
vives in subsequent divisions (Kitazaki and Kubo 
2010; Gabay-Laughnan and Newton 2012). 


8.3.1.2 Genome Size and Its Genetic 
Make-Up 

8.3.1.2.1 Genome Size 

Plant mitochondrial genomes show a wide size 
variation, ranging from 200 kb to up to 2,900 kb 
(Fig. 8.1). In the family Cucurbitaceae alone, 
genomes range from an estimated 390 kb in 
Citrullus lanatus (watermelon) to an unprece- 
dented 2.9 Mb in Cucumis melo (muskmelon) 
(Ward et al. 1981) which is even bigger than the 
genomes of many free-living bacteria (Moran 
2002). This extreme inflation in the mitochon- 
drial genomes of Cucurbitaceae family members 
is mainly due to the accumulation of large 
amounts of plastidial sequences, proliferation of 
dispersed short repeat sequences, expansion of 
existing introns and accumulation of large 
amounts of heterologous DNA sequences from 
various sources like nuclear and chloroplast 
genomes, viruses and bacteria (Alverson et al. 
2010, 201 1a). Ever since their inception in the 
eukaryotic cell, mitochondrial genomes of all 
plant species have undergone enormous size 
expansion in their intergenic regions and are still 
expanding. This is in much contrast to the meta- 
zoan mitochondrial genomes which have pro- 
gressively become smaller and more compact in 
size. Although their sequences are known to be 
still evolving, the gene number, their arrange- 
ment and the overall genome size have remained 
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constant over the long periods of evolutionary 
time (Boore 1999). True for all plant mitochon- 
drial genomes, despite the overall increase in 
their genomic content, the coding regions have 
remained conserved and limited. 

Both nuclear and plastid sequences contribute 
to size expansion of the mitochondrial genomes. 
Nucleus-derived mobile genetic elements, espe- 
cially retrotransposons, integrate and propagate 
in the intergenic regions. Until we have the infor- 
mation on more nuclear genome sequences, it 
puts a constrain on identifying new sequences of 
nuclear origin in the mitochondrial genomes. 
Plastid-derived sequences are also prominently 
found in the mitochondrial genome. Apart from 
the sequences from these two known sources, 
most of the sequences in the intergenic region are 
of unknown origin (Kubo and Newton 2008). 

Most of the intergenic regions are not even 
conserved between any two plant species (Kubo 
and Mikami 2007). Much of this diversity is due 
to the differences in the accumulation and activ- 
ity of repetitive sequences. Repeats of diverse 
size and number are found in the mitochondrial 
genomes which also lead to the expansion of 
intergenic regions. The repeat sequences affect 
their structural diversity by giving rise to subge- 
nomic circles on the account of undergoing inter- 
and intramolecular recombination (Alverson 
et al. 201 1b). 


8.3.1.2.2 The Genetic Make-Up 

During evolution, most of the essential mito- 
chondrial genes have been transferred to the 
nucleus where they are transcribed, and their 
products are transported right back into the mito- 
chondria either in the RNA or in the protein form. 
Some protein-coding genes though are retained 
in the mitochondria and require mitochondrial 
transcriptional, translational machineries for 
their expression (Unseld et al. 1997). Unlike the 
chloroplast genome which has retained a full set 
of their RNA genes like rRNA, tRNA and RNA 
components of ribonucleoproteins, mitochondria 
do not contain the complete set of tRNA genes 
sufficient to read all its codons (Marienfeld et al. 
1999). Consequently, it has to import the cellular 
tRNAs and aminoacyl tRNA synthetases to 


A. Choubey and MV. Rajam 


compensate for their missing function in 
mitochondria (Duchéne et al. 2009). Then, there 
are a few ‘chloroplast-like’ tRNA genes in mito- 
chondria which are plastid genome derived and 
code for mitochondrial tRNAs (Maréchal- 
Drouard et al. 1993). This way, the mitochondrial 
genomes contain both native and immigrant 
types of information. The native class comprises 
of those genes which have always been in the 
mitochondria, ‘the original occupants’ of the 
mitochondrial genome. This class includes the 
genes for respiratory chain functions, ribosomal 
protein genes, incomplete set of tRNA genes, 
proteins involved in cytochrome c biogenesis and 
some conserved open reading frames. The immi- 
grant class includes transposon sequences from 
the nucleus and tRNA genes from the chloroplast 
(Marienfeld et al. 1999). 

Mitochondrial genome may also contain pseu- 
dogenes — evolution’s misplaced witnesses 
(Bensasson et al. 2001) — scattered throughout 
the genome. They either represent the remnants 
of genes which have got transferred to the nucleus 
rendering their mitochondrial copies redundant 
and nonfunctional or they could be the nonfunc- 
tional versions of functional mitochondrial genes 
arisen by the gene duplication event. Another 
prominent feature of mitochondrial genes is the 
presence of introns which all belong to the cate- 
gory of group II introns with the only exception 
of horizontally acquired group I intron in cox! 
gene of mitochondria (Kitazaki and Kubo 2010). 
Group II introns include both cis-splicing and 
trans-splicing introns. Although group II introns 
are known to act as both catalytic RNAs and 
mobile genetic elements (Lambowitz and 
Zimmerly 2004), none of the plant mitochondrial 
introns have been shown to undergo self-splicing 
in vitro. In fact, they seem to use a complex splic- 
ing machinery, consisting of numerous nucleus- 
encoded factors, for their efficient excision 
(Bonen 2008). 


8.3.1.3 Genomic Rearrangements 

The mitochondrial genomes of angiosperms are 
amongst the largest organellar genomes ever 
known (Ward et al. 1981). They may show so 
much variation in their size and structure that 
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even within an organism mtDNA could be present 
in a heteroplasmic state, defined as ‘mitotypes’ 
(Kmiec et al. 2006). The coding sequences in 
mitochondria are highly conserved. It is the inter- 
genic sequences which house many repeat 
sequences that bring about structural diversities 
amongst different mitochondrial genome popula- 
tions. Long and short repeat sequences act as the 
substrates for extensive intra-genomic and inter- 
genomic recombinations (Palmer and Shields 
1984). These recombination events lead to 
dynamic changes in the mitochondrial genome, 
from creating chimeric genes, pseudogenes, 
duplication and deletions to multicellular mito- 
chondrial genomes (Conklin and Hanson 1994), 
Generally, recombinations create new copies of 
the large sequence repeats which further act as 
catalysts for more recombinations. Intramolecular 
recombination between direct repeat sequences 
leads to reshuffling of the gene order, whereas 
intermolecular recombinations may lead to large 
deletions and duplications. Drastic structural 
changes are usually brought about by homolo- 
gous recombination in large (>1 kb)- and 
intermediate-sized repeats (O.1—-1 kb) (Arrieta- 
Montiel et al. 2009). If the repeats are in inverted 
orientation, it leads to genomic inversions, 
whereas if they are in direct orientation, it can 
create subgenomes, each containing only a 
portion of the total genetic information 
(Abdelnoor et al. 2003). 

Intragenic ectopic recombination events can 
also occur at short repeat sites which results in 
the generation of novel gene chimaeras, possibly 
associated with mutant phenotypes like cytoplas- 
mic male sterility, plant variegation and other 
aberrant phenotypes (Abdelnoor et al. 2003). 
High-frequency recombinations take place in 
large repeat sites which give a number of recom- 
bination products that remain in stoichiometric 
equilibrium, whereas recombination across short 
repeat sequences tends to be asymmetric. What 
follows is a preferential recovery of only one of 
the two recombination products that are formed 
in the subsequent cell divisions leading to a phe- 
nomenon known as ‘substoichiometric shifting’. 
This may eventually lead to the replacement of 
the original predominant mitochondrial form by 


185 


the rare recombinant form (Gabay-Laughnan and 
Newton 2012). In the nutshell, genomic rear- 
rangements are arguably the biggest driving force 
behind evolutionary changes in the mitochon- 
drial genome organization and _ structural 
diversities. 


8.3.2 Plastid Genome 


Plastids are an integral part of a plant cell. 
Chloroplasts (the green plastids) are endowed 
with the function of photosynthesis which they 
carry out in a separate genetic compartment using 
proteins that are both endogenously made and 
imported from the cytoplasm. Plastids begin their 
life journey as undifferentiated organelles (proto- 
plastids) in the meristematic cells that later dif- 
ferentiate into different forms (chloroplasts, 
chromoplasts, leucoplasts, amyloplasts and etio- 
plasts) depending upon the tissue type they are 
going to be functional in. And just like mitochon- 
dria, chloroplasts also maintain their own genetic 
organization which reflects upon their cyanobac- 
terial ancestry. 

A major breakthrough in the chloroplast 
genome research came up in 1986 with the 
sequencing of whole plastid genomes of two land 
plants Nicotiana tabacum (Shinozaki et al. 1986) 
and Marchantia polymorpha (Ohyama et al. 
1986). The latest count till date, as per NCBI 
organellar genome database, of the number of 
chloroplast genome sequences available across 
all plastid-containing eukaryotes is 524. Land 
plants including green algae account for 470 
sequenced genomes. The extensive knowledge of 
plastomic sequences has helped us better under- 
stand the intricacies of plastome/genome organi- 
zation, plastid genome evolution and its structural 
and functional contribution to nuclear and mito- 
chondrial genomes. 


8.3.2.1 Structural Organization 

A generalized account of the plastid genome 
organization can be given because they are 
widely conserved amongst the seed plants. In the 
evolutionary perspective, they have undergone 
very little change in terms of their gene content 
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and organization (Barbrook et al. 2010). 
Chloroplast DNA (cpDNA) exists as a negatively 
supercoiled double-stranded molecules ranging 
in size between 37 and 217 kb with most of the 
photosynthetic organisms lying in the range of 
115-165 kb. 

The literature describes cpDNA as double- 
stranded circular DNA molecules. Though the 
statement is true, but it is not completely true. In 
fact as it turns out, just like their mitochondrial 
counterparts, only a small fraction of the total 
cpDNA may be present as circular entities 
(Bendich 2004). When the structure and organi- 
zation of a large number of intact cpDNA was 
analyzed from plants like Arabidopsis, tobacco 
and pea, it was found that only 25-45 % of the 
total cpDNA were circular DNA molecules, and 
the rest were in linear form (Lilly et al. 2001). 
Circular molecules can undergo multimerization 
(head to tail genomic concatemers), sometimes 
raking in up to even ten genome equivalents in 
rare cases. Generally, they are present as dimers. 
They even undergo isomerization with respect to 
the orientation of single-copy sequences sand- 
wiched between the two inverted repeat CIR) 
regions. The two forms are usually present in 
equimolar concentrations (Palmer 1983). 

Chloroplast genome has a bipartite structure 
with two copies of large inverted repeat regions 
(IRA and IRB) dividing the genome into one 
large and one small single-copy regions (SCRs) 
(Palmer 1985). Many of the plastid genes in 
higher plants are organized into operons, a trait 
that suggests their prokaryotic origin. The genes 
with related functions are usually clustered 
together in an operon, but it may not always be 
necessary (Sugita and Sugiura 1996). The genes 
in an operon are transcribed as polycistronic tran- 
script which undergoes further processing and 
modifications like splicing and editing to yield a 
functional transcript. 


8.3.2.2 The Genetic Make-Up 

As mentioned earlier, cpDNA of seed plants con- 
tains an average of 120 genes, all essential. The 
gene content and the gene order amongst the seed 
plants are generally conserved, especially within 
a group of organisms (Fig. 8.2). The numbers 
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may vary occasionally if there are gene 
duplications in the inverted repeats. The plastid 
genome maintains a core of basic set of genes 
required for photosynthesis and its gene expres- 
sion. The genes can broadly be classified into 
three categories: one that codes for the photosyn- 
thetic genes like rbcL, ndhA, ndhB, psaA, psbA, 
etc. and the second that includes genes required 
for the gene expression like rRNA, tRNA genes 
and those coding for the subunits of plastid- 
encoded RNA polymerase. The third category 
consists of conserved ORFs-ycfs (hypothetical 
chloroplast ORFs) and other potential protein- 
coding genes like matK (Ravi et al. 2008). Some 
of the chloroplast genes, ~20, are interrupted by 
introns. Introns are of group II type which have 
lost the self-splicing capacity and are dependent 
on nucleus-encoded pentatricopeptide repeat 
proteins (PPR) for their splicing. 


8.3.2.3 Plastid Genome Packaging 

Plastid DNA is packaged into a higher-order 
structure with the help of DNA-binding proteins 
to form highly conserved, discrete nucleoprotein 
complexes called plastid nucleoids. They form 
the structural and functional unit of plastid 
genome. This chromatin dense area is not sur- 
rounded by any membrane and remains attached 
to the intraplastidial membranes (Sakai et al. 
2004). Their number, location and size may vary 
depending upon the plastid type (Miyamura et al. 
1986), and this variation is reflected in the gene 
expression at different stages of chloroplast 
development (Melonek et al. 2012). 

In green plants and many green algal groups, 
the nucleoids exist as small, uniformly dispersed 
entities with a diameter of approximately 0.24. 
Depending on the plant species, tissue type and 
developmental stage, the number of plastid DNA 
copies per nucleoid may range between10 and 20 
(Kuroiwa 1991; Sato et al. 2003). Packaging of 
cpDNA into condense units also serves the pur- 
pose of ease in vegetative segregation of plastid 
DNA during plastid divisions (VanWinkle-Swift 
1980). Nucleoids do not exist as isolated groups. 
In fact, they form a dynamic group of nucleopro- 
teins, sensitive to the environmental and develop- 
mental cues, and are involved in the regulation of 
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Fig.8.2 Circular DNA maps of plastid genomes: genome 
map of tomato and potato chloroplast genomes. The thick 
lines indicate the two inverted repeats. Genes annotated 


gene —_- expression, function and 
differentiation. 

A wide range of proteins can be found in a 
nucleoid (Yagi and Shiina 2014), but this compo- 
sition may vary at different time points as nucle- 
oid is not an isolated structure, and its protein 
pool, which shows much flexibility, could only 
be defined depending upon the functional state a 
nucleoid is in (Kuroiwa 1991). The proteins are 
tightly bound to the DNA in the central region, 
whereas they are loosely bound to the DNA 
fibrils in the peripheral nucleoid space (Hansmann 


et al. 1985). This pseudo-compartmentalization 


plastid 





outside the circle are transcribed clockwise, and those 
inside are transcribed anticlockwise (Daniell et al. 2006) 


gives the plastid nucleoid a layered structure 
(Sakai et al. 2004). The core of the nucleoid is 
tightly organized and has all the necessary pro- 
teins required for the active transcription and rep- 
lication of plastid genome, while the peripheral 
regions are endowed with the task of translation 
and complex assembly. Many proteins have been 
identified and characterized from the enriched 
nucleoid fractions of various plant species like 
Arabidopsis, mustard, spinach, maize, pea, etc. 
(Phinney and Thelen 2005; Pfalz et al. 2006; 
Steiner et al. 2011; Melonek et al. 2012; Majeran 
et al. 2012). The plastid nucleoid associated 
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proteins (ptNAPs) can be classified into two 
categories based on their functions — first 1s of 
enzymatic ptNAPs that contribute to the DNA 
replication, DNA repair and transcriptional and 
translational activities of plastome. This category 
includes plastid-encoded RNA _ polymerase 
(PEP), PEP-associated proteins (PAP) and pen- 
tatricopeptide repeat proteins (PPR proteins) 
(Pfalz et al. 2006). 70S ribosomes, ribosome 
assembly factors and many other proteins are 
associated with DNA replication, DNA repair 
and mRNA splicing, editing and translation (Yagi 
and Shiina 2014). The second category is of 
architectural ptNAPs — those involved in mem- 
brane anchoring and DNA compaction like 
PEND, PD1, PD3, MFP1, TCP34, pTAC16, SiR, 
CND41, YLMGI1 and SWIB (Krupinska et al. 
2015). 


8.3.2.4 Plastid DNA Methylation 
Methylation of nuclear DNA is of common 
occurrence in eukaryotes and plays a pivotal role 
in modulating chromatin structure, regulating 
gene expression and in epigenetic inheritance 
(Suzuki and Bird 2008). However, whether or not 
cpDNA methylation occurs, it is controversial. 
The first report for cpDNA methylation dates 
back to 1972 when Sager and Lane (1972) pro- 
vided the first ever evidence of chloroplast meth- 
ylation and demethylation during the life cycle of 
unicellular green alga Chlamydomonas and its 
role in maternal inheritance of chloroplast 
genome. This report sparked off the quest for 
similar methylation marks in higher plants 
including tomato (Ngernprasirtsiri et al. 1988a; 
Kobayashi etal. 1990) and maize (Ngernprasirtsiri 
et al. 1989; Gauly and Kossel 1989). The basis 
for detecting methylation then was to use a pair 
of isoschizomer restriction endonucleases of 
which one would be methylation sensitive and 
the other would be methylation dependent so that 
they gave two different restriction patterns of the 
sample DNA depending upon whether one of 
them is methylated or not. Accordingly, when 
DNA methylation of plastid DNA from fully rip- 
ened fruits, green mature fruits and green leaves 
of tomato was compared (Ngernprasirtsiri et al. 
1988a), the isoschizomeric endonucleases 
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produced different restriction patterns of plastid 
DNAs in tomato fruits and tomato leaves. 
Different methylation patterns could also be seen 
for the genes that were expressed differentially in 
the chromoplasts. Further examination of meth- 
ylation status of the genes during transitional 
conversion of chloroplasts to chromoplasts in rip- 
ening tomato is correlated with their expression 
patterns making methylation one of the possible 
mechanisms for regulating gene expression in 
plastids (Kobayashi et al. 1990). 

Similarly in maize, tissue-specific differential 
expression of plastid-encoded genes was attrib- 
uted to a regulatory mechanism mediated by 
DNA methylation (Ngernprasirtsiri et al. 1989; 
Gauly and Kossel 1989). However, these obser- 
vations were refuted by subsequent studies in 
which no such stage-specific methylation pattern 
could be detected during tomato fruit ripening 
(Marano and Carrillo 1991). Another study 
around the same time on barley chloroplasts also 
failed to detect any methylation footprints on the 
plastid genes. Restriction patterns of leaf plastid 
DNA from different stages of growth and devel- 
opment were indistinguishable, although they 
could clearly see differential gene expression and 
protein synthesis at different stages (Tomas et al. 
1992). Same year, Ohta et al. (1991) had reported 
significant amount of cytosine methylation at 
most of the CC(A/T)GG sites and some of the 
GATC sites in the cpDNA of Pisum sativum. But, 
the methylation pattern could not be correlated 
with the gene expression as opposed to another 
report by Ngernprasirtsiri et al. (1988b) in which 
they had claimed that nonphotosynthetic plastid 
(amyloplast) DNA is heavily methylated which 
results in the transcriptional suppression of chlo- 
roplast genes, whereas non-methylated plastid 
DNA in photosynthetically active cells has tran- 
scriptionally active genes. 

Palmgren et al. (1991) observed varied levels 
of DNA methylation in different tissue types in 
carrot but failed to derive any conclusive correla- 
tion between the age, type and the differentiated 
stage of plant tissue with its levels of DNA meth- 
ylation. Almost a decade later, methylation of 
cpDNA in higher plants was again challenged by 
Fojtova et al. (2001) who, by _ performing 
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bisulphite genomic sequencing, failed to detect 
any methylated cytosine residues in tobacco 
plastome. 

In the midst of all the controversies surround- 
ing DNA _ methylation of plastid genome, 
Chlamydomonas remains arguably the best and 
the only example where the C-residues in cpDNA 
have been shown to be methylated (Umen and 
Goodenough 2001). In Chlamydomonas, methy1- 
ated DNA has a function in that only the methyl- 
ated maternal cpDNA is transmitted to the 
daughter cells. There are two mating types in 
Chlamydomonas: a heavily methylated maternal 
mating type plus, mt +, and a non-methylated 
paternal mating type, mt- (Burton et al. 1979). 
There are two theories to explain the inheritance 
of methylated maternal cpDNA. One is the 
methylation-restriction system, wherein upon 
successful mating, non-methylated mt- cpDNA is 
selectively degraded allowing for the uniparental 
inheritance of the maternal cpDNA (Sager and 
Grabowy 1983; Nishiyama et al. 2002). Another 
proposed mechanism is based on selective DNA 
replication of methylated genome. In an elegant 
experiment carried out by Umen and Goodenough 
in 2001, they demonstrated that methylation of 
maternal cpDNA is not for its protection from 
methylation sensitive nucleases but to enhance its 
replication in early zygotic stages. Probably, a 
different mode of cpDNA replication is adapted 
in germinating zygotes which selectively ampli- 
flies intact and properly methylated cpDNA 
molecules. 

The ambiguous status of cpDNA methylation 
in higher plants notwithstanding, experiments 
have been carried out to ectopically express ade- 
nine and cytosine DNA methyltransferases in the 
chloroplast genome of tobacco (Jaffe’ et al. 2008; 
Ahlert et al. 2009) by generating stable transpla- 
stomic lines. It was shown that the plastid 
genomes were efficiently methylated, yet they 
were phenotypically normal and viable, and the 
maternal inheritance was also not affected. More 
importantly, there were no alterations in plastid 
gene expression (Ahlert et al. 2009) which effec- 
tively dismissed any idea of transcriptional regu- 
lation of plastid gene expression by DNA 
methylation. 
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In conclusion, although DNA methylation of 
cpDNA in tissue-specific or developmental stage- 
specific manner cannot be ruled out with surety, 
it’s not yet clear so as to what functional role it 
may have in the chloroplast genome. A direct 
link with the transcriptional regulation of gene 
expression can clearly be ignored. 


8.3.3. Organellar Inheritance 


Unlike nuclear genome, organellar genomes in 
angiosperms follow a non-Mendelian, maternal 
pattern of inheritance (Birky 2008). It was first 
discovered independently by Carl Correns in 
Mirabilis jalapa (Correns 1909) and Erwin Baur 
(1909) in Pelargonium. Through carefully 
designed reciprocal crosses, Correns showed that 
the leaf colour of the progeny was always deter- 
mined by the phenotype of the maternal parent 
and not the paternal parent. Similar results echo- 
ing the cytoplasmic inheritance pattern were 
found in another set of reciprocal crosses between 
green and variegated plants of Pelargonium 
zonale by Baur who, in this case, demonstrated 
the biparental organellar transmission. Correns 
thought that there were certain cytoplasmic fac- 
tors which were responsible for the ‘diseased’ 
cytoplasmic state that confers variegated pheno- 
type, and the transmission of the diseased versus 
healthy cytoplasm depends upon the cytoplasmic 
state of the maternal parent. But, Baur could cor- 
rectly realize that it was the genetic factors in 
plastids that were randomly segregated and had 
given the observed phenotypes (Birky 2008). On 
the basis of the actual explanations given by the 
two scientists and for the overall understanding 
of the mechanism, only Erwin Baur can be cred- 
ited as the founder of the theory of plastid inher1- 
tance (Hagemann 2010). These two pioneering 
discoveries, though, marked the beginning of an 
exciting era of organellar genetics. 

Although uniparental mode of inheritance is 
the predominant mode of inheritance, it may 
extend beyond maternal inheritance to leaky 
paternal transmission or even biparental inheri- 
tance (McCauley 2013). Considerable research 
has been done in this field in order to study the 
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inheritance pattern across numerous plant 
species, deviations from the obvious and the 
mechanisms behind different modes of organellar 
inheritance. For example, Pelargonium, in which 
non-Mendelian inheritance was first discovered 
by Baur (1909), exhibits all the three modes of 
organellar inheritance (Weihe et al. 2009). 

In angiosperms, maternal inheritance of plas- 
tids accounts for about 80 % of cases, while the 
rest are known to be biparental (Corriveau and 
Coleman 1988). Deviation from a strictly mater- 
nal inheritance to a strictly paternal inheritance is 
of rare occurrence. Actinidia (Testolin and 
Cipriani 1997) is the only known example till 
date to exhibit an exclusive paternal inheritance 
of plastids. Uniparental paternal inheritance or 
biparental inheritance can be found mostly in 
interspecific crosses as a means to rescue defec- 
tive plastid genomes in hybrids with nuclear plas- 
tid incompatibility (Zhang and Sodmergen 2010). 
Examples include Passiflora (Hansen et al. 2007), 
Oenothera (Chiu et al. 1988), Larrea (Yang et al. 
2000), Turnera (Shore et al. 1994), Pelargonium 
(Weihe et al. 2009), Chlorophytum comosum 
(Liu et al. 2004), etc. 

Several mechanisms have been proposed for 
maternal inheritance that prevent paternal plas- 
tids from getting transmitted to the progeny. 
These include both pre- and post-fertilization 
preventive measures. First is the physical exclu- 
sion of plastid organelles from the generative 
cell. It can occur during pollen mitosis I or from 
the maturing generative cells by enucleated cyto- 
plasmic bodies. Post-fertilization mechanism 
involves differential rates of DNA replication 
between maternal and paternal plastid genomes 
(Nagata 2010). 

Many instances of paternal leakage of mito- 
chondrial genomes have also been reported. One 
of the initial reports that hinted at the possibility 
of paternal mtDNA inheritance was in alfalfa 
(Medicago sativa) in which large mtRNAs were 
found to be paternally inherited (Fairbanks et al. 
1988). In fact paternal inheritance of mtDNA in 
gymnosperms is known since the 1970s (Neale 
et al. 1989), and it is the most commonly found 
inheritance pattern in gymnosperm species. 
Amongst angiosperms, the prominent examples 
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include rapeseed (Erickson and Kemble 1990), 
banana (Faure et al. 1994) and cucumber (Havey 
1997). Otherwise, paternal leakage and hetero- 
plasmy normally occur in interspecific crosses. 
Barring these few examples, mitochondrial 
genome inheritance in flowering plants occurs 
predominantly via maternal transmission. One of 
the classical examples of maternally inherited 
mitochondrial trait is the cytoplasmic male steril- 
ity (CMS). Frequent genomic rearrangements in 
mtDNA can create chimeric open reading frames 
(ORFs) whose expression can result in defective 
mitochondrial functions which ultimately lead to 
pollen abortion (Hanson and Bentolila 2004). 

One of the reasons for a persistent uniparental 
mtDNA inheritance across the evolutionary time- 
line is the need to maintain organellar homo- 
plasmy. A biparental inheritance would mean 
mixing of maternal and paternal mitochondrial 
genomes giving rise to a heteroplasmic state 
(McCauley 2013). Such a genetic heteroplasmy 
of organelles is usually not tolerated by the cells, 
and in the subsequent divisions, they get rid of 
the mitochondrial genome type which 1s less 
competent of the two. Since mitochondrial 
genomes can undergo fusion, even in plants 
(Arimura et al. 2004), mtDNA from the two 
sources may recombine and cause unwanted 
genetic variations and heterogeneity in mtDNA 
sequences. 


8.3.4 Plant DNA Barcodes Based 
on Organellar Genome 


DNA barcoding uses short orthologous DNA 
sequences for species identification and for 
decoding its evolutionary history (Ali et al. 
2014). Phylogenetic analysis using DNA barcod- 
ing has revolutionized the traditional disciplines 
of taxonomy and phylogeny. Molecular phyloge- 
netic investigation relies upon the heritable dif- 
ferences at the molecular level to establish 
phylogenetic relationship between different 
organisms. The main aim of DNA barcoding is to 
establish a shared community resource of DNA 
sequences which will be unique for each species 
so that it can be used for organismal identification 
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and taxonomic clarification (Hollingsworth et al. 
2011). 

DNA barcoding surely offers many benefits. 
Apart from correctly identifying the known spe- 
cies, it has been instrumental in the identification 
of a number of probable cryptic species which 
were earlier thought to be single species based on 
morphological characters (Dasmahapatra and 
Mallet 2006). 

For a locus, coding or noncoding, to be con- 
sidered as a barcode, the foremost requirement is 
that it should easily get amplified using a univer- 
sal set of primers so that it can be used on a large 
sample size. More importantly, it should have an 
outstanding discriminating power, i.e. it should 
be able to make a clear distinction between dif- 
ferent plant species (Hollingsworth et al. 2011). 
In animals, mitochondrial gene cytochrome c 
oxidase I (COJ) is routinely used as the standard 
barcode for bio-identification (Hebert et al. 
2003). But in the case of plant mitochondrial 
gene markers, they are highly conserved 
sequences with a very low incidence of single 
nucleotide substitutions. This translates into their 
low discriminating powers making them unfit to 
be used as plant barcodes. Also, due to extreme 
genomic rearrangements in the intergenic regions 
of mitochondrial genome (Palmer et al. 2000) 
and the fact that sequence information is avail- 
able for only a limited number of plant species, 
mitochondrial genome in plants cannot provide 
enough suitable candidates for selecting a plant 
barcode (Hollingsworth et al. 2011). Using plant 
nuclear genes as barcodes may also be not very 
useful as they can be highly variable across dif- 
ferent plant species making the design of univer- 
sal primers difficult. Hence, a universal barcode 
is most likely to come from the plastid genome in 
plants. Initially, seven candidate plastid DNA 
regions, atpF-atpH spacer, matK gene, rbcL 
gene, rpoB gene, rpoC! gene, psbK-psbI spacer 
and trnH-psbA spacer (Pennisi 2007), were con- 
sidered, and on the basis of the selection criteria 
of recoverability, sequence quality and its species 
discrimination power, a 2-locus combination of 
rbcL/matK was eventually finalized as the univer- 
sal plant barcode (CBOL Plant Working Group 
2009). matK consistently shows a high level of 
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discriminancy amongst angiosperm species, but 
because of high sequence variability and the 
presence of mononucleotide repeats in the gene 
(Hilu and Liang 1997), it is inherently difficult to 
amplify and sequence. On the other hand, rbcL is 
easy to sequence and amplify but shows an aver- 
age discriminancy power. However, to circum- 
vent the problem of amplifying matK gene, new 
universal primers have now been designed with 
fairly strong amplification (93.1 %) and sequenc- 
ing (92.6 %) successes in the species tested (Yu 
et al. 2011). 

The use of plant barcodes in species identifi- 
cation 1s still in its initial stages. A compilation of 
the studies which have used DNA barcoding for 
plant identification has been illustrated in 
Table 8.1 (Hollingsworth et al. 2011). 

Barcoding offers many advantages like spe- 
cies identification in cases where only small 
amounts of tissue from sterile, juvenile or frag- 
mentary materials are available, and morphologi- 
cal identification is not possible or if it involves 
geographically restricted sample sets, such as 
studies focusing on the plant biodiversity of a 
given region or local area (CBOL Plant Working 
Group 2009). 

Although DNA barcoding is an effective phy- 
logenetic tool, no single locus can be conserved 
in all the species and still has enough discrimina- 
tion power to work across all species (Li et al. 
2015). Moreover, the species identification is 
based on a single DNA sequence which can cre- 
ate artefacts in the case of plants as sequences 
from a totally unknown source can reach a given 
species by horizontal transfer of DNA sequences. 


Genetic Crosstalk Between 
Nucleus, Chloroplasts 
and Mitochondria 


8.4 


It is known for a fact that ever since their incep- 
tion in eukaryotic host cell as subcellular organ- 
elles, mitochondria and _ chloroplasts have 
undergone massive reduction in their genome 
complexity with most of their genes being syph- 
oned into the nuclear genome only to reimport 
the products with the help of transit peptides and 
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Table 8.1 Applications of DNA barcoding in plants 


Application 


Identification of cryptic orchid species 
Identification of cryptic Conocephalum 


(bryophyte) spp. 


Identification of cryptic Herbertus 


(bryophyte) species 


Identification of cryptic Taxus species 


Identification of seedlings in tropical 


forest plots 


Community phylogenetics of tropical 


forest plot 


Identification of Chinese medicinal 


plants: Polygonaceae 


Identification of Chinese medicinal 


plants: Fabaceae 


Identification of Phyllanthus species in 


herbal medicines 


Identification of Actaea species in herbal 


supplements 


Identification of medicinal plant species 


Identification amongst berry species in 


foods 


Assessing the plant components of honey 


Identification of invasive species 
(Cardamine) 

Identification of invasive species 
(Hydrocotyle) 


Molecular identification of roots in 


grassland communities 


Identification of Osmunda gametophytes 


Identification of an aquatic fern 
gametophyte 


Identification of traded fern species 
Identification of CITES listed ramin 


timber and products 


Identification of poisonous plants 


Identification of plant components of 


herbivore diet 


Identification of plant components of 


herbivore diet 


Identification of plant components of 


herbivore diet 


Proof of concept for application to 


forensics 


protein import machinery. This means that they 
still have the full quota of proteins but not the 
genes (Martin 2003). In addition to the organellar 
DNA influx into the nucleus, nuclear DNA frag- 


Barcode markers used 
matK 
rbcL 


rbcL+matK+trnH-psbA+nrITS 


nrITS, trnL-F, trnH-psbA, matK, rbcL 


nrITS, trnH-psbA, rbcL, matK (and other 


regions) 
rbcL+trnH-psbA+matK 


trnH-psbA 
nrITS2, matK 
trnH-psbA 


matK 


nrITS2, nrITS, matK, rbcL, pcbA-trnH 


(and other regions) 
norITS 


trnL PG loop 
orlTS, trnL intron, trnL-F 


tmH-psbA 
rbcL 


rbcL 


rbcL, trnL intron (and other regions) 


rbcL (and other regions) 


matK (SNP assay of barcode sequence) 


matK, trnH-psbA and other regions 


trnL PG loop 
trnL intron 
trnH-psbA 


trnH-psbA, trnL-F 


A. Choubey and MV. Rajam 


Notes 


matK and nrITS rated as the 
best performing regions 
nrITS and trnL-F rated as the 
best performing regions 
trnH-psbA was rated as the 
best performing region 


nrITS2 rated as the best 
performing marker 

matK and trnH-psbA did not 
show enough variation 


matK proved troublesome to 
amplify so it was discarded 


trnL intron 1s less informative 
due to low coverage in 
reference database 


matK preferred (along with 2 
single-copy nuclear regions) 


ments have also found their way into the mito- 
chondrial genome. In fact, plastid sequences 
might also contribute to some plant mitochon- 
drial genes (Wang et al. 2012a). As far as plastid 
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genome is concerned, there is always a one-way 
movement of genes away from the plastid into 
the nucleus or mitochondria. 

As we will see in the next section, establishing 
an organellar gene in the nucleus is a very 
demanding task. So, what was the need for mov- 
ing the organellar genes into the nucleus during 
the course of organellar evolution? No matter 
how straightforward it seems, it is a very subjec- 
tive question. 

From the point in an evolutionary timeline 
wherein the two free-living prokaryotes — 
a-Proteobacterium and Cyanobacterium — were 
sequentially engulfed by their eukaryotic host 
cell, they were in the continuous process of 
evolving into the present-day organelles, mito- 
chondria and chloroplast, respectively. They had 
been assigned a specific function and could not 
possibly have continued to live as an independent 
entity as earlier. That made it imperative for the 
newly evolved organelles to share their genetic 
load with the host cell nucleus. This meant giving 
the nucleus a firm authority to regulate their func- 
tions, thus embarking on a long-lasting symbiotic 
relationship. This clearly indicates an evolution- 
ary pressure to centralize genetic information in 
the nucleus (Daley and Whelan 2005). 

Splitting of the organellar genes also means a 
well-co-ordinated gene expression between the 
nucleus and the organelles, ensuring more co- 
operations in cellular functions and a better regu- 
lation of their own function with respect to the 
internal and external cues as perceived by the 
nucleus. 

In pure genetic terms, there are many pro- 
posed hypotheses to explain the nuclear move- 
ment of the organellar genes: the need (1) for a 
more compact organellar genome that can natu- 
rally be selected for rapid replication (Selosse 
et al. 2001), (2) to avoid Miller’s ratchet which is 
the accumulation of deleterious mutations in 
asexually propagating nonrecombining organel- 
lar genomes by recombination in nuclear genome 
(Lynch 1996) and (3) to select for a better fit 
nuclear copy of the organellar gene (Blanchard 
and Lynch 2000). 

Having discussed all the possible reasons for 
the nuclear transfer of organellar genes, we will 
see to what extent the genes have been trans- 
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ferred to the nucleus and why some genes are still 
retained in the organellar genome. 


Promiscuous Plastid 
and Mitochondrial DNA 
in Nucleus 


8.4.1 


Boogaart et al. (1982) reported the presence of a 
gene sequence in nuclear genome that was 
homologous to the mitochondrial DNA sequence 
in Neurospora crassa. It was the first ever report 
of its kind. Later that year, John Ellis (1982) for 
the first time used the term ‘promiscuous DNA’ 
for the chloroplast genes inside the plant mito- 
chondrial genome. This was followed up by a 
burst of reports on promiscuous DNA across 
many systems, like in maize (Stern and Lonsdale 
1982), yeast (Farrelly and Butow 1983), fungus 
(Wright and Cummings 1983), locust (Gellissen 
et al. 1983) and so on. An event which was previ- 
ously thought as a rare event had suddenly 
become a ‘rampant process’. 

The large-scale DNA transfer from the organ- 
elles to the nucleus has significantly contributed 
to the nuclear genome evolution. The transferred 
DNA sequences in nucleus are called nuclear 
integrants of organellar DNA, norg, which is 
classified into nupt, for ‘nuclear plastid DNA’, 
and numt, for ‘nuclear mitochondrial DNA 
(Leister 2005). Most of these segments are gener- 
ally less than 1 kb in length (Richly and Lester 
2004a, b). 

Experimental attempts have been made to 
reconstruct the functional gene transfer from the 
chloroplast to nucleus and to show that the trans- 
ferred genes become functional in the nucleus 
when followed by suitable genomic rearrange- 
ments in the nuclear genome. Another finding of 
these experiments was that mitochondrial and 
chloroplast DNA transfer to the nucleus occurs at 
relatively high frequency (Huang et al. 2003; 
Stegemann et al. 2003). Stegemann and Bock in 
2006 employed a simple and effective experi- 
mental design in which they showed functional 
activation of a chloroplast marker gene in the 
nucleus. Stable transplastomic lines (chloroplast 
transformed) were raised using a cassette con- 
taining nptll gene under nuclear CaMV 35S 
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promoter upstream of the chloroplast marker 
aadA gene cloned under a chloroplast-specific 
promoter, Prrn. Spectinomycin (spec)-resistant 
cells were then challenged with kanamycin (kan) 
in culture, and many resistant lines could be 
recovered implying a nuclear transfer of the np7II 
gene. The lines were outcrossed to replace the 
transgenic chloroplast with the wild types. The 
kan-resistant and spec-sensitive lines were again 
screened for spectinomycin resistance to select 
for the lines in which the chloroplast gene had 
become functional in the nuclear genome. Again, 
the desired lines could be obtained implying the 
activation of aadA gene in the nucleus. But this 
could happen only at the cost of the associated 
nuclear gene whose segments were deleted to 
acquire the CaMV35S promoter. 

Transfer of the organellar DNA to the nucleus 
is an ongoing evolutionary process. Relocation of 
an organellar DNA segment to the nucleus does 
not guarantee a functional acquisition in the 
nucleus. There are many steps involved in the 
transfer process, from the actual movement of 
DNA towards nucleus to the final attainment of a 
function (Brennicke et al. 1993; Adams and 
Palmer 2003). The first step is to escape the 
organellar genome, i.e. gene escape. The DNA 
needs to be freed from the parent location and 
become available for the nuclear integration. 
DNA segments can escape the organelles by vari- 
ous mechanisms. In the past, it has been sug- 
gested to have occurred during the early stages of 
organellar evolution when the endosymbiont 
division was not synchronized with the host cell 
division, resulting in unequal partitioning. During 
those early divisions, some cells might have lysed 
to release their genomic contents in the cyto- 
plasm which somehow managed to reach the 
nucleus and got integrated into the nuclear host 
genome (Barbrook et al. 2006). Other mecha- 
nisms include escape during disruption of organ- 
ellar membranes which might occur at various 
occasions like during digestions of organelles by 
lysosomes or vacuoles (Adams and Palmer 2003) 
or during environmental stress conditions (Wang 
et al. 2012b). 

The next task would be to transfer the escaped 
DNA to the nucleus. But in what physical form 
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does it travel all the way to the nucleus? Is it 
DNA, RNA or cDNA? It could be any of the 
three. Large-scale DNA transfers have been 
shown experimentally as well as observed natu- 
rally which clearly implies that DNA escapes in 
its native double-stranded form and reaches the 
nucleus. Integration occurs via illegitimate, non- 
homologous DNA repair of double-stranded 
breaks (Noutsos et al. 2005) that can cause 
genomic rearrangements around the integrated 
sites. In fact there are many examples of ongoing 
endosymbiotic gene transfer occurring multiple 
times and independently in different angiosperm 
lineages. For example, mitochondrial gene rps 10, 
which codes for a protein component of the small 
ribosomal subunit, was found to be present in the 
mitochondrial genomes of some plants but absent 
in others (Knoop et al. 1995). The cases where it 
was missing from the mitochondrial genome, it 
had already migrated to the nucleus (Adams et al. 
2000). 

Some studies have shown that the transfer can 
take place via an RNA intermediate. Many organ- 
ellar genes may contain introns and are processed 
by splicing or may undergo editing before giving 
rise to the mature transcripts. Such edited ver- 
sions of the genes can be found as it is in the 
nuclei which suggest that transfer might have 
occurred via a reverse-transcribed RNA (Schljster 
and Brennicke 1987; Nugent and Palmer 1991; 
Henze and Martin 2001). 

The last step is the activation, the expression 
and the acquisition of signal peptides by the 
transferred gene. The transferred genes must 
acquire the regulatory elements for their proper 
expression followed by the organelle targeting 
signal peptide so that it reaches the destination it 
iS meant to be functional in (Adams and Palmer 
2003). Once a transferred gene becomes func- 
tional in the nuclear genome, there is a transi- 
tional phase where both the nuclear and the 
organellar copy might be expressing simultane- 
ously. This is exemplified by the nuclear and 
mitochondrial cox2 genes in several legumes 
where both the genes are transcribed albeit at 
varying levels (Adams et al. 1999). Following the 
co-expression, the next logical step is the gene 
silencing and the loss of organellar copy of the 
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gene. The nuclear copy is rarely ever lost, unless 
the gene product is toxic to the cell, outside its 
organellar location or is not transported back to 
the organelle. 

But, itis not always that a coding region has to 
be relocated. Many a times, a large chunk of non- 
coding organellar sequences are also transferred, 
and they remain devoid of any functions in the 
nuclear genome. The best possible example 
would be that in rice, where some very large 
insertions of cpDNA (>100 kb) have been found 
in the nuclear genome (The Rice Chromosome 
10 Sequencing Consortium 2003). Another 
example is 620 kb large mtDNA insertion in the 
chromosome 2 of Arabidopsis (Stupar et al. 
2001). 

Not all the organellar DNA fragments that get 
transferred into the nucleus can become func- 
tional. As they move to the new nuclear environ- 
ment, they are vulnerable to accumulating 
mutations: single nucleotide insertions/deletions 
(indels), point mutations or more commonly 
5-methylcytosine hypermethylation which 
appears to be the most prominent mechanism of 
mutational decay of organellar DNA (Huang 
et al. 2005). Also, the integration and elimination 
of organellar DNA remains in an equilibrium, 1.e. 
DNA which is transferred to the nucleus may get 
thrown out of the nuclear genome after much tai- 
loring and shuffling in due course of time, say in 
a million years. Large organellar DNA fragments 
preferentially localize to the pericentromeric 
region of the chromosomes, where integration 
and elimination frequencies are markedly higher 
(Matsuo et al. 2005). In summary, the transfer of 
organellar DNA into the nucleus is a dynamic 
evolutionary process, but for it to become suc- 
cessful, it needs to undergo various adjustments, 
genetic alterations before it gets fixed and become 
functional in the nuclear genome. 


8.4.1.1 Foreign DNA Invasion in Plant 
Mitochondrial Genome 
Mitochondrial genomes of flowering plants are 
prone to taking up foreign DNA from various 
sources. They can receive DNA from nucleus and 
plastids within the cell or from other mitochon- 
drial and plastid genomes via horizontal gene 
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transfer. Two peculiarities which make 
mitochondria more receptive to horizontal trans- 
fer of genomic material are firstly, they can easily 
undergo fusions and secondly, genetically dis- 
tinct mitochondrial genomes can recombine 
speeding up the exchange process (Bock 2009). 
Plastid DNA sequences contribute 1-11 % of 
the total mitochondrial genome (Lloyd et al. 
2012). These plastid-derived mtDNA sequences 
are abbreviated as mtpt sequences (Wang et al. 
2007). The transfer of cpDNA sequences to mito- 
chondria is estimated to have commenced at least 
300 million years ago with the oldest predicted 
mtpt being trnV(uac)-trnM (cau)-atpE-atpB-rbcL 
stretch of cpDNA. The first mtpt sequences dis- 
covery was made over three decades ago in maize 
genome by Stern and Lonsdale (1982) in which a 
12 kb mtDNA sequence was identified as identi- 
cal to a part of the IR of maize chloroplast 
genome. Since then, large chunks of plastid DNA 
have been detected in the mitochondrial genome 
of various angiosperm species. As the more num- 
ber of mitochondrial and plastid genomes are 
getting sequenced, more of such examples are 
expected to come up across the land plants. 
Notable examples of mtpt sequences include 
Phaseolus mitochondrial genome that harbours a 
promiscuous cpDNA fragment form tA gene 
intron which was acquired via a horizontal gene 
transfer event. This was the first report of a plas- 
tid sequence undergoing horizontal transfer to 
the mitochondrial genome of higher plants 
(Woloszynska et al. 2004). In silico studies had 
been carried out by Wang et al. (2007) to identify 
informative chloroplast gene clusters in mito- 
chondrial genome and to search for the pseudo/ 
functional protein-coding genes within the 
selected mtpt sequences. They even tried to look 
for particular regions in cpDNA, if any, that are 
regularly transferred to the mitochondrial genome 
so that DNA transfer hotspots can be identified. 
In their findings, which was in accordance with 
the previous reports (Notsu et al. 2002; Clifton 
et al. 2004), Wang et al. (2007) reported that all 
the protein-coding mtpt sequences in angio- 
sperms degenerate to become nonfunctional. 
Nevertheless, some chloroplast-derived tRNAs 
were shown to be functional in mitochondria 
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(Miyata et al. 1998). As far as the source of chlo- 
roplast sequences in the plastid genome is con- 
cerned, the transfer can occur randomly from any 
part of the genome (Wang et al. 2007). 

In a new finding by Wang et al. (2012a), some 
mtpt sequences have been shown to be contribut- 
ing codons to unrelated mitochondrial protein- 
coding sequences. For example, in Vitis vinifera 
(grapevine), Silene latifolia (campion) and 
Nicotiana tabacum (tobacco), mtpt sequences at 
3’ end of mitochondrial ccmC gene contribute 
four additional amino acids at carboxyl terminus 
of the polypeptide that are putatively involved in 
the mitochondrial mRNA maturation. Similarly, 
mtpt sequences at 3’ end of mitochondrial gene 
atp4 in Zea mays which provides the stop codon 
and mtpt sequences at 5’ end of the gene nad9 in 
Triticum aestivum (wheat) are all examples of 
mtpt sequences participating in mitochondrial 
functions. The plastid-derived sequences may 
remain dormant for years before they become 
functional again. 

Nuclear DNA sequences can also get stably 
integrated into the mitochondrial genome and 
contribute to its size expansion. Many of these 
sequences are homologous to retrotransposons 
(Kubo and Mikami 2007) as exemplified by 
copia-, gypsy- and LINE-like retrotransposon 
fragments in the mitochondrial genome of 
Arabidopsis thaliana (Knoop et al. 1996) that 
make up ~5 % of the total genome size. Similar 
findings are resonated in the mitochondrial 
genomes of other plant species also: 19 TEs (16 
retrotransposons and three DNA transposons) 
from rice mitochondrial genome (Notsu et al. 
2002), 4 small fragments (50-277 bp) with simi- 
larity to known retrotransposons in maize (Clifton 
et al. 2004), 21 retrotransposon-like sequences in 
sugar beet (Kubo et al. 2000) and 5 retrotranspo- 
sons in wheat mitochondrial genome (Ogihara 
et al. 2005). However, until we have the complete 
knowledge of nuclear and mitochondrial genome 
sequences of other plant species, it is difficult to 
estimate the share of nuclear sequences in mito- 
chondrial genome (Kubo and Mikami 2007). 

Horizontal gene transfer has played a signifi- 
cant role in eukaryotic genome evolution. HGT 
represents transfer of genetic material between 
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organisms (or even organelles) by asexual means. 
Mitochondria can also take up DNA by HGT. It 
can go beyond the species barrier. In fact the 
donor organism need not always be a plant. It can 
be as diverse as viruses or non-plant eukaryotes 
or even prokaryotes (Bock 2009). It happens 
more frequently in mitochondrial genomes 
resulting in massive DNA transfers even between 
distantly related flowering plants. It occurs 
strictly via a DNA-level intermediate, but the 
exact mechanism largely remains unknown 
(Mower et al. 2010). HGT is commonly seen in 
parasitic plants (Davis and Wurdack 2004; Bock 
2009) or in cases of plant-pathogen interactions 
where a cell-to-cell contact is established between 
the infecting agent (fungus, insect, nematode, 
etc.) and the host plant. In fact the endosymbiotic 
origin of chloroplasts and mitochondria and the 
ensuing large-scale genetic transfer are in itself 
the most striking examples of HGT. It is an 
extreme case wherein the entire genome of the 
endosymbionts has got transferred to the nuclear 
genome (Martin et al. 2002). 

As already mentioned, amongst the three cel- 
lular genomes, mitochondrial genome is the most 
amenable to foreign DNA invasion by HGT. It is 
evident by the occurrence of massive mitochon- 
drial gene transfers between parasitic flowering 
plants and their host plants (Davis and Wurdack 
2004). For example, in Rafflesia  cantleyi 
(Rafflesiaceae) and its close relatives, 24-41 % 
of their mitochondrial genome is derived from 
their host via HGT (Xi et al. 2013). Gene flow 
can also happen from the parasitic flowering 
plants to their host plants (Mower et al. 2004). 
Large-scale HGT can also be seen in Amborella 
trichopoda in which 1 or 2 copies of 20 of their 
32 protein-coding mitochondrial genes have been 
acquired from other plants. Interestingly, most of 
the transferred genes have retained their function 
in the Amborella mtDNA (Bergthorsson et al. 
2004). Another prominent example of HGT in 
mitochondria is that of highly invasive group I 
intron in angiosperm mitochondrial gene, cox/, 
coming from a fungal donor using double- 
stranded break repair pathway. It was_ first 
revealed in Peperomia polybotrya (Vaughn et al. 
1995) and later detected in over 640 angiosperms 
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(Cho et al. 1998; Sanchez-Puerta et al. 2008). It 
appears to have been acquired via 70 separate 
horizontal transfer events. Since fungal-plant 
transfer of mitochondrial introns does not happen 
too frequently, a chance transfer event from a 
fungal donor might have been followed by its lat- 
eral transfer from one angiosperm to another or 
by illegitimate crossing over (Sanchez-Puerta 
et al. 2008). 

The transferred foreign DNA in mitochondria 
can have four different fates. First, they could be 
rendered totally nonfunctional by frameshift 
mutations or indels (Lloyd et al. 2012). This hap- 
pens most often. Second, few sequences might 
retain the intact reading frames so that they are 
efficiently transcribed and remain functional in 
their new host (Miyata et al. 1998). In the third 
possibility, if the sequence similarity 1s very high, 
some of the invading genes can even replace the 
native mitochondrial genes via homologous 
recombination leading to gene conversions and 
generating mitochondrial diversity (Mower et al. 
2010). The native gene can either get completely 
replaced or only a part of it gets replaced result- 
ing in a ‘chimeric’ gene that is still functional 
(Hao and Palmer 2009). In the fourth type, novel 
mitochondrial genes can be created by the trans- 
ferred DNA segments, unrelated to the functions 
they previously had, making them evolutionary 
potent sequences (Wang et al. 2012a). 


8.4.1.2 Why Not Get Rid 

of the Organellar Genome 

Altogether? 
The modern-day organelles, chloroplasts and 
mitochondria, have a highly reduced genome. 
Their ancestors had thousands of genes in their 
genome, yet they themselves code for hardly a 
hundred genes. Some of the ancestral genes were 
dumped due to intracellular redundancy, while 
others were simply relocated to the nucleus. So 
far, the eukaryotic cell has failed to get rid of its 
mitochondrial and plastid genomes, retaining a 
minimum set of genes even in nonphotosynthetic, 
holoparasitic flowering plants, heterotrophic 
algae and apicomplexan parasites (Wolfe et al. 
1992; Nickrent et al. 1997; Wilson 2005; 
Barbrook et al. 2006). 
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This raises the following fundamental 
questions: Why is the evolutionary transfer of 
organellar genes to the nucleus not yet over? 
Why carry the additional load? What is stopping 
it? Is it the prokaryotic nature of their genes? Is it 
the codon usage? But that did not stop the other 
genes from moving out. So what 1s it? 

Many hypotheses have been put forth to crack 
this riddle. First, the ‘hydrophobicity’ hypothesis 
(von Heine 1986) which suggests that the genes 
which were left behind are the ones whose prod- 
ucts, being hydrophobic in nature, are difficult to 
transport back to the organelle from the cytosol. 
Gunnar von Heijne reasoned that mitochondrial 
proteins indeed contain long hydrophobic stretches 
which are not generally found in nuclear-encoded 
mitochondrial proteins. If these proteins were to 
synthesize in the cytosol, hydrophobic regions 
would serve as targeting signals for export across 
the endoplasmic reticulum, and the proteins could 
never reach the target organelle or the hydropho- 
bicity could make them toxic for the cell when 
expressed in the cytosol (Oca-Cossio et al. 2003). 
Many bioinformatic studies and experimental evi- 
dences have favoured this hypothesis (Daley and 
Whelan 2005). However, there have been some 
relocation experiments which have countered this 
view along with the reports of successful import of 
many hydrophobic proteins like LHCI (chloro- 
plasts) (Barbrook et al. 2006) and ADP-ATP carri- 
ers (mitochondria) (Allen 2003). Also, not all the 
genes in the organelles code for hydrophobic pro- 
teins. Despite these hiccups, hydrophobicity is still 
considered a major constrain in drawing the pro- 
teins back to the organelles. 

Second, the ‘CoRR’ or Co-location for Redox 
Regulation hypothesis (Allen 1993a, b, 2003) 
states that there are genes which are directly 
regulated by the redox state of their gene prod- 
ucts or that of their interacting partners. This 
necessitates the presence of such genes in the 
organelle itself so that any redox imbalance can 
be rectified instantly instead of delaying the 
response by first sending a signal to the nucleus 
and then waiting for the gene products to reach 
the organelle (Barbrook et al. 2006). But, there 
are flaws in this hypothesis too. Only few 
organellar genes have been shown to be under the 
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redox control, while there are many nuclear- 
encoded chloroplast and mitochondrial genes 
which are under redox regulation but are not 
coded by them (Pfannschmidt et al. 2003). 

Next in line is “code disparity hypothesis’ (de 
Grey 2005). There are issues of genetic code dis- 
parity between nuclear and mitochondrial 
genome and codon modifications in some plastid 
genes which may hamper the functional gene 
transfer to nucleus (Barbrook et al. 2006). Code 
disparity would mean a different amino acid get- 
ting incorporated during translation in the cyto- 
sol, probably altering its function altogether. 
Then their transcripts also undergo RNA editing 
to ‘correct’ the codons before they are translated, 
which interestingly confers increased hydropho- 
bicity to the proteins encoded (Jobson and Qiu 
2008). Also, there are plastid genes that use non- 
canonical start codon such as GUG which is 
bound to be missed in the cytosolic translation. 
To compensate for these disparities, additional 
mutations would be needed in the nuclear DNA 
which only complicates the matter further (de 
Grey 2005; Barbrook et al. 2006). 

Fourth would be ‘essential tRNA hypothesis’ 
(Barbrook et al. 2006). It says that tRNA encoded 
by the plastid gene trnE is unique in the sense 
that apart from its primary role in protein biosyn- 
thesis, it 1s also involved in tetrapyrrole biosyn- 
thesis (Kumar et al. 1996). So even if it is to be 
replaced by cytosolic tRNAGlu in protein bio- 
synthesis, it cannot take over its function in tetra- 
pyrrole biosynthesis. This makes the plastid copy 
indispensable and is forced to be retained by the 
plastid genome along with the machinery for its 
transcription (Howe and Smith 1991). 

Fifth is ‘limited transfer window’ hypothesis 
(Barbrook et al. 2006). One of the proposed 
mechanisms by which organellar DNA had 
moved into the nucleus was by their lysis during 
endosymbiont divisions, releasing its genomic 
contents in the host cytoplasm. This spilled over 
DNA then got integrated into the nuclear genome 
of the host. So according to this hypothesis, 
organisms which have only one organelle per cell 
will have ‘limited’ (or effectively closed) oppor- 
tunities to transfer their DNA to the nucleus as its 
breakdown will be essentially lethal to that 
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particular cell. This hypothesis is supported by 
the fact that plastid DNA-derived sequences are 
much rarer in the nuclear genomes of single plas- 
tid holders like Chlamydomonas and Plasmodium 
(Richly and Leister 2004b). 

“Work in progress’. It’s not a hypothesis per 
se, but there could be a chance that we do not see 
all the organellar genes in the nucleus yet simply 
because that evolutionary process is still under- 
way! We could just be witnessing the last stages 
of what must be a very slow evolutionary transfer 
of genes to the nucleus. 

In conclusion, no one hypothesis is full proof 
and sufficient enough to defend the retention of 
only a few kinds of genes by the organellar 
genome. Each of them has their share of merits 
and demerits. There could be other reasons too 
that are not covered here, but in practical terms, it 
is not an individual but the combination of differ- 
ent probabilities that can explain this dilemma. 


Conclusions and Future 
Perspectives 


8.5 


The knowledge of organellar genome sequences 
can be very resourceful in studying plant molecu- 
lar ecology and evolution. The ever growing 
organellar genome database is definitely a boost 
to the organellar research as it helps unravel the 
great mysteries of the organellar genomes which 
are still holding up. The current knowledge of 
genomes across a diversified range of plant spe- 
cies clearly indicates that all the three kinds of 
cellular genomes are in a continuous state of evo- 
lution and maintain a very healthy interorganellar 
communication. DNA fragments of all sizes are 
often moving in and out of nucleus and organelles, 
undergoing mutational alterations and rearrange- 
ments, creating novel sequence combinations, 
disrupting genes on arrival or sometimes even 
creating new genes where they integrate. 

Of the two organellar genomes, chloroplast 
genome is the one which is more stable and has a 
simpler genome compared to the mitochondrial 
genome. The quality of a chloroplast genome is 
such that it is most suitable to study the 
phylogenetic relationships amongst the major 
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cladles of flowering plants. One of the biggest 
attractions of chloroplast genome research in the 
future will be the use of cpDNA-based plant bar- 
codes in their taxonomical pursuits and to create 
a unique identification mark for each and every 
plant species on this planet. 

On the contrary, the plant mitochondrial 
genome, being the most dynamic and with an 
‘always changing’ attitude, makes for an interest- 
ing study from the evolutionary perspective. The 
animal mitochondrial genome, on the contrary, 
has remained docile and extremely stable with 
the relocation process of their mtDNA into the 
nucleus coming to an evolutionary halt. The plant 
mitochondrial genome is associated with so 
many peculiarities and jaw-dropping statistics 
that the more we think we know about this 
genome, the more mystical it appears. There are 
sO many unanswered questions that we can only 
hope the sequencing of more number of genomes 
will help understand them better. 
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Introduction 


Abstract 

Advances in knowledge about the molecular structure of plant genome 
and development of techniques to rapidly characterise genomic variations 
have established DNA fingerprinting as a powerful tool for identifying 
plant species, varieties, clones, individuals and even plant products. This 
article provides details of DNA polymorphism in plants and classical and 
more recent molecular marker techniques based on restriction fragment 
length polymorphism (RFLP), polymerase chain reaction (PCR), DNA 
microarrays and sequencing. The advantages and limitations of each tech- 
nique are given along with some recent examples of their application in 
plant identification. 
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whereby individual specific DNA fragment 
profiles are produced, thus providing an identifi- 


The term ‘DNA fingerprinting’ has its origin in 
the method developed by Sir Alec Jeffreys and 
his colleagues in 1985 (Jeffreys et al. 1985b) 
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cation tool having the same application as 
conventional fingerprinting. This restriction 
fragment length polymorphism-based method, 
its variants and several other techniques have 
found wide application across the biological 
spectrum to address issues of identification, 
diversity, relatedness, phylogeny and breeding 
(Karp et al. 1996, 1998; Staub et al. 1996; Henry 
2001; Weising et al. 2005; Jiang 2013; Nybom 
et al. 2014). 
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9.2 DNA Polymorphism 

DNA fingerprinting techniques are mostly targeted 
to locate polymorphism in the organisation of four 
nitrogenous bases adenine (A), guanine (G), cyto- 
sine (C) and thymine (T) which besides deoxyri- 
bose sugar and phosphate constitute the primary 
structure of DNA. These bases exist as single or 
low-copy-number coding sequences, promoters 
and regulatory sequences, introns and pseudo- 
genes, moderately repeated interspersed and tan- 
dem repeats and highly repeated satellite DNA 
sequences (Heslop-Harrison and Schwarzacher 
2011; Heslop-Harrison and Schmidt 2012; Pathak 
and Ali 2012). 


9.2.1 Polymorphism in Single 


Nucleotides 


Polymorphism in single nucleotides can origi- 
nate through insertion or deletion (indel) or sub- 
stitutions of single bases (single nucleotide 
polymorphisms — SNPs) between DNA 
sequences. SNPs are ubiquitous in eukaryotic 
genome and occur outside as well as within the 
coding regions (Gupta et al. 2001; Rafalski 
2002; Ganal et al. 2009). In maize with a genome 
size of 2.3 Gbp, on an average one SNP occurs 
every 44—75 base pairs (Ganal et al. 2011). SNPs 
are discovered either by mining sequence data- 
bases (in silico methods) or by experimental 
methods of direct sequencing or microarray 
technology (see later in the chapter). 


9.2.2. Interspersed Repeats 

Interspersed repeats are moderately repeated 
DNA sequences present throughout the genome 
and comprised of transposons, elements that can 
change their position within the genome (Frost 
et al. 2005; Morgante et al. 2007). Transposons 
are ubiquitous within eukaryotic genomes and 
are of two basic classes. 
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9.2.2.1 Retrotransposons or 
Retroelements 

These replicate through RNA _ intermediates 
which are reverse transcribed back at some ran- 
dom location in the genome. Four types of ret- 
rotransposons have been identified. Retroviruses 
have RNA genome which on entering host cells 
gets transcribed to form reverse transcriptase. 
The latter copies RNA into DNA which inte- 
grates into the host genome and replicates the 
viral genome to transcribe coat protein. Long ter- 
minal repeat retrotransposons (LTRs) are quite 
similar to retroviruses but have their coding 
sequences flanked on both ends by 300-500 bp 
direct repeats and lack protein-coding gene. Long 
interspersed elements (LINEs) and short inter- 
spersed elements (SINEs) do not bear terminal 
repeats but have poly (A) sequences at 3’ end. 
LINEs are several kilobases long and carry genes 
for their transposition, while SINEs are 100-500 
bp long and are unable to transcribe on their own. 


9.2.2.2 DNA Transposons 

These comprise sequences of one to two genes 
flanked by short inverted repeats. They move by 
getting cut and inserted at a new location in the 
genome. Since their movement does not involve 
replication, their copy numbers in a genome 
unlike retrotransposons are small. 


9.2.3 Tandem Repeats 


Tandem repeats consist of repeated units of up to 
several thousand nucleotides arranged end to 
end. Depending upon unit length, number of cop- 
ies and location in the genome, tandem repeats 
are Classified into three different types. 


9.2.3.1 Satellite DNA 

Satellite DNA derives its name from the observa- 
tion that it separates from the rest of DNA by 
buoyant density gradient centrifugation (Skinner 
et al. 1974). The basic sequence motif is gener- 
ally of 100-300 bp length but can vastly vary, and 
the motif is repeated 1,000—100,000 times at a 
locus. Satellite DNA is localised in the centro- 
meric and subtelomeric regions of chromosomes 
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and does not have much application for 
fingerprinting. 


9.2.3.2 Minisatellites 

Minisatellites have shorter repeat motifs (10-60 
bp) than satellites, have lower number of repeats 
(in hundreds) but occur at thousands of loci in the 
genome (Jeffreys et al. 1985a; Durward et al. 
1995). The number of minisatellite copies varies 
greatly among individuals; hence, these are also 
called as variable number of tandem repeats 
(VNTRs) (Nakamura et al. 1987). 


9.2.3.3 Microsatellites 

Microsatellites have 1-6 bp motifs repeated 
10-20 times and are widely distributed all over 
the genome including within genes and chloro- 
plast DNA, but rarely in plant mitochondrial 
DNA (Tautz and Renz 1984; Tautz 1989; Sharma 
et al. 1995). The repeat motif may be perfect (sin- 
gle uninterrupted array), imperfect (with one or 
more bases between repeat units) or compound 
(perfect and imperfect units combined). While 
microsatellites are randomly distributed in the 
genome, reports of their biased distribution are 
also available (Grover et al. 2007). 


9.3. Molecular Markers 
Several types of markers are used for characteri- 
sation of plants. Conventionally used morpho- 
logical features like form and size of vegetative 
and reproductive organs suffer from a number of 
drawbacks. They are expressed at different 
growth stages and, hence, require extended peri- 
ods of observation; many characters are affected 
by environmental variables thus making it diffi- 
cult to separate genetic effect from environmen- 
tal effect, and many morphological traits are 
controlled by more than one gene rendering 
genetic analysis of traits quite complex. 
Cytological features like chromosome num- 
ber, karyotype and chromosome banding patterns 
are highly reliable and have found much use in 
identification, classification and evolutionary 
relationships at species or higher levels (Heslop- 
Harrison and Schwarzacher 2011). However, 
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these features are usually not variable enough to 
detect polymorphism at individual or population 
level. 

Molecular methods encompass a wide range 
of techniques targeting variations in proteins and 
nucleic acids (Hillis and Moritz 1990; Karp et al. 
1997, 1998; de Vicente and Fulton 2003; Spooner 
et al. 2005; Weising et al. 2005; Semagn et al. 
2006; Agarwal et al. 2008; Nybom et al. 2014). 
Protein marker techniques generally measure the 
migration of constituent fragments through vari- 
ous kinds of gels, including starch and polyacryl- 
amide, across an electric field. Isozymes, 
functionally similar but electrophoretically dif- 
ferent forms of enzymes encoded by different 
genes, and their allelic forms (allozymes) have 
been widely used for analysis of genetic relation- 
ships and phylogeny in plant species (Gottlieb 
1984; Hamrick and Godt 1997). But they are lim- 
ited in number and much less polymorphic than 
nucleic acid markers, being products of func- 
tional component of the genome, generally 
housekeeping genes. 

Nucleic acid, largely DNA, marker techniques 
are based on hybridisation and/or electrophoretic 
properties of DNA sequences. Digestion by 
restriction enzymes or in vivo amplification by 
polymerase chain reaction is used to isolate the 
targeted fragments. Though not required in all 
the fingerprinting techniques, the fragments are 
separated by gel electrophoresis using horizontal, 
vertical or capillary electrophoresis with agarose 
(agarose gel electrophoresis — AGE) or poly- 
acrylamide (polyacrylamide gel electrophore- 
sis — PAGE) as the medium and visualised by 
radio- or non-radioactive labels or by staining the 
gel with ethidium bromide or silver nitrate. Brief 
details of these techniques are given below. 


9.3.1. Restriction Fragment Length 


Polymorphism (RFLP) 


RFLP (Botstein et al. 1980) involves digestion of 
DNA with restriction enzymes, which cut DNA 
at precise locations, separating the fragments by 
gel electrophoresis, blotting the fragments to a 
filter and hybridising labelled DNA probes of 
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interest to the separated fragments (Southern 
1975). Probes are short DNA fragments homolo- 
gous to a section of the genomic DNA. Specific 
combinations of enzymes and probes give highly 
reproducible band patterns. 

RFLP is a highly reliable technique with good 
reproducibility across laboratories. The markers 
are codominantly inherited; hence, genetic desig- 
nation is unambiguous. Using different probe and 
enzyme combinations, discrimination at species, 
population or individual levels is possible. For 
example, the use of minisatellites and microsatel- 
lites as probes produces multilocus individual 
specific fingerprints. The variation arises from 
changes in the copy number of basic repeats, 
referred to as variable number of tandem repeats 
(VNTRs). 

RFLP is used less frequently these days 
because of its cumbersome protocol, limited 
options for automation and requirement of large 
amount of high-grade DNA. However, in combi- 
nation with PCR, microarray and sequencing, 
restriction-based polymorphism analysis contin- 
ues to be popular. 


9.3.2 PCR-Based Techniques 


Polymerase chain reaction (PCR) (Saiki et al. 
1988) involves enzymatic amplification of short 
DNA fragments to amounts that can be visualised 
after electrophoresis. The technique consists of 
denaturation of double-stranded genomic DNA 
under high temperature (~92° C) followed by 
DNA polymerase-catalysed synthesis of DNA 
fragments using short single-stranded DNA 
sequences as primers that bind at complementary 
sites on the denatured DNA. The newly synthe- 
sised DNA is reamplified 25-45 times through 
repeated PCR cycles, and the products are sepa- 
rated by electrophoresis. Depending upon the 
type of primers and other techniques used in 
combination, PCR-based marker techniques are 
grouped into different classes. 


9.3.2.1 Arbitrarily Amplified Markers 
Collectively known as multiple arbitrary ampli- 
con profiling (MAAP) (Caetano-Anolles 1994), 


J.L. Karihaloo 


these techniques use arbitrary primers with no 
previous knowledge about the sequence of DNA 
being amplified. Random amplified polymorphic 
DNA (RAPD) (Williams et al. 1990) protocol 
uses one 10 bp primer and three PCR temperature 
cycles with a constant annealing temperature. 
Amplification products are separated on agarose 
gel and stained with ethidium bromide. DNA 
amplification fingerprinting (DAF) (Caetano- 
Anolles et al. 1991) uses shorter (5—8 bp) primers 
at higher concentration than RAPD and two tem- 
perature cycles. Generally, larger number of 
amplification products are obtained which are 
resolved on polyacrylamide gel and _ silver 
stained. Arbitrarily primed PCR (AP-PCR) 
(Welsh and McClelland 1990) uses 20 bp or 
higher sized primers and is conducted in three 
steps using different constituent concentrations 
and PCR stringencies. The products are separated 
on polyacrylamide gel and detected by autoradi- 
ography. In a modified protocol, arbitrary prim- 
ers were used to amplify cDNA to locate 
individual specific polymorphism in RNA, a 
technique known as RNA fingerprinting by arbi- 
trarily primed PCR (RAP-PCR) (Welsh et al. 
1992; Menke and Mueller-Roeber 2001). 

RAPD is by far the most widely used tech- 
nique of MAAP class. The advantage of this and 
related techniques is that sequence information 1s 
not required, cumbersome blotting and hybridi- 
sation are avoided and the techniques are ame- 
nable to automation. On the other hand, these 
markers are dominant, there is no certainty that 
comigrating bands represent genetically same 
loci and, due to several intrinsic and extrinsic fac- 
tors, undesirable variation in band intensity is 
often encountered. 


9.3.2.2 Amplified Fragment Length 
Polymorphism (AFLP) 

AFLP technique combines RFLP with PCR to 
selectively amplify DNA restriction fragments 
(Zabeau and Vos 1993; Vos et al. 1995). The 
DNA is digested with two different enzymes, 
one a rare cutter (6 bp recognition site) and the 
other a frequent cutter (4 bp recognition site). 
Double-stranded synthetic adapters are ligated 
to the DNA fragment ends thus generated. This 
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step is followed by two PCRs, the first a prese- 
lection PCR with 17—21 bp primers complemen- 
tary to the adapter sequences and restriction 
sites. The primers have an additional nucleotide 
to amplify only a subset of fragments. The 
amplified fragments are subjected to a second 
PCR using two more additional nucleotides on 
the primer to make further fragment selection. 
The thus generated amplification products are 
separated by PAGE with silver staining or capil- 
lary gel electrophoresis with fluorescent dyes. 
AFLP of cDNA (cDNA-AFLP) displays differ- 
ences in the expression of genes (Bachem et al. 
1998). 

AFLP combines the advantages of PCR and 
RFLP resulting in large number of highly repro- 
ducible bands. There is no requirement of 
sequence information and high level of polymor- 
phism is detected by AFLP. On the other hand, 
AFLPs are dominant markers and the technique 
is more demanding than MAAP. 


9.3.2.3 Sequence-Tagged Sites (STS) 
STS markers (Olson et al. 1989) are developed 
by using primers based on some degree of 
genomic information. A number of STS tech- 
niques varying in the type of primers used have 
been developed. 
9.3.2.3.1 Microsatellites or Simple 
Sequence Repeats (SSR) 
or Sequence-Tagged 
Microsatellite Sites (STMS) 
or Simple Sequence Repeat 
Polymorphism (SSRP) 
As detailed earlier in the chapter, SSRs are 1-6 
bp repeat sequences existing throughout the 
genome. Polymorphism in the number of repeat 
units is detected by PCR amplification using 
primers complementary to the flanking sequences 
of microsatellites (Saghai Maroof et al. 1994; 
Wenz et al. 1998). The amplified fragments are 
separated by AGE, PAGE or capillary electro- 
phoresis and visualised by ethidium bromide, sil- 
ver staining or fluorescent labelling. 

SSR markers are highly variable, reproduc- 
ible and codominant, due to which they have 
found wide application in characterisation and 
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diversity analysis. However, the technique 
requires elaborate marker development effort 
involving construction of genomic libraries, 
enriching the libraries for the targeted SSR 
motifs, sequencing libraries for SSRs and flank- 
ing sequences and testing their amenability to 
PCR protocol (Sharma et al. 2007). On the other 
hand, a number of developments have greatly 
facilitated discovery and application of SSR 
markers. SSR markers have now been developed 
in many plant species and are publicly available. 
Several cases of transferability of SSRs across 
species have been reported (Kuleung et al. 2004; 
Grover et al. 2009; Park et al. 2010). Li and col- 
leagues (2008) combined SSR with gene 
sequence amplification using multiplex PCR to 
fingerprinting potato cultivars. Increasing online 
availability of DNA sequence information, par- 
ticularly concerning expressed sequence tags 
(ESTs) (Wolfsberg and Landsman 2001; 
Parkinson and Blaxter 2009), is also facilitating 
in silico development of SSRs (Louarn et al. 
2007; Arya et al. 2009). Finally, next-generation 
methods have revolutionised DNA sequencing 
and consequently provided substantial boost to 
SSR development. 

9.3.2.3.2 Intersimple Sequence 

Repeats (ISSR) 

The technique involves DNA amplification using 
microsatellites as primers for PCR amplification 
(Gupta et al. 1994; Zietkiewicz et al. 1994). The 
primer sequences are generally 10-30 bp long 
comprising 2—5 repeats which are unanchored or 
anchored with 1—4 degenerate bases at 5’ or 3’ 
end. Accordingly, amplification of the interven- 
ing region of two identical microsatellites takes 
place. AGE or PAGE is used for separation of 
amplification products. Ye and colleagues (Ye 
et al. 2005) used one ISSR and one RAPD primer 
(R-ISSR) to detect new genomic loci in maize 
lines. 

The advantage of ISSR is that it is as simple as 
RAPD; the markers are highly polymorphic and 
show higher reproducibility than 
RAPD. Dominant inheritance and doubts about 
genetic homology of comigrating bands are the 
drawbacks the technique shares with RAPD. 
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9.3.2.3.3 |Sequence-Characterised 
Amplified Region (SCAR) 

SCARs are locus-specific markers developed 
from MAAP markers (Paran and Michelmore 
1993). The technique involves cloning MAAP 
fragments and sequencing the two ends to 
develop 24-mer PCR primers. SCARs are genetic 
markers, show dominant or codominant inheri- 
tance and are highly reproducible. While SCARs 
are often developed to tags traits, they have also 
been useful for genotype identification (Vidal 
et al. 2000; Polashock and Vorsa 2002). 
9.3.2.3.4 | Cleaved Amplified 
Polymorphic Sequence 

(CAPS) or PCR-RFLP 

The technique combines the advantages of PCR 
and RFLP but eliminates the need for probe 
development and blotting (Williams et al. 1991; 
Bertea and Gnavi 2012). PCR products obtained 
by locus-specific amplification are subjected to 
restriction digestion followed by separation of 
the products by AGE or PAGE. CAPS markers 
are codominant and highly reproducible, but 
polymorphism depends on existence of variation 
in the restriction sites alone. A variant technique, 
derived CAPS (dCAPS), involves introduction of 
mismatches in a primer (Neff et al. 1998). The 
new PCR product is subjected to restriction 
digestion. 
9.3.2.3.5 Random Amplified 
Microsatellite Polymorphism 
(RAMP) 

RAMP involves amplification of genomic DNA 
with primers comprising labelled repeat 
sequences with a 5’ anchor and with or without a 
RAPD primer (Wu et al. 1994). PCR is conducted 
at two temperatures to allow proper melting and 
annealing. PAGE with label detection is used so 
that only bands obtained by anchored labels are 
detected. 
9.3.2.3.6 | Sequence-Related Amplified 
Polymorphism (SRAP) 

SRAP technique targets polymorphism in the 
open reading frames (ORF) of genome (Li and 
Quiros 2001). Two sets of primers are used 


J.L. Karihaloo 


which have a core of 13—14 bases of which the 
first 10-11 are random sequences, but differ in 
reverse and forward primers, followed by CCGG 
sequence in the forward primer and AATT in the 
reverse primer. Three selection nucleotides are 
added at the end of core sequences. The first 
PCR cycle is carried out at lower annealing tem- 
perature than the rest. PAGE is used for fragment 
separation. SRAP markers are both dominant 
and codominant depending upon the type of 
mutation and are used for fingerprinting as well 
as gene tagging (Chen et al. 2010a; Yin et al. 
2014). 
9.3.2.3.7. Target Region Amplification 
Polymorphism (TRAP) 

The PCR-based technique reveals polymor- 
phism in candidate genes by targeting ESTs (Hu 
and Vick 2003). Two 18 nucleotide primers are 
used, one complementary to the EST sequence 
and the other a random primer with AT- or 
GC-rich core which can anneal with an intron or 
exon. 


9.3.2.3.8 | Single-Strand Conformation 
Polymorphism (SSCP) 

SSCP detects differences in electrophoretic 
mobility of single-stranded DNA due to changes 
in number and type of bases (Hayashi 1991). 
Target sequences are amplified by PCR followed 
by denaturation of the double-stranded DNA to 
yield single strands. The latter acquire three- 
dimensional shapes which differ in electropho- 
retic mobility among sequences with even 
single-base differences. 

9.3.2.3.9 —_Intertransposon Amplified 
Polymorphism (IRAP) 

As implied by its name, the technique reveals 
polymorphism between LTRs (Kalendar et al. 
1999). Pairs of primers complementary to 3’ 
end of one LTR and 5’ end of another LTR are 
developed and used for PCR amplification. 
Since LTRs can have same sequences at either 
end due to head-to-head or tail-to-tail orienta- 
tion, single primers can also yield amplification 
products. 
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9.3.2.3.10 Retrotransposon- 
Microsatellite Amplified 
Polymorphism (REMAP) 
REMAP targets polymorphism between LTRs 
and SSRs (Kalendar et al. 1999). Two primers, 
one complementary with the 5’ end of LTR and 
the other corresponding to SSR and anchored to 
3’ end for a selective base are used for PCR 
amplification. 
9.3.2.3.11 Sequence-Specific 
Amplification Polymorphism 
(S-SAP) 
S-SAP is a modified AFLP technique to detect 
polymorphism in transposons (Waugh et al. 
1997). Here the second AFLP amplification is 
carried out using Mse-I adaptor-specific and 
retroposon-specific primers. 


9.3.2.3.12 Transposon Display (TD) 

The technique involves use of primers corre- 
sponding to a restriction site and a transposon for 
AFLP so that polymorphism in the sequence 
comprising parts of transposon and _ specific 
restriction site is detected (Va et al. 1998; Jiang 
et al. 2003). 


9.3.2.3.13 Retroposon-Based Insertion 
Polymorphism (RBIP) 

In this technique a transposon-specific primer is 
used along with a primer complementary to the 
flanking region which generates a PCR fragment 
containing the insertion (Flavell et al. 1998). If 
the insertion is absent, primers specific to both 
the flanking regions will generate a fragment. 


9.3.3. DNA Microarrays 


Microarrays or ‘chips’ comprise hundreds of 
DNA samples (known as probes) immobilised on 
glass or nylon slides which are hybridised by dif- 
ferent DNA samples (known as targets) (Kehoe 
et al. 1999; Galbraith and Edwards 2010). 
Hybridisation between probe and target is 
detected and quantified by appropriate labelling, 
detection and quantification techniques. Besides 
other applications, DNA microarrays are used to 
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compare genomic constitution, detect SNPs and 
identify biological contaminants/adulterants in 
food or feed (Niu et al. 2011). Different 
microarray-based technologies are used for fin- 
gerprinting purposes. 


9.3.3.1 Diversity Array Technology 
(DArT) 

DArT involves restriction digestion of a gene 
pool, PCR amplification of selected fragments 
and cloning and spotting on a microarray 
(Jaccoud et al. 2001). The DNA of a sample to be 
identified is restricted with the same enzymes 
and hybridised to the array. The advantage of 
DArT is that it does not require sequence infor- 
mation, is robust, detects single-base differences 
and is highly automated. On the other hand, the 
technique is expensive and _ technology 
demanding. 


9.3.3.2 Suppressive Subtractive 
Hybridisation Array (SSHA) 

The method is based on suppressive subtractive 
hybridisation (SSH) which excludes sequences 
common to two samples under comparison and 
thus greatly increases the probability of capturing 
polymorphic DNA fragments which may be in 
low abundance (Li 2004; Lukyanov et al. 2007). 
The subtracted and PCR-amplified fragments are 
printed on microarrays and hybridised with target 
DNA fragments. 


9.3.3.3 Subtracted Diversity 
Array (SDA) 

In this technique, pooled genomic DNA from one 
set of samples is extracted from a pooled genomic 
DNA of other sets of DNA samples to obtain 
DNA unique to the second set (Jayasinghe et al. 
2007). The latter is used as microarray probe to 
fingerprint a set of samples genetically closer to 
the second set. 


9.3.3.4 Oligonucleotide and Gene-based 
Microarrays 

Species-specific oligonucleotide sequences or 

gene sequences are used as probes. Sequences of 

target species corresponding to the probes are 

amplified, labelled and hybridised onto the oligo- 
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nucleotide array (Zhu et al 2008; Niu et al. 2011). 
Alternatively, target DNA is sheared, labelled 
and hybridised (Edwards et al. 2008). 


9.3.4 DNA Sequencing 


Till recently, the method developed by Sanger 
and colleagues was the most widely used one for 
DNA sequencing (Sanger et al. 1977; Franca 
et al. 2002). It involves selective addition of 
chain-terminating nucleotides during in vitro rep- 
lication of template DNA. Electrophoresis is 
carried out to separate the amplified fragments 
from which the sequence of template DNA is 
read. 

Sequencing is one of the steps in SSR, SCAR, 
SNP and sequence-based microarray protocols. 
In addition, sequence information of some spe- 
cific genomic regions has been found to be quite 
useful in species differentiation (DNA barcod- 
ing), including identification of constituent spe- 
cles in mixtures (He et al. 2010; Chen et al. 
2010b; Pang et al. 2011; Newmaster et al. 2013; 
Tripathi et al. 2013). In plants, nrITS, nrITS2, 
accD, matK, ndhJ, rpoB, rpoC1, ycf5, atpF-H, 
psbK-I, rbcL, rbcLa, trnH-psbA, trnL (P6) and 
UPA, singly or in combination, have been pro- 
posed as suitable barcodes (Hollingsworth et al. 
2011). While the discovery of a universal bar- 
code that could differentiate all the world’s plants 
is yet to come true, the technology has been quite 
effective in differentiating species in some plant 
groups (Hollingsworth et al. 2009; Spooner 2009; 
Hollingsworth 2011). 

A new generation of sequencing methods, 
next-generation sequencing (NGS), has emerged 
during the last few years which has vastly 
increased the capacity (up to one billion bases per 
run) and dramatically reduced the cost of 
sequencing (Egan et al. 2012; Kumar et al. 2012), 
the cost per mb having reduced from US$ 5,224 
by Sanger technology to US$ 0.12-0.39 by 
[lumina (Zalapa et al. 2012). Three main meth- 
ods, sequencing by synthesis, sequencing by 
ligation and single-molecule sequencing, have 
been used for NGS. Global sequencing efforts 
along with new bioinformatics tools have resulted 
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in generation and analysis of massive amounts of 
sequence data from model as well as non-model 
plants (Bateman and Quackenbush 2009; Lee 
et al. 2011; Michael and Jackson 2013). 

In plants where reference genomes are avail- 
able, whole-genome resequencing (Huang et al. 
2009) provides an excellent tool for analysing 
genome-wide diversity in an array of samples, 
including SNP discovery and genotyping (geno- 
typing by sequencing) (Bekele et al. 2013; Sonah 
et al. 2013; Finkers 2014). To overcome the prob- 
lems of too large a size of genome and relatively 
small informative regions, several approaches 
have been adopted to reduce complexity by 
enriching specific regions before sequencing. 
Transcriptome sequencing and restriction site- 
associated DNA sequencing (RADseq) are some 
such approaches (Miller et al. 2007; Priest et al. 
2010). 


9.4 DNA Fingerprinting for Plant 
Identification: Some 


Examples 


In their publication, DNA Fingerprinting in 
Plants: Principles, Methods, and Applications, 
Weising et al. (2005) have elaborately covered 
the literature on the subject published till 2004. 
In Table 9.1, some subsequent publications 
exemplifying the applications of conventional 
and more recent fingerprinting techniques in 
plant identification are listed. 


9.5 Which Markers to Use? 

The choice of marker techniques for DNA finger- 
printing will depend upon several factors: 
whether the purpose is to discriminate individu- 
als, populations or higher taxonomic categories; 
is there a need for a highly robust method or can 
reproducibility be compromised for some other 
advantage; what is the required number of mark- 
ers and their inheritance to yield results of accept- 
able confidence level; what is the availability of 
laboratory facilities, technical expertise and bud- 
get for consumables; and what is the scale of 
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Table 9.1 Recent examples of DNA fingerprinting for plant identification 


Reference 

Akbari et al. (2006) 
Akond et al. (2012) 
Arya et al. (2009) 


Bassil et al. (2009) 
Ben-Ayed et al. (2012) 
Bertea et al. (2006) 


Bhatia et al. (2009) 


Biswas et al. (2010) 
Caramante et al. (2011) 
Chen et al. (2006) 

Chen et al. (2010b) 
Cheng and Huang (2009) 
de Souza et al. (2010) 


Edwards et al. (2008) 
Faleiro et al. (2010) 
Ferrari et al. (2005) 
Figueiredo et al. (2013) 


Fu et al. (2014) 
Grover et al. (2009) 
He et al. (2010) 
Hu et at. (2005) 
Hu et al. (2007) 


Jiang et al. (2010) 
Kacar et al. (2006) 


Kafkas et al. (2008) 
Kashyap et al. (2005) 
Keshavachandran et al. (2007) 
Kiani et al. (2012) 
Kim et al. (2012) 

Kjos et al. (2010) 
Korir et al. (2013) 
Kumar et al. (2013) 
Kwon et al. (2005) 
Lamia et al. (2010) 

La Mantia et al. (2005) 
Lezar et al. (2004) 

Li et al. (2006) 

Li et al. (2008) 

L6pez et al. (2009) 
Louarn et al. (2007) 
Lu et al. (2009) 


Technique 
DArT 
SSR 
EST-SSR 


SSR 
SSR 
PCR-RFLP 


ISSR 


RAPD, ISSR, IRAP, REMAP 
SSR 

AFLP 

DNA barcoding 

SSR 

RAPD 


Oligo-microarray 
RAPD 
AFLP 
RAPD 


EST-SSR 

SSR 

DNA barcoding 
TRAP 

TRAP 


SSR 
SSR 


AFLP 
RAPD 
RAPD 

ISSR 

RBIP 

AFLP 
RAPD 

SSR 

TD 

AFLP, SSR 
SSR 

DArT 

SSHA 
Multiplex SSR and gene PCR 
SSR, AFLP 
SSR 

RAPD, ISSR 
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Plants/products 
Wheat cultivars 
Rose species and cultivars 


Millet and pearl millet genotype 
identification 


Hazelnut clones in gene bank 
Authenticity of virgin olive oil 


Salvia species from green and dry 
samples 


Clonal fidelity of micropropagated 
gerbera plants 


Citrus genotypes 

Tomato cultivars in food chain 
Oncidium cultivars 

Medicinal plant species 

Peach cultivars and landraces 


Intraspecific and interspecific hybrids 
in Annonaceae 


Oryza species 
Mango cultivars 
Corylus avellana 


Olea europaea L. ssp. europaea 
accessions 


Celery cultivars 

Potato cultivars and lines 

Medicinal Aconitum species 

Lettuce cultivars and wild accessions 


Spinach commercial hybrids and 
accessions 

Genotyping of indica and japonica rice 
varieties 


Turkish sour cherry germplasm 
accessions 


Morus accessions 

Fragaria genotypes 

Piper nigrum and P. longum cultivars 

Tulipa cultivars 

Pear cultivars 

Argyranthemum frutescens cultivars 

Tomato varieties 

Mango varieties 

Oryza species and genotypes 

Apricot germplasm 

Sicilian olive germplasm 

Closely related Eucalyptus trees 

Dendrobium species 

Potato cultivars 

Grapevine cultivars 

Brassica oleracea cultivars 

Broccoli and related cultivars 
(continued) 


214 


Table 9.1 (continued) 


Reference Technique 
Mantri et al. (2012) SDA 

Mifano et al. (2009) RAPD 

Naito et al. (2006) TD 

Newmaster et al. (2013) DNA barcoding 
Niu et al. (2011) SDA 

Olarte et al. (2013a) SDA 

Olarte et al. (2013b) SDA 

Park et al. (2010) SSR 

Pathak and Dhawan (2010) ISSR 

Pang et al. (2011) DNA barcoding 


Peng et al. (2010) 


PCR-RFLP, ITS sequencing 


Pooler and Townsend (2005) AFLP 
Prakash et al. (2005) SSR, AFLP 
Riaz et al. (2008) SSR 

Sarwat et al. (2008) ISSR, RAPD 
Shiran et al. (2009) AFLP 

Singh et al. (2007) ISSR 

Singh et al. (2010) SSR 

Somsri and Bussabakornkul (2008) DAF 
Spaniolas et al. (2006) PCR-RFLP 
Syed et al. (2005) SSAP 

Tiwari et al. (2009) RAPD, ISSR 
Tripathi et al. (2013) DNA barcoding 
Wang et al. (2010) SNP 

Xi et al. (2012) ISSR 

Xuan et al. (2009) SSR 

Yang et al. (2013) SCAR 

Yu et al. (2012) SSR 

Zhang et al. (2013) SCAR 

Zhou et al. (2005) AFLP 

Zhu et al. (2008) Oligo-microarray 
Zhu et al. (2011) ISSR 
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Plants/products 
Asterid species 


Chrysanthemum cultivars and 
somaclonal variants 


Rice genotypes 


Detecting contamination and 
substitution in herbal products 


Panax species from dried samples 
Salvia species 

Echinacea species 

Rose and strawberry cultivars 


Clonal fidelity of micropropagated 
apple rootstock 


Rosaceae species 

Distinguishing Lonicera japonica from 
other Lonicera species 

Elm clones and hybrids 

Indian robusta coffee germplasm 
Muscadine grapes 

Tribulus terrestris accessions 
Iranian almond cultivars and wild 
species 

Mango cultivars 

Rice genotypes 

Sex identification in papaya 
Authentication of coffee 

Cashew genotypes 

Brinjal cultivars 

Tropical tree species 

Panax ginseng cultivar 

Lily mutants 

Sweet and sour cherry 
Dimocarpus longan varieties 
Gossypium species and genotypes 
Sorghum halepense genotypes 
Populus species, clones and cultivars 
Ginseng plants and drugs 

Wheat cultivars 


operation including the number of participating 
laboratories and data sharing (Karp et al. 1997; 
Henry 2001; de Vicente and Fulton 2003; Spooner 
et al. 2005; Weising et al. 2005; UPOV 2010). 
Table 9.2 provides a general comparison of 
the commonly used fingerprinting techniques 
with respect to their efficiency, ease and cost of 
application. A number of studies are also avail- 
able that compare these techniques in specific 
groups of taxa (Powell et al. 1996; Jones et al. 


1997; Pejic et al. 1998; McGregor et al. 2000; De 
Riek 2001; Archak et al. 2003; Sarwat et al. 2008; 
Lamia et al. 2010). In general, SSRs are the pre- 
ferred markers because of their abundant varia- 
tion among but relatively low variation within 
cultivars, high reproducibility across laborato- 
ries, codominant inheritance, primer availability 
in many crops and ease of experimental proce- 
dure. Several of these advantages are true of 
SNPs as well. In fact, due to their robustness, 
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SSRs and SNPs are best suited for genotyping 
involving forensic and IPR issues (Testolin and 
Cipriani 2010; UPOV 2010). While marker 
development expertise and cost is an issue with 
many laboratories, with the NGS-driven high 
speed and reducing cost of sequencing and 
expanding databases and software tools, SNPs 
along with SSRs are becoming the preferred fin- 
gerprinting options for a wide range of 
laboratories. 
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Abstract 

Physiological responses, developmental programs, and cellular functions 
depend on complex networks of interactions at different scales and levels. 
Systems biology unites biochemical, genetic, and molecular approaches to 
generate omics data that can be analyzed and used in computational mod- 
els to uncover these global networks. Various approaches, including 
genomics, transcriptomics, proteomics, interactomics, and metabolomics, 
have been employed to obtain these data at the cellular, tissue, organ, and 
whole organism levels. 
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10.1 Introduction 


Large-scale genome projects produce a large 
amount of data that are used to build models to 
understand how biological systems manage infor- 
mation. In contrast to the static representation of 
structural genomics, functional genomics focuses 
on the dynamic aspects of genome expression. 
Functional genomics emerged as a field of molec- 
ular biology that involves high-throughput meth- 
ods in genome-wide investigations. In a broad 
sense, functional genomics is dedicated to the 
understanding of the relationship between the 
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genome of an organism and its phenotype. The 
fundamental goal of functional genomics is to 
understand how biological functions arise from 
the information encoded in a genome. 

Briefly, the biological information necessary 
to manage a biological system is stored in the 
genome and transcribed into the transcriptome to 
be finally translated into the proteome. The pro- 
teome is the sum of signaling and metabolic net- 
works. The interactome represents the signaling 
networks because the response of the cell to a 
specific situation is obtained by the activation of 
gene expression; gene expression depends on the 
activation of a signaling pathway, and the signal- 
ing pathway is activated through the interaction 
of a receptor with a specific signaling molecule. 
By contrast, the subsection of the proteome that 
is dedicated to enzymatic activities describes 
metabolic pathways and interactions with the 
metabolome. The metabolome encompasses not 
only molecules that are substrates and products 
of enzymatic reactions but also cofactors and 
non-peptidic organic signaling molecules. 
Ionome, the core of ions, gives information on 
the state of the system under specific environ- 
mental conditions. The genome, transcriptome, 
proteome, and metabolome are considered the 
four pillars of functional genomics (Salt et al. 
2008). The dynamic response and the interaction 
of these four pillars define how a living system 
operates, which is currently one of the greatest 
challenges in science (Salt 2004). 

The genomic revolution began with the report- 
ing of the complete genome sequence of the bac- 
terial pathogen Haemophilus influenzae 
(Fleischmann et al. 1995). For the first time, 
science was in possession of a complete DNA 
catalog with the genetic information for every 
phenotype. Two decades later, thousands of pro- 
karyotic and eukaryotic genome sequences are 
available. The first plant genome fully sequenced 
was Arabidopsis thaliana, which was selected 
because of the small size (~150 Mb) of its genome 
(The Arabidopsis Genome Initiative 2000). 
Suites of standard protocols are now applied to 
each new genome that is being sequenced, and 
the released information is made publicly acces- 
sible through GenBank (Benson et al. 2006), the 


L.H.F. Gomes et al. 


European Molecular Biology Laboratory 
(Cochrane et al. 2006), and the DNA Data Bank 
of Japan (Okubo et al. 2006). The data format 
that is covered by these three databases was ini- 
tially dedicated to the representation of gene 
structure. However, these databases have been 
progressively evolving toward the representation 
of intergenic sequences on a chromosomal scale 
and integrating data from multiple sources such 
as functions, cellular levels, and ontologies. 
Many individual laboratory initiatives have since 
developed their own websites with a specific for- 
mat and links to other databases and have adapted 
to the type of released information. By conse- 
quence, the annotation pipelines of these work- 
flows are globally independent from the scientific 
team that conducted the genome sequencing, but 
specificities exist in the way the tools are assem- 
bled. From a post-genomics perspective, the 
pioneer annotation pipelines can be considered as 
the starting point that has facilitated the develop- 
ment of technologies for high-throughput struc- 
tural and functional genomics. 

Among numerous notable websites, the fol- 
lowing websites are highlighted. (1) PlantGDB is 
a database of molecular sequence data for all 
plant species with significant sequencing invest- 
ments. The database organizes EST sequences 
into contigs that represent tentative unique genes. 
Genome sequence fragments are assembled on 
the basis of their similarity. Contigs are annotated 
and, whenever possible, linked to their respective 
genomic DNA. The goal of the PlantGDB web- 
site is to establish the basis for identifying sets of 
genes common to all plants or specific to particu- 
lar species. To achieve this goal, PlantGDB inte- 
grates a number of bioinformatics tools that 
facilitate gene predictions and cross-comparisons 
in species with large-scale genome sequencing 
investments. (11) Open Sputnik is a database for 
the comparison of plant genomes. This website 
uses sequence resources to fill information gaps 
in non-sequenced plant genomes and provides a 
foundation for in silico comparative plant 
genomics (Rudd 2005). (411) Mercator is a Web 
server for the genome-scale functional annota- 
tion of plant sequence data (Lohse et al. 2014). 
(iv) PlantTFDB 3.0 is a website for the functional 
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and evolutionary study of plant transcription 
factors (Jin et al. 2014). For additional database 
and bioinformatics tools, see Table 10.1 and 
Martinez (2011). 

Functional genomics data can be subdivided 
into sequence and experimental datasets. The 
sequences are used to perform fundamental 
genetic analyses such as a homology search, 
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nucleotide substitutions, indels search, SNP 
detection, nucleotide composition analyses, gene 
expression measure, and gene structure descrip- 
tion. These processes are commonly performed 
genome-wide on sequence datasets using auto- 
mated algorithms running in silico. For example, 
the principle of conserved operons (Overbeek 
et al. 1999) can be used to predict the function and 


Table 10.1 Databases for functional genomics of plant resources 


Types 
Genomic databases 


See a list on plant 
organelles database 


Computational biology and 
functional genomics 
laboratory 


Plant repeat databases 
TropGENE database 
Genomics of small grain 
crops 

Comparative grass 
genomics 

Plant genome database 


The Arabidopsis 
Information Resource 
(TAIR) 


Rice genome project 
(RGP) 
SNPs 


Insertional mutagenesis of 
Arabidopsis thaliana 


The floral genome project 


Intron polymorphism 
markers 


Plant promoter database 


Plant cis-acting regulatory 
elements 


Small RNA from cereals 


Plant microRNA 
knowledge base 


Plant microRNA database 
Plant MPSS database 
Plant resistance genes 
Predictive RNA editor 
Expression database 


See a list on: plant 
organelles database 


At TAIR 
At TIGR 


URL 


http://podb.nibb.ac.jp/Organellome/bin/listLinks.php 


http://compbio.dfci.harvard.edu/tgi/plant.html 


http://plantrepeats.plantbiology.msu.edu/downloads.html 
http://tropgenedb.cirad.fr/tropgene/JSP/index.jsp 
http://wheat.pw.usda.gov/GG2/index.shtml 


http://www.gramene.org/ 


http://www.plantgdb.org/ 
http://www. arabidopsis.org/ 


http://rgp.dna.affre.go.jp/ 


http://autosnpdb.appliedbioinformatics.com.au/ 
http://signal.salk.edu/cgi-bin/tdnaexpress 


http://fgp.huck.psu.edu/tribedb/10_genomes/index.pl 
http:/Abi.zju.edu.cn/pgl/pip/ 


http://www.softberry.com/berry.phtml?topic=plantprom& group=data&subgroup=plant 
prom 


http://bioinformatics.psb.ugent.be/webtools/plantcare/html/ 


http://sundarlab.ucdavis.edu/smrnas/ 
http://bis.zju.edu.cn/pmirkb/ 


http://bioinformatics.cau.edu.cn/PMRD/ 
http://mpss.udel.edu/ 
http://www.pathoplant.de/ 
http://prep.unl.edu/ 


http://podb.nibb.ac.jp/Organellome/bin/listLinks.php 


http://www. arabidopsis.org/servlets/Search?action=new_search&type=expression 
http://jcvi.org/arabidopsis/qpcr/search.php 
(continued) 
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Table 10.1 (continued) 


Types URL 


EBI: the array express 
database 

GARNet 

RMOS 


Stanford microarray 
database 


http://smd.princeton.edu/ 


Database resource for the 
analysis of signal 
transduction in cells 


http://www.drastic.org.uk/ 


Crop plant expression 
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https://www.ebi.ac.uk/arrayexpress/ 


http://www.garnetcommunity.org.uk/resources/microarray-tool-kit 
http://cdnaO1.dna.affrce.go.jp/RMOS/index.html 


http://www.agbase.msstate.edu/ 


Crop plant ESTs http://perc.ipk-gatersleben.de/cr-est/ 
PLEXdb http://www.plexdb.org/ 
Proteomics 


See a list on: plant 
organelles database 


Crop plant metabolism 


Plant transcription factor 
database 


Systematic comparison of 
proteome data 


functional interactions of unknown open reading 
frames (ORFs). In eukaryotes, the concept of 
ORF is only useful to describe transcriptome 
sequences (see in Carels and Frias 2013) because 
genomic coding sequences (CDS) are interrupted 
by introns. To cope with the challenge of gene 
prediction on a genomic scale in eukaryotes, 
sophisticated hidden Markov models (HMMs) 
were designed to account for the conditional 
probability of exons, introns, splicing signals, and 
promoters for different contexts such as GC con- 
tent (Stanke et al. 2004; http://bioinf.uni- 
greifswald.de/augustus/). Putative exons may 
even be aligned with external datasets or tran- 
scriptome data to increase the consistency associ- 
ated with these predictions (Keller et al. 2011). 
However, these models still produce large error 
rates (Coghlan et al. 2008), and a large-scale 
homologous comparison between genomic and 
transcriptomic sequences is generally preferred 
when available. 

Experimental datasets, however, are adapta- 
tions of established molecular biology protocols 
that are scaled up to become high throughput. For 
example, DNA microarrays are a miniaturization 


http://podb.nibb.ac.jp/Organellome/bin/listLinks.php 


http://metacrop.ipk-gatersleben.de/apex/f?p=269:7:0::NO::P7_PATHWAY_ID:49 
http://pIntfdb.bio.uni-potsdam.de/v3.0/ 


http://salad.dna.affrc.go.jp/salad/en/ 


of standard Southern hybridization techniques to 
cover the whole genome. 

While there is a clear separation between 
sequence and experimental datasets, the primary 
utilization of these datasets 1s identical: the large- 
scale prediction of functional interactions (Muley 
and Ranjan 2012). From a systems biology per- 
spective, a functional interaction can be reduced 
to a binary construct between two proteins or 
genes. Although this representation is a drastic 
oversimplification, it allows both sequence and 
experimental datasets to be converted and com- 
bined into interaction sets. Interacting pairs of 
genes can be represented by “1” and noninteract- 
ing pairs by “0.” Because this binary character- 
ization of an interaction is only a model of the 
reality and, thus, characterized by an error rate, 
any interaction predicted using a functional 
genomics dataset must be experimentally verified 
and characterized according to spatial and tem- 
poral variables. Furthermore, the process of inter- 
action characterization lacks consensus regarding 
the definition of the term functional interaction. 
The problem resides in the interpretation of the 
word interaction and, by extension, the purpose 
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of the modifier, functional. A functional 
interaction may refer either strictly to the pro- 
teins that are engaged in direct physical contact 
(Cusick et al. 2005) or broadly to all proteins that 
are involved in a response or pathway (Eisemberg 
et al. 2000). The difficulty in constructing a con- 
sensus definition for functional interaction is 
similar to the current controversy in defining the 
concept of species in bacteria (Cohan 2002; 
Moreno 1997). 

Ontology determines whether categories are 
fundamental and considers in what sense an item 
can be considered to be (legitimately) in one of 
those categories. By extension, ontology is a rep- 
resentation (semantic model) of something we 
know. Thus, the ontology is useful only when it 
successfully represents the targeted object. 
Ontologies provide insight into the relationships 
that exist between a gene and the genome by 
reducing the set of possible interactive partners 
from the whole genome to a subset. Rather than 
represent specific pairwise functional interac- 
tions, ontologies have been developed to classify 
proteins into functional groups including those in 
the Clusters of Orthologous Groups (COGs) for 
prokaryotes (Tatusov et al. 2000), Eukaryotic 
Orthologous Groups (KOG) for eukaryotes 
(Hirakawa et al. 2013), Gene Ontology (GQ) 
(Ashburner et al. 2000), Kyoto Encyclopedia of 
Genes and Genomes (KEGG) (Kanehisa et al. 
2004), and Protein Families (Pfam) (Finn et al. 
2006). Although these ontologies use indepen- 
dent schemes in their classifications, computer 
algorithms can convert the data from one ontol- 
ogy into another, strongly indicating that these 
ontologies are based on similar underlying 
concepts. 

In the following section, we characterize the 
most common types of sequence and experimen- 
tal datasets currently available. 


10.2 Sequence Datasets 


One of the major strengths of genomic sequence 
datasets is that they are based on a technology 
that is well established and that produces robust 
data available for further biological inferences. 
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However, these procedures heavily rely on bioin- 
formatics. In the design of bench experiments, 
bioinformatics models have to be verified at the 
experimental level prior to further investigation 
to avoid the unnecessary loss of time and money 
because of eventual errors in these models. 
Despite this error load, genomic data allow 
detailed sequence comparison across species; 
this cross-species analysis becomes increasingly 
powerful as the number of available genomes 
increases. Prokaryotic genomes are relatively 
small compared to eukaryote genomes, and bac- 
teria therefore have the largest number of 
sequenced genomes. However, the number of 
eukaryote genomes that are completely sequenced 
is increasing rapidly, and there are already 50 
complete genomes for plants (Michael and 
Jackson 2013). 

The release of any given genome sequence is 
accompanied by an initial genome annotation. 
The first, and perhaps the most important, anno- 
tation step is the mapping of predicted CDSs on 
a genome. In prokaryotes, this step has been 
successfully performed with algorithms such as 
GLIMMER (Delcher et al. 1999); eukaryote 
algorithms are based on HMM trained with 
samples of true positives such as in TigrScan, 
Fgenesh++, and Augustus. True positive coding 
ORFs are ORFs that were proven to code 
sequences in experimental studies. Coding 
ORFs have a purine bias (Rrr) within the codon 
that is universal to all living organisms (Carels 
et al. 2009) and that results from selective con- 
straints on physicochemical properties of the 
encoded proteins (Ponce de Leon et al. 2014). 
The Rrr is a feature that is specific to coding 
DNA and is detected by Markov models. In 
eukaryotic genomes, ORF prediction is consid- 
ered accurate for sequences larger than 300 bp 
(Brent 2005; Wang et al. 2006), but a rate of 
~15 % error between frames +1 and —1 is cur- 
rently observed (Carels and Frias 2009; Carels 
and Frias 2013a). This error may however be 
reduced to 5 % if an additional filter 1s added to 
the common algorithm. This filter must contain 
the conditions that guanine in the first position 
of codons (G1) is larger than in the second posi- 
tion (G2) and that the adenine in the second 
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position of codons (A2) is larger than the 
thymine in the first position (T1). These condi- 
tions are true in 95 % of proteins that were 
crystallized (Carels and Frias 2013b). In eukary- 
otes, gene prediction is more difficult than ORFs 
because the coding sequence is interrupted by 
noncoding introns. Because the average enzyme 
size 1s ~400 amino acids (1,200 bp), most exons 
are below the 300 bp size threshold that is 
required to warrant statistical consistency of the 
prediction. For these genomes, the best solution 
is to extract coding sequences from the tran- 
scriptome because mRNAs are quickly pro- 
cessed to remove introns. Thus, cDNA is 
generally deprived of introns. Obtaining the 
cDNA allows an easy classification of the cod- 
ing ORFs in the coding frame (Carels and Frias 
2013b). The homologous comparison of coding 
ORFs from contigs of cDNAs with the genomic 
sequence also allows the retrieval of introns; 
thus, the coding ORFs form the objective train- 
ing set of introns and exons used for the training 
of a HMM for the case under consideration. 
Examples of freely accessible HMMs for gene 
searches in eukaryotes are GeneZilla (Allen 
et al. 2006) and Augustus (Stanke et al. 2004). 
This procedure should allow the discovery of 
the majority of genes even for species with no 
previous information. 

Each predicted ORF is then assigned a puta- 
tive annotation based on various methods that 
rely on the search for homologous pairs between 
a sample of query sequences and a sample of sub- 
ject sequences. Applying the Basic Local 
Alignment Search Tool (BLAST; Altschul et al. 
1997) to compare each ORF against a reference 
database of characterized genes allows the trans- 
fer of an annotation from the reference (subject) 
to the query when the score of the homologous 
comparison is above a given threshold to pre- 
serve Statistical consistency. The p-value associ- 
ated with the comparison decreases. with 
increasing score values. Consequently, the confi- 
dence in the function that is transferred from sub- 
ject to query also increases with decreasing 
p-values. Depending on the type of data within 
the reference database, other features can be 
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deduced from the significant homologous com- 
parisons between sequence pairs (e.g., compar- 
ing a sequence with KOGs). The association of 
unknown CDSs with known enzymes in KEGG 
(Kanehisa et al. 2004) may allow the attribution 
of a CDS to a particular pathway. Furthermore, 
the homologous identification of protein domains 
using the HMM profiles (Eddy 1998) using Pfam 
annotations may allow the classification of an 
unknown CDS within a given enzymatic family. 
Therefore, evolutionary roles may be inferred 
for that CDS. 

In addition to the annotation of CDSs, the 
genome sequence is also analyzed for additional 
features including noncoding regulatory RNAs 
(Eddy 2002; Gottesman 2005; Schattner et al. 
2005; Vogel and Sharma 2005; Winkler 2005) 
such as transfer RNAs (tRNA), ribosomal RNAs 
(rRNA), microRNAs (miRNAs), and small 
untranslated RNAs (suRNA). These analyses are 
considered by experimentalists as a starting point 
for the characterization of specific genes or 
genetic networks. 

Plant genome datasets were constructed 
according to their specific features. For species 
associated with large-scale genome sequencing, 
PlantGDB provides genome browsing abilities 
that integrate all available ESTs and cDNAs for 
current gene models. A special section dedicated 
to A. thaliana is named AtGDB (http://www. 
plantgdb.org/AtGDB). 

The MapMan BIN ontology is another useful 
bioinformatic tool in plant functional genomics. 
MapMan was initiated by Thimm et al. (2004) 
and has constantly improved over time (Usadel 
et al. 2009; Urbanczyk-Wochniak et al. 2006; 
Rotter et al. 2007; Sreenivasulu et al. 2008). The 
goal of the MapMan approach is to assign genes 
to as few functional categories as possible with- 
out losing information. The MapMan ontology is 
organized as a hierarchically structured tree that 
comprises 35 main biological categories that are 
named BINs. Currently, approximately 1,000 
distinct enzymatic activities are associated with 
1,906 BINs in the MapMan hierarchy. Complete 
transcriptome annotations were obtained for 
many different plant species using the MapMan 
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classification system and are publicly available 
via the MapMan website (http://mapman.gabipd. 
org/web/guest/mapmanstore). 


10.2.1 Cofunctional Network 
Inference and Analysis 


Advances in high-throughput technologies have 
allowed the generation of extensive gene expres- 
sion datasets at a genome-wide scale. The obser- 
vation that genes sharing a common function 
can be coordinated at the level of transcription 
(Stuart et al. 2003) has enabled these datasets to 
be analyzed to assign putative functional asso- 
ciations between genes. The evaluation of gene 
coexpression uses statistical metrics to establish 
correlations between gene expression profiles 
across many samples based on the principle of 
guilt by association (Usadel et al. 2009). 

Systems biology coordinates high-throughput 
data from biochemical, genetic, and molecular 
approaches. These coordinated data can be ana- 
lyzed through mathematical and computational 
modeling to uncover interactive networks on a 
global scale. The elucidation of coexpression 
networks is a top-down approach that relies on 
gene clustering from transcription data. A cluster 
whose annotated genes are overrepresented in a 
Gene Ontology (GO) category suggests that the 
biological process associated to that category 
(Ashburner et al. 2000) can also be extended to 
the other members of this cluster. Thus, unchar- 
acterized genes within the cluster are potentially 
participating in the same biological process as 
the annotated genes. Similarly, genes directly 
connected to (or co-expressed with) known cen- 
tral regulators of a developmental process are 
potentially within the identical metabolic path- 
way. Coexpression networks created using vari- 
ous combinations of these approaches have led 
to the identification, in Arabidopsis, of previ- 
ously uncharacterized genes involved in cell 
wall synthesis (Persson et al. 2005), flower 
development (Usadel et al. 2009), seed germina- 
tion (Bassel et al. 2011), and plant growth 
(Mutwil et al. 2010). 
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10.2.2 Online Resources 
for Sequence Datasets 


Prolinks (Bowers et al. 2004), PLEX (Date and 
Marcotte 2005), and STRING (von Mering et al. 
2005) online resources combine methods that 
predict functional interactions in genomes. These 
databases contain precomputed predictions for a 
number of sequenced bacterial genomes and can 
be searched for specific proteins. In addition, 
these databases allow users to submit their own 
query sequences and obtain the calculated func- 
tional predictions for all three methods. 

In plants, the effectiveness of functional 
genomics based on sequences increases with 
the number of completely sequenced genomes. 
In addition to the limited number of sequenced 
plants genomes, there is also a_ specific 
sequencing bias because annotations rely on a 
small number of model organisms. For exam- 
ple, the accuracy of the structure determined to 
be associated with a gene fusion (a hybrid gene 
formed from two previously separate genes) 
depends on how well the non-fused proteins 
that were involved in the fusion event are 
described. 


10.3. Experimental Datasets 


Complete genome sequences have enabled the 
development of technologies based on bench 
experiments. Whereas the largest variety of 
experimental technologies are currently being 
applied to model organisms such as yeast, mice, 
worms, and flies, a number of these technologies 
have been extended to plants. For example, pro- 
teomic datasets available for A. thaliana (Joshi 
et al. 2012) are based on DNA microarrays 
(Magrimi et al. 2008), chromatin immunopre- 
cipitation chips (Kaufmann et al. 2010), localiza- 
tion studies (Ito et al. 2014), gene essentiality 
studies (Zhang and Lin 2009), and metabolomics 
(Zenobi 2013; Landesfeind et al. 2014). In this 
section, we will focus on the datasets of func- 
tional genomic data based on bench experiments 
available for plants. 
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10.3.1 Proteomics 


The field of proteomics has rapidly evolved with 
the availability of whole genome sequences, and 
a number of techniques have been developed to 
probe the interactions that occur between pro- 
teins within a cell. The goal of proteomics is to 
provide evidence for protein—protein interactions 
that occur within cells. One of the most difficult 
challenges to this approach is the ability to extract 
and screen large numbers of proteins. In bacteria, 
plants, and animals, two major types of pro- 
teomic approaches have been applied for the 
genome-wide study of protein—protein interac- 
tions: yeast two-hybrid assays and protein profil- 
ing. Both approaches provide fundamentally 
different types of information about protein—pro- 
tein interactions, and each method has been suc- 
cessfully utilized to characterize the protein 
interactions that occur within cells and their asso- 
ciations to biological functions. Several assays 
have been developed to identify and predict pro- 
tein interactions in yeast, in cell suspensions, in 
plants, or in silico, and various public databases 
have compiled these interactions (Table 10.2). 


10.3.1.1 Yeast Two Hybrids 

The yeast two-hybrid (Y2H) system, first pro- 
posed by Fields and Song (1989), remains one of 
the most applied techniques to detect direct inter- 
actions between proteins. In this approach, the 
direct interaction of two proteins is detected by 
the activation of a specific reporter in an organ- 
ism such as yeast. The protein of interest, known 
as the prey protein, is fused to a DNA-binding 
domain of a specific transcription factor such as 
GAL4; a second protein, the bait protein, 1s fused 
to the activator domain of the same transcription 
factor. Both protein fusions are combined through 
the yeast mating system, and the interaction of 
both proteins causes the activation of the reporter 
transcription factor. Yeast colonies are plated on 
media that require the activation of the reporter 
gene through the successful interaction of both 
proteins. As a result, protein—protein interactions 
can be deduced from yeast growth. A high- 
throughput implementation by parallel sequenc- 
ing of PCR products has been proposed 
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Table 10.2 Freely accessible databases and tools for net- 
work analyses 


Types Tools 


Databases and warehouses 

BAR, Genevestigator, ATTED, 
AT-TAX, TileViz 

Pep2pro 

ANAP, BAR, PAIR, IntAct, 
AtPIN, AtPID, AthPPI, 
Associomics, PRIN (rice), 
BioGRID 

SUBA, Predotar, TargetP 
BAR, AMIGO, DAVID, 
FuncAssociate, BiNGO, 
ATCOECIS, MapMan, AraNet, 
AgriGO 

Comparative genomics Plaza, OrthologID, DoOP 


Gene expression 


Protein expression 


Protein—protein 
interactions 


Protein localization 
Gene set analysis 


Gene regulation and promoter architecture 

Motif detection BAR, ATCOECIS, TAIR 
patmatch, DoOP, STAMP, 
Weeder 

PLACE, AthaMap, AGRIS 
LeMoNe, ENIGMA, CORNET, 
Ondex, AraNet, GeneMania 

R, MATLAB, Systems Biology 
Workbench 


Binding site analysis 
Data integration and 
network inference 


Generic computation 
and modeling 
environments 


Data visualization AraCyc, PlantCyc, Cytoscape, 


MapMan, REACTOME 


(Calvenzani et al. 2012). In comparison with the 
Y2H, the yeast one-hybrid (Y1H) screen is 
designed to detect protein-DNA interactions, 
1.e., DNA-binding proteins that recognize a DNA 
sequence cloned upstream of yeast reporter 
genes. 


10.3.1.2 Protein Profiling 

In contrast to Y2H, in which individual protein— 
protein interactions are predicted, protein profil- 
ing serves to identify those proteins that are 
expressed by a cell under specific conditions. The 
ability to profile the protein output of an organ- 
ism can reveal how the cell responds to certain 
environmental conditions and provide clues to 
the function of these expressed proteins. The 
detection of the full complement of proteins 
expressed by a cell is obtained by the combina- 
tion of two-dimensional (2D) electrophoresis and 
mass spectrometry (MS). In this approach, a cell 
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is harvested for its proteins and separated using a 
2D slab gel. This results in the separation of pro- 
teins based on their isoelectric point (the pH at 
which a protein displays no net charge) and 
molecular mass. MS is used to further identify 
the proteins separated by 2D electrophoresis. 
More recent advances in MS technology, such as 
matrix-assisted laser desorption ionization time 
of flight (MALDI-TOF), have further increased 
the ability to identify proteins extracted from a 
gel. Specific aspects of these technologies were 
summarized in Hamrang et al. (2013). The latest 
advances in MS have also allowed the emergence 
of the profiling of more than 1,000 proteins in 
one study through Orbitrap technology (Scigelova 
and Makarov 2006; McAlister et al. 2008). This 
technology allows the relative quantification of 
profile components through a Fourier transform 
combined with the qualitative determination of 
proteins from _ trypsin-digested samples 
(Finkemeier et al. 2011). 


10.4 Large-Scale Transcriptional 
Regulatory Networks 


Tightly controlled gene expression is a hallmark 
of multicellular development and is accom- 
plished using transcription factors (TFs) and 
miRNAs. Although many studies have focused 
on identifying downstream targets of these mol- 
ecules, less is known about the factors that regu- 
late their differential expression. The mapping 
and analysis of the large-scale regulatory net- 
works associated with TFs and miRNAs allows 
the inference of their properties (Long et al. 
2008). A gene regulatory network (GRN) 
describes the interaction of regulatory factors and 
their targets. Two complementary ways have 
been described to map GRNs: TF oriented and 
target oriented. 

Chromatin immunoprecipitation (ChIP) cou- 
pled with microarrays (ChIP-chip) or next- 
generation sequencing (ChIP-seq) is a 
TF-centered approach for mapping GRNs. This 
technique involves immunoprecipitation of a TF 
and the bound target DNA with subsequent iden- 
tification of downstream targets. The ChIP 
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approach results in a network that describes the 
in vivo binding targets of a single TF. 

The YIH assay is also a _ target-oriented 
approach that allows the identification of 
upstream TFs that bind to promoters or motifs of 
interest. This assay requires the initial develop- 
ment of a TF collection or cDNA library of prey 
TFs that are assayed against bait promoters of 
interest in yeast (Deplancke et al. 2004). By con- 
trast, Y2H is TF oriented. 

ChIP studies have focused on TFs that are 
well characterized and typically abundant. In 
plants, ChIP studies have been performed pri- 
marily in Arabidopsis but also in rice (Oryza 
sativa) and maize (Zea mays) (Fornalé et al. 
2010; Moreno-Risueno et al. 2010). A protein 
expressed in a few cell types or developmental 
stages can be difficult to precipitate because of 
low abundance. Traditionally, this has been over- 
come by overexpressing and _ conditionally 
inducing the gene of interest by fusion to the glu- 
cocorticoid receptor or to a hormone-inducible 
promoter (e.g., the estrogen-inducible promoter). 
Although this approach provides the investigator 
with potential binding targets, this method 
obscures the developmental or tissue/cell type- 
specific context of the transcriptional regulation 
identified. Additionally, high-quality antibodies 
against TFs are required to achieve specific and 
sufficient immunoprecipitation. The use of TFs 
fused with a characterized protein (i.e., green 
fluorescent protein) or epitope allows the circum- 
vention of the development of protein-specific 
antibodies. However, the tagged protein must be 
verified to ensure that its expression reflects the 
wild type and that it can functionally comple- 
ment a loss-of-function phenotype. 


10.4.1 DNA Microarrays 


The analysis of gene expression data generated 
by high-throughput microarray transcript profil- 
ing experiments has shown that transcriptionally 
coordinated genes are often functionally related. 
Recently, several studies have demonstrated 
through the integration of gene homology and 
expression information how correlated gene 
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expression patterns can be compared between 
species. The incorporation of detailed functional 
annotations and experimental data describing 
protein-protein interactions, phenotypes, or 
tissue-specific expression provides an invaluable 
source of information to identify conserved gene 
clusters and translate biological knowledge from 
model organisms to crops. 

The purpose of a microarray experiment is to 
compare the steady-state mRNA levels between 
two samples that have been exposed to different 
conditions, either environmental or genetic. 
Messenger RNA is prepared from both samples 
and converted to cDNA with a reverse transcrip- 
tase. The fluorescent dyes green cyanine (Cy3) 
and red cyanine (Cy5) are used to label the two 
samples, respectively. Both samples are then 
combined and hybridized to the identical micro- 
array that typically contains thousands of regu- 
larly spaced DNA spots. The DNA in each spot 
contains the complement of a fragment of one 
ORF in the genome, and any gene that was 
actively transcribed in either sample will hybrid- 
ize with the complementary DNA spot on the 
microarray. Because the two samples are labeled 
with different dyes, the relative steady-state 
mRNA levels of each gene can be measured for 
each sample. Further explanation of DNA micro- 
array technology is available in Rhodius et al. 
(2002), Ray et al. (2010), Gibriel (2012), and 
Movahedi et al. (2012). 
10.4.1.1 Comparative Microarray 
Analysis 
The analysis of DNA microarrays typically 
revolves around the fundamental observation that 
genes sharing similar expression patterns from a 
collection of microarray experiments have a 
higher likelihood of being functionally related. 
Multiple experiments in a large number of model 
organisms have confirmed this hypothesis 
(Jansen et al. 2002). One of the first applications 
of DNA microarrays to bacteria was the compari- 
son of two genetically identical populations 
under different environmental conditions. For 
example, analyses of the gene expression of E. 
coli have been conducted by comparing microar- 
ray data for cells in minimal vs. rich media (Tao 
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et al. 1999) and for groups of E. coli containing 
small and large populations. Similar microarray 
experiments have also been reported for other 
bacteria under comparative conditions such as 
drug treatment in both M. tuberculosis (Shi et al. 
2012) and Streptomyces coelicolor (Weber et al. 
2005) and stress conditions in both B. subtilis (Ye 
et al. 2000) and P. putida (Puchatka et al. 2008). 
Additionally, genes associated with the pollen 
transcriptome in A. thaliana were described 
using this method (Becker et al. 2003). Thus, 
many studies have used microarrays to probe dif- 
ferential gene expression between two strains of 
the same bacteria or compare the difference in 
the plant life cycle due to alternations between 
diploid sporophytic and haploid gametophytic 
generations. In one type of experiment, a specific 
gene of interest is inactivated, and a comparative 
microarray analysis is conducted between the 
mutant strain and the wild type. 

The widespread use of microarrays as a tool 
for the large-scale investigation of differential 
gene expression in different organisms has 
resulted in the accumulation of a substantial 
amount of publicly available microarray data. A 
number of online databases exist to house these 
data, such as the ArrayExpress (Brazma et al. 
2006), the Gene Expression Omnibus (Davis and 
Meltzer 2007), and the Stanford Microarray 
Database (Hubble et al. 2009). The availability of 
a large number of microarray experiments under 
a wide variety of experimental conditions has 
stimulated the development of tools to analyze 
this abundance of data. 


10.4.2 RNA-Seq and Ampliseq 


RNA-Seq (for RNA sequencing) is the high- 
coverage sequencing of a transcriptome obtained 
by massive parallel sequencing (next-generation 
sequencing — NGS) to quantify mRNAs, 1.e., the 
mRNA count for each expressed gene in a 
genome. The number of reads that corresponds to 
a particular exon normalized by length yields 
gene expression levels that correlate with those 
obtained through qPCR (Li et al. 2008). RNA- 
Seq does not necessarily correlate with proteome 
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profiles because of posttranscriptional gene 
regulation events (such as RNA interference); 
however, measuring mRNA concentration levels 
is still a useful tool to determine how the tran- 
scriptional machinery of the cell 1s affected in the 
presence of external signals (e.g., chemical treat- 
ment) or how cells differ between a healthy and a 
diseased state. The correlation between the RNA- 
Seq quantities of mRNAs and proteomic profiles 
is obtained by sequencing polysomal mRNA 
from ribosomes isolated by sedimentation in 
sucrose gradients (Kleene et al. 2010). 

The mRNA composition of the transcriptome 
of a cell is continuously changing as opposed to 
the stable genome. High-coverage sequencing by 
NGS allows the quantification of mRNA with 
marginal frequencies such as those resulting from 
alternative transcript splicing (Eksi et al. 2013), 
posttranscriptional changes, gene fusion, muta- 
tions (indels, SNPs), and changes in gene expres- 
sion (Maher et al. 2009). In addition to mRNA 
transcripts, RNA-Seg can address different RNA 
populations such as total RNA, small RNAs (e.g., 
miRNA), tRNA, and ribosomal RNA profiling 
(ngolia et al. 2012). The limited coverage of 
microarrays increases the attractiveness of RNA- 
Seq. RNA-Seq data have been recently used to 
infer genes involved in specific pathways based 
on Pearson correlations with variance-stabilizing 
transformation approaches (Giorgi et al. 2013). 

RNA-Seq is obtained by aligning millions of 
reads to CDSs. Several software packages exist 
for short read alignments, and specialized algo- 
rithms for transcriptome alignments have been 
proposed, such as Bowtie to align short reads 
(Langmead et al. 2009), TopHat to align reads to 
a reference genome to discover splice sites 
(Trapnell et al. 2009), Cufflinks to assemble the 
transcripts and compare/merge them with others 
(Trapnell et al. 2010), and FANSe (Zhang et al. 
2012). These tools can also be combined to form 
a comprehensive system (Trapnell et al. 2012). 

Ampliseq is the one-shot sequencing of a high 
number of mRNA amplicons, typically in the 
range of 400 or more (Towler et al. 2008). This 
approach requires a previous study to uncover the 
gene pool that is worthwhile to measure by this 
method. Ampliseq is a type of parallel PCR that 
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measures the expression of a range of selected 
genes. This method has the advantage of being 
cheaper than RNA-Seq. 


10.4.3 Gene Expression Mapping 


The focus of the majority of microarray, RNA- 
Seq, and Ampliseq experiments is to determine 
the genes that are differentially expressed under 
specific conditions; however, identification of 
genes that are similarly expressed under the larg- 
est possible set of conditions is also worthwhile. 
Combining the differential gene expression pat- 
terns for an organism for a multitude of experi- 
mental conditions provides insights into genes 
whose expression is tightly correlated, which is a 
strong indication of their functional interaction. 
The first application of this approach was 
reported by Kim et al. (2001) for the model 
organism Caenorhabditis elegans. These authors 
obtained data from over 500 microarray experi- 
ments from a number of sources conducted over 
a wide range of experimental conditions and 
constructed a gene expression map. Bray (2004) 
studied changes in gene expression that can be 
used to model the response of a plant to environ- 
mental challenges. For instance, one study used 
microarrays to report the expression patterns of 
thousands of genes simultaneously to understand 
the types and quantities of RNAs present in cells 
in response to a water stress. The analysis of the 
resulting map showed that the genes exhibiting 
similar annotations were found to cluster 
together. 


Induced Mutations and Gene 
Editing 


10.5 


Mutagenesis has historically been a classical 
method to stably modify plant genome character- 
istics by exposing seeds to X-rays, gamma rays 
(Ahloowali et al. 2004), or chemical mutagens 
such as ethyl methanesulfonate (EMS) and 
dimethyl sulfate (DMS). Mutagenesis has also 
been performed by transposon activation (inser- 
tional mutagenesis) or random T-DNA insertion 
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(Alonso et al. 2003). The maize system (typi- 
cally, the suppressor-mutator (Spm) and activa- 
tor/dissociation (Ac/Ds)) was introduced in 
several plants of interest such as tomato (Cooley 
et al. 1996). Because the transposon sequence is 
known, any modification to the tagged gene can 
be easily identified by PCR and sequencing. The 
current preferential methods to induce mutagen- 
esis are experimental techniques that tag a par- 
ticular gene of interest. 


10.5.1 RNA Interference Technology 


The first description of virus-induced gene silenc- 
ing was performed in plants at the end of the last 
century (Lindbo et al. 1993; Kumagai et al. 
1995). Some years later, RNA silencing was 
described in other eukaryotes and was referred to 
as RNA interference (RNAj1) (Katoch and Thakur 
2013). Plants and insects have evolved to use 
RNAi as a protection against virus infection 
(Obbard et al. 2009). RNAi silences genes 
through a process 1n which double-stranded RNA 
(dsRNA), also referred to as hairpin RNA 
(hpRNA), is broken into small interfering RNAs 
(siRNAs) by a RNase III named dicer. The siR- 
NAs then assemble into a RNA-induced silenc- 
ing complex (RISC) that uses the siRNAs as 
guides for complementary mRNA degradation. 
Plants possess specific RNAi machinery to 
degrade the dsRNAs produced by viruses, and 
the siRNAs produced may act as guides over long 
distances through phloem transportation (Voinnet 
and Baulcombe 1997). The introduction of long 
dsRNAs or siRNAs has now been adopted as a 
potential tool for inactivating gene expression in 
living organisms. 

The development of RNAi technology has 
allowed the investigation of gene function and 
regulation by knocking out the specific gene. 
Metabolic pathways controlling notable traits 
were uncovered through this type of biological 
engineering. In plants, RNAi is induced by intro- 
ducing expression vectors that transcribe a self- 
complementary dsRNA (Horiguchi 2004). The 
transformation method is one of the factors that 
affect the efficacy of gene knockouts in plants. 
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The Agrobacterium-mediated transformation 1s 
generally the preferred transformation technique. 
In plants that are incompatible withA grobacterium 
infection, the direct introduction of RNAi vectors 
via particle bombardment and electroporation 
has also been used (Katoch and Thakur 2013). 
The use of RNAi technology to (i) improve plant 
traits offers a huge potential for agricultural 
development. RNAi technology has been used to 
improve traits, including those related to nutri- 
tional quality, content of food allergens and toxic 
compounds, and defense against biotic and abi- 
otic stresses; (11) induce morphological varia- 
tions, male sterility, and secondary metabolite 
synthesis; (411) produce seedless plant varieties 
(Saurabh et al. 2014; Tables 10.3); and (iv) create 
resistance to pathogens such as viruses, nema- 
todes, insects, bacteria, and fungi (Table 10.4). 


10.5.2 Clustered Regularly 
Interspaced Short Palindromic 
Repeats (CRISPRs) 


Each CRISPR locus consists of a series of short 
repeats that are separated by non-repetitive spacer 
sequences derived from foreign genetic elements 
(Fig. 10.1). This conserved repeat-spacer-repeat 
architecture was originally observed in the E. coli 
genome in 1987 (Ishino et al. 1987), but the func- 
tion of these repeats remained enigmatic until 
2005. The spacer sequences in CRISPR loci are 
often identical to sequences in bacteriophage 
(virus) genomes and plasmids (Bolotin et al. 
2005). 

CRISPRs and their associated genes (Cas) are 
essential components of cellular defense systems 
against infections by foreign nucleic acids and 
are widespread in bacteria and archaea. Similar 
to the RNAi pathways in eukaryotes, CRISPRs 
rely on small RNAs for the sequence-specific 
degradation of invading nucleic acids such as the 
nucleic acids of viruses (Sorek et al. 2013). 
CRISPR differs from DNA restriction enzymes 
in the sense that being RNA guided, CRISPR 
may be programmed to degrade any RNA or 
DNA substrate. The guided RNA nuclease activ- 
ity of CRISPR is now being explored to engineer 
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Table 10.3 RNAi for crop plant improvement 
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Plant Gene targeted Trait Benefit References 
Arabidopsis ACR2 gene Increased arsenic uptake Phytoremediation of heavy Dhankher et al. 
by the plant metal soils (2006) 
Arabidopsis MS1 Degeneration of the Male sterility Wilson et al. 
tapetum (2001) 
Coffee bean CaMxMt 1 gene Reduced caffeine Decaffeinated coffee Ogita et al. 
content in the seeds (2003) 
Cotton ghSAD-1 and Increased stearic acid Useful for cooking Liu et al. (2002) 
ghFAD2-1 genes and oleic acid content of applications without the 
seed oil need for hydrogenation 
Cotton 6-Cadinene synthase Reduction in gossypol Health benefits Sunilkumar et al. 
gene (2006) 
Maize Gene for starch- Increase in amylase Maize quality improvement Chai et al. (2005) 
branching enzyme content 
Maize Ms45 Male sterility in maize Male sterile lines Cigan et al. 
(2005) 
Maize Opaque-2 Reduced leucine levels Higher lysine content Segal et al. 
(2003) 
Oilseed rape BPI gene Reduced or absent petals Improved photosynthesis Byzova et al. 
(2004) 
Onion Lachrymatory factor Reduced lachrymatory — Tearless onion Eady et al. (2008) 
synthase (LFS) factor 
Petunia Chalcone synthase Anthocyanin Improved market value Hirner et al. 
(CHS) genes biosynthesis (2001) 
Petunia TAZ1 Degeneration of the Male sterility Kapoor et al. 
tapetum (2002) 
Rice Lgcl Low glutelin content Health benefits Kusaba et al. 
(2003) 
Tobacco CHI gene Flower coloration Ornamental plants Nishihara et al. 
(2005) 
Tomato 1-Aminocyclo Longer shelf life Reduced ethylene Xiong et al. 
propane-1l-carboxylate associated with slower __ sensitivity (2005) 
oxidase ripening 
Tomato DET1 gene Increased carotenoid and Nutraceuticals and health Davuluri et al. 
flavonoid content benefits (2005) 
Wheat SBEIa and SBEIIb High amylose Improved bowel functions Regina et al. 
(2006) 


Adapted from Katoch and Thakur (2013) 


nucleic acids in various model systems, including 
plants (Fig. 10.2). 


10.5.3 Zinc Finger Nucleases (ZFN) 
as Precision Tools for Genetic 
Engineering 


Most endonucleases, such as restriction enzymes, 
recognize short sequences of four to eight base 
pairs and will therefore cut the genome many 
times. By contrast, zinc finger nucleases are syn- 


thetic endonucleases that have larger recognition 
sites. These nucleases are produced in two halves 
(ZFN-A and ZFN-B). Each half binds a target 
sequence of 12—18 bases and contains a mono- 
mer of the endonuclease Fok I. When the two 
halves are close enough for the Fok I domains to 
dimerize, the endonuclease becomes active and 
cuts the intervening sequence (Fig. 10.3). 

The requirement for such extended binding 
sites focuses the activity and restricts the binding 
to a single site in a genome. In mammalian cells, 
a double-stranded break in DNA can be repaired 
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Table 10.4 RNAi for pest and disease resistance 
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Trait Pest/disease Target gene Host References 
Insect Helicoverpa armigera CYPAE14 Cotton Mao et al. (2007) 
resistance Corn rootworm V-ATPase A Maize Baum et al. (2007) 
Virus Rice dwarf virus (RDV) PNS12 Rice Shimizu et al. (2009) 
resistance — Bean golden mosaic virus (BGMV) AC1 gene Bean Bonfim et al. (2007) 
BYDV (barley yellow dwarf virus) BYDV-PAV Barley Wang et al. (2000) 
Nematode Meloidogyne incognita Splicing factor and Tobacco Yadav et al. (2006) 
resistance integrase 16D10 
Meloidogyne Tisl1 Arabidopsis Huang et al. (2006) 
Meloidogyne javanica Tobacco Fairbairn et al. (2007) 
Bacteria Xanthomonas citri subsp. citri (Xcc) PDS and CalS1 Lemon Enrique et al. (2011) 
resistance 
Fungus Phytophthora infestans SYRI Potato Eschen-Lippold et al. 
resistance (2012) 
Phytophthora parasitica var. GST Tobacco Hernandez et al. (2009) 
nicotianae 
Blumeria graminis f. sp. tritici MLO Wheat Riechen (2007) 


Adapted from Saurabh et al. (2014) 
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Fig. 10.1 CRISPR-Cas is a microbial adaptive defensive 
system that uses RNA-guided nucleases (Cas) to cleave 
foreign genetic elements. CRISPR system comprises a 
cluster of CRISPR-associated genes (Cas), noncoding 
RNAs, and a distinctive array of repetitive elements 
(direct repeats). These repeats (R, black diamonds) are 
interspaced by short variable sequences derived from 
exogenous DNA targets known as protospacers (red and 
blue lines), and together they constitute the CRISPR RNA 
(crRNA) array. Within the DNA target, each protospacer is 
always associated with a protospacer adjacent motif 
(PAM), which can vary depending on the specific CRISPR 
system. The CRISPR system II is composed of the nucle- 
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ase Cas9, the crRNA array that encodes the guide RNAs, 
and a required auxiliary’ trans-activating crRNA 
(tracrRNA) that facilitates the processing of the crRNA 
array into discrete units. The exogenous DNA, typically 
from virus, 1s processed by proteins encoded by Cas genes 
into small elements (~30 base pairs in length), which are 
then inserted into the CRISPR locus of the bacterial 
genome. RNAs from CRISPR loci are constitutively 
expressed and are processed by tracrRNA and Cas pro- 
teins into crRNAs. Cas9 protein binds to the dsDNA/ 
crRNA complex to cut the double DNA strands at the tar- 
get motive (Adapted from Wilkinson and Wiedenheft 
(2014)) 
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by nonhomologous end joining, leading to gene 
disruption, or by repair via a donor plasmid. 
Depending on the donor, this method can be 
used to correct mutations or to insert heterolo- 
gous DNA. In recent years, ZFN technology has 
been used to create transgenic mammals, 
amphibians, fish, insects, and plants (Urnov 
et al. 2010). 


10.5.4 Genome Engineering 


Upon the double-stranded breakage induced by 
CRISPR or ZFN, a donor DNA identical to the 
wild-type sequence may be used to restore the 
DNA to the original sequence by homologous 
DNA repair (Ran et al. 2013). However, foreign 
DNA may also be used to introduce directed 
mutation such as single nucleotides (single 
nucleotide polymorphism — SNP), indels, or even 
whole gene replacement. The directed delivery of 
foreign DNA to specific locations in the genome 
driven by CRISPR RNA-guided nucleases 
(Fig. 10.2) or ZFN (Fig. 10.3) is being proposed 
as a gene therapy methodology to repair, modify, 
or replace defective genes. 

CRISPR and ZFN allow the knockout of bi- 
alleles in plants or animals using a single-step 
process that is useful in hastening in vivo genetic 
studies in living systems (Chang et al. 2013; 
Hwang et al. 2013a, b; Nakayama et al. 2013; Li 
et al. 2013a, b). A notable feature of CRISPR and 
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ZEN is the possibility of editing several genes in 
the identical genome or excising large genomic 
segments located between two different cleavage 
sites (Gratz et al. 2013). Multiplexing may be 
particularly useful for knocking out redundant 
genes or pathways (Wilkinson and Wiedenheft 
2014). 
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Fig. 10.2 Cas9 DNA targeting by PAM recognition and 
crRNA base pairing. The Cas9 nuclease recognize PAM 
and the pairing product of crRNA and target DNA. DNA- 
induced double-stranded breaks are repaired by nonho- 
mologous end joining (NHEJ) or homology-directed 
repair (HDR). NHEJ is error-prone, resulting in insertions 
or deletions (indels) that disrupt the target site. HDR relies 
on a donor template that can be used to deliver foreign 
DNA at a specific location (Adapted from Wilkinson and 
Wiedenheft (2014)) 
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Fig. 10.3 Principle of DNA double break induction by 
ZEN. A dimer of four ZFs bounds to the target cleavage 
site. Each ZF recognizes 3 bp; thus, a chain of four ZF 
recognizes 12 bp. A combination of four ZFs is chosen 
to design a ZEN specific of each strand at the DNA motive 


to be broken. A spacer of 5—7 bp can be left between 
the break points on the + and — strands according to the 
application. Typically, the ZFN construct is brought 
together with a sequence to be inserted at the break point 
by homologous recombination 
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The simplicity in programming RNA-guided 
Cas9 nucleases has contributed to their fast 
implementation by the genome engineering com- 
munity. Currently, a shared list of expression 
plasmids is available to the scientific community 
at Addgene.org (Wilkinson and Wiedenheft 
2014). 


10.6 Integration of Functional 
Genomics Datasets: Systems 


Biology 
10.6.1 Metabolic Networks 


Genome-scale metabolic network models have 
been built and applied to study a wide variety of 
topics, including adaptive evolution, network 
properties, and metabolic engineering (Feist and 
Palsson 2008). Genome-scale metabolic models 
in metabolic engineering have been used to cou- 
ple growth rates with the overproduction of lac- 
tate (Fong et al. 2005) and L-Val (Park et al. 
2011) in Escherichia coli and to prioritize alter- 
native engineering strategies for overproducing 
ethanol in yeast (Bro et al. 2006). Although the 
majority of the work in metabolic network recon- 
struction and application has been performed in 
microbes, several advances in plant metabolic 
network reconstruction and modeling have been 
described recently. These advances pave the way 
for rational plant metabolic engineering. 

Two types of plant metabolic networks have 
been produced: descriptive and _ predictive. 
Genome-wide descriptive metabolic networks 
have been obtained by referring to metabolic 
pathway databases that are being curated with 
experimental data from the literature (Caspi et al. 
2012; Kanehisa et al. 2012). These genome-wide 
metabolic network representations are now 
available for several plant species, such as 
Arabidopsis (Zhang et al. 2010), Populus tricho- 
carpa (Zhang et al. 2010), and grasses (Youens- 
Clark et al. 2011). Recently, the plant metabolic 
network group published online genome-wide 
predictive metabolic networks for maize, soy- 
bean (Glycine max), cassava (Manihot escu- 
lenta), and wine grape (Vitis vinifera) (www. 
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plantcyc.org; Zhang et al. 2010). Most of these 
networks (except for Arabidopsis) are based on 
searches for homology with existing descriptive 
networks and are eventually not curated through 
extensive reference with experimental data from 
the literature; thus, caution should be used when 
interpreting these models. Descriptive and pre- 
dictive networks were already used for a variety 
of studies, including the identification of novel 
genes in glucosinolate metabolism (Chan et al. 
2011), the role of duplicated enzymes (Hanada 
et al. 2011), and metabolomic responses to UV-B 
(Kusano et al. 2011). Because they are descrip- 
tive, qualitative networks have been useful in 
many studies; however, they are not predictive 
and are therefore limited to the purpose of facili- 
tating metabolic engineering. 

The accumulation of a wide variety of differ- 
ent functional genomics datasets has resulted in 
the development of the field of systems biology 
(Kitano 2002; Hwang et al. 2005a). Systems biol- 
ogy attempts to decipher the complex behaviors 
exhibited by an organism by understanding how 
these behaviors are derived from the interactions 
of the underlying components. Although this is a 
broad statement, when viewed within the frame- 
work of functional genomics, this description can 
be reduced to the specific interactions that exist 
between proteins, proteins and metabolites, or 
proteins and DNA within cells. In essence, sys- 
tems biology is a method for the interpretation of 
protein functions in the context of their 
interactions. 

The view that functions can be encapsulated 
into objects of a genetic network represents a 
paradigm shift in the way we represent the 
genome of any living organism. The ability to 
quantify the inputs and outputs that exist for 
genes and proteins within and between objects 
represents a critical step in describing the func- 
tion of a genome. Each genomic dataset repre- 
sents one type of functional relationship that 
defines the objects and shapes the composition of 
genetic nodes in these networks. Systems biology 
strives to unify the contextual framework with 
their functional relationships through the integra- 
tion of multiple algorithms and genomic datasets. 
The current generation of these algorithms 
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focuses on probabilistic integration, and the most 
recent algorithms have demonstrated the ability 
to recapitulate known pathways such as galactose 
utilization in yeast (Hwang et al. 2005b, c). 

Because all functional genomics datasets may 
be used to predict functional interactions, the 
integration of these datasets is thought to provide 
a greater confidence in the individual predictions. 
In other words, an interaction that is predicted to 
occur between two genes in DNA microarray 
experiments combined with phylogenetic infer- 
ence has a higher probability of being a signifi- 
cant interaction than if the interaction was 
predicted by only one of these methods. This pos- 
tulate has gained wide acceptance within the sys- 
tems biology community, and an ever-increasing 
number of new algorithms for the integration of 
functional genomic datasets are being reported 
(Joyce and Palsson 2006). Notably, the most 
profitable advances in the application of these 
algorithms are found in the integration of a large 
number of functional genomic datasets. 
Unsurprisingly, the majority of integrative algo- 
rithms are being reported in model organisms, 
such as yeast, mouse, fly, and Arabidopsis, in 
which a large number of functional genomic 
datasets are already available. 

Several thousand expression microarray 
experiments performed on samples from 
Arabidopsis have been released to the public 
domain, and using these data to investigate meta- 
bolic networks in plants is now possible. With 
Web tools such as CressExpress 
(Srinivasasainagendra et al. 2008), users can 
identify clusters of genes that are co-expressed 
with one or more genes of interest. This method 
allows the identification of new members of met- 
abolic pathways that are regulated at the level of 
mRNA abundance. 


10.6.2 Functional Genomics 
as Genome Annotation 


Functional genomics is an advanced form of 
genome annotation, 1.e., the process of attributing 
biological definition to a DNA _ sequence. 
Annotation can be summarized as the application 
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of two largely independent steps: the identifica- 
tion of a list of genetic elements on a genome (a 
process called gene finding or structural annota- 
tion) followed by the attachment of a biological 
definition to these elements (a process called 
functional annotation) (Zagursky and Russell 
2001). Structural annotation is largely automated. 
Structural annotations are considered relatively 
stable. By contrast, functional annotations are 
dynamic and are subject to frequent changes as 
the body of experimental data grows or algo- 
rithms for gene predictions improve. 

The functional annotation of a gene is tradi- 
tionally assigned by one of two methods. In the 
first method, a novel annotation is produced 
through experimental investigations conducted 
on a specific gene, and the accumulated knowl- 
edge is assigned to that gene. The second method 
is annotation by homology, in which a functional 
annotation is assigned to one gene in a genome 
because the gene is homologous to a second gene 
that has been annotated in another organism. 
Unfortunately, annotation by homology will con- 
tribute to error propagation when the reference 
gene 1s not correctly annotated. The propagation 
of annotation error by homologous comparison is 
called genome rot (Galperin and Koonin 1998; 
Gerlt and Babbitt 2000). The annotation of genes 
through homology is routinely applied to newly 
sequenced genomes, and approximately 65 % of 
the predicted ORFs can be assigned with some 
annotation (Karaoz et al. 2004). The challenge is 
the assignment of a functional annotation to the 
remaining 35 %. The critical step is the 
confirmation of the annotation prediction 
obtained by high-throughput assignments. Thus, 
a long and tedious work of inference confirma- 
tion must be performed by experts 1n experimen- 
tal biology to validate predictions. 

The application of functional genomics to 
experimental research also results in the rapid 
accumulation of a large amount of data, and new 
methods are needed to manage and disseminate 
these data to the research community. As a result, 
data curation by experts is integral to the annota- 
tion process of any model organism’s genome. 
Following the traditional curation model, a group 
of professional annotators attempt to synthesize 
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the vast amount of scientific knowledge for each 
model organism to annotate the genome (Stein 
2001). Other types of curation processes are the 
party model, in which a community of scientists 
will meet periodically for the purposes of joint 
genome annotation, and the cottage industry 
model, in which experts within a field are 
recruited on a part-time basis to verify the consis- 
tency of the annotation. 
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Maria Elena T. Caguioa, Manish L. Raorane, 
and Ajay Kohli 


Abstract 

The increasing number of novel tools, methods and capacities to elucidate 
and understand the genome of various organisms 1s steadily leading the 
path for similar groundbreaking studies in downstream comprehension of 
the genomes. Platforms for analytical and functional transcriptomics 
have seen a similar surge in operational outputs. The cryptic complexity of 
the DNA in the genomes is manifested in various forms and functions 
of the RNA it codes for. Such complexity reaches much more elaborate 
intricacies in the translational protein products of the RNA. Although tran- 
scripts regulate a number of biological processes, proteins form the basic 
functional unit of most procedures that are evinced as a phenotype. 
Progressively increasing realization that there is limited correspondence 
between the level of a specific transcript and protein is only the beginning 
of appreciating that predictive biology would be less dependent on tran- 
scripts than on proteins. Principles of protein function that were explored 
much before the DNA and RNA paradigms are coming back to inform the 
omics world in the functional and regulatory capacity of proteins. 
Combined with the evolution of the high-throughput analytical capacity 
for proteins, these are clubbed into what is called proteomics, which 
is also riding the wave of computational and electronic revolutions in 
biology. Using such omic technologies for agriculture however lags behind 
compared to their use in medicine. Nevertheless, the recent appreciation 
of the need for food and feed security under the predicted climato- 
demographic scenarios is pushing the frontiers of agricultural research. 
The use of proteomics in particular is gaining ground rather quickly, and 
an increased understanding of crop phenotype as related to proteomics is 
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being established. A general background of proteomics and its importance 
in agriculture is presented followed by a more detailed treatment of pro- 
teomics as applied to selected important crops in agriculture. A limited 
comparison is presented between plant and non-plant proteomics to high- 
light the gaps, and finally a perspective on the future utility of plant pro- 


teomics is presented. 


Keywords 


Proteomics ¢ Crops ¢ Agriculture * Food ¢ Translational research 


11.1. Introduction 

Crops are the main source of human and animal 
food supply. Feeding the growing 7.1 billion 
world population, which is projected to increase 
to nine billion by the year 2040, is certainly of 
top priority for ensuring food security. Also, the 
use of arable land for commercial purposes and 
current climate change scenarios are also aggra- 
vating the food security issues. Though the chal- 
lenge seems insurmountable, high-throughput 
platform technologies have had an evident impact 
in recent times in analysing and understanding 
biology at a speed never possible before. The 
chief example of such advances is the genome 
sequencing technologies, which have evolved at 
a rate outpacing Moore’s law. Advances with 
computational biology for DNA sequence analysis 
have positively affected similar efforts in analysing 
other aspects of biology. This has indeed pro- 
vided strong belief in the capability to meet the 
challenge. For the past 13 years, a rapid progress 
in the study of plant genomics is also evident. 
The completion of the genome sequencing projects 
of the model system such as Arabidopsis 
thaliana in 2000 (The Arabidopsis Genome 
Initiative 2000), followed by rice in 2002 (Goff 
et al. 2002; Yu et al. 2000; IRGSP 2005), poplar 
(Tuskan et al. 2006), castor (Lu et al. 2007), sorghum 
(Bedell et al. 2005), rape seed (Jaillon et al. 2007), 
soybean (Schmutz et al. 2010), tomato (http:// 
solgenomics.net/organism/Solanum_lycopersi- 
cum/genome), maize (Wilson et al. 2009; Gore 
et al. 2009; Dolgin 2009), banana (http://banana- 
genome.cirad.fr/) and Medicago  truncatula 
(http://www.medicago.org/) have paved the way 


for further intricate research using the advanced 
high-throughput omic technologies (Agrawal 
et al. 2011). 

Much interest is now focussed on proteomics 
after steadily accumulating evidence of lack of a 
linear correlation between transcripts and func- 
tional protein expression levels, in an increasing 
number of genes (Hajduch et al. 2010). Such 
findings may be attributed to protein stability, 
in part due to the various post-translational 
modifications, certain signalling networks and 
protein-protein interactions. Earlier, genomics 
was considered as the primary source of informa- 
tion for functional biology. However with the 
advent of new technologies which allow for 
better understanding of different cellular activities, 
the protein complement is seen as highly relevant 
in predicting cellular/organismal function/pheno- 
type. Differential modifications of the gene prod- 
ucts occur due to various signals perceived by the 
cell. This in turn modulates the activity of genes 
per se. So what then is proteomics and what is its 
place in agriculture and food security? 


11.2 Whatls Proteomics? 


Proteomics in a narrow sense is a high-throughput 
cataloguing of the repertoire of an entire set of 
proteins of a cell/tissue/organism at defined spatio- 
temporal points. It is based on a combination of 
several methods such as protein separation, mass 
spectrometry, genome sequencing/annotation and 
protein search algorithms (Thelen 2007). However, 
in a broader sense, proteomics is not only used to 
catalogue protein of a given biological material, 
but it is also utilized to monitor the reversible 
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PTMs such as phosphorylation, glycosylation, 
ubiquitination, acylation, prenylation, sulphuriza- 
tion, methylation, glycosylation, oxidation, nitro- 
sylation, etc. A recent review takes account of the 
techniques for PTM analysis through proteomics 
(Remmerie et al. 2011). To date, more than 300 
different types of post-translational modification 
have been characterized with the aid of proteomics. 
The rapid development of proteomics depends 
heavily on the dynamic advancement of mass 
spectrometry and bioinformatics as it elucidates 
protein function, domain structure, subcellular 
location, post-translational modifications, splice 
variants, quantities, biochemical structures and 
cellular functions of all proteins in an organism, 
organ or organelle across time and physiological 
state (Kenyon 2002, www.ncbi.nih.gov/pubmed/). 


11.3. Historical Perspective 
of Proteomics 


The vision of indexing human proteins started 
long before Human Genome Project was con- 
ceived. In the late 1970s, Norman G. Anderson 
and N. Leigh Anderson of the Department of 
Energy’s Argonne National Laboratory devel- 
oped the Human Protein Index. Its objective was 
to enumerate human proteins by separation on 
2D gels (O’Farrel 1975), which was the only 
approach possible during those days. However, 
the effort was not sufficient because the technol- 
ogy on mass spectrometry and other suitable 
methods were not available. It was not until 1994 
when an Australian PhD student, Marc Wilkins, 
coined the word proteomics while working on the 
concept of a protein complement. From then until 
now, the field of plant proteomics has flourished 
on different model systems and crops, to differ- 
ent levels of throughput, depending upon the crop 
or model-specific information and funding avail- 
able. The extent of progress has been exceptional 
because the number of published research articles 
from a single paper in 1999 had reached to 3,806 in 
December 2013 using a single search term “plant 
proteome” in the PubMed database. Mostly 
studied are Arabidopsis, rice, maize, soybean and 
other emerging model crops. 
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11.4 Importance and Relevance 

The field of proteomics is particularly important 
because most environmental and physiological 
responses of crops are manifested at the protein 
level. Consequently, proteomics seeks to directly 
correlate the involvement of specific proteins, 
protein complexes and their modification status 
to a given response. 

Proteomics can singly serve as alternative to 
mRNA-based measurements because it can mea- 
sure the actual protein levels, profile the expres- 
sion dynamics of the cellular compartments and 
measure the large-scale protein modifications 
and their quantitative changes upon disturbance 
to the cell. Thus, it can determine the cellular 
response at the level of protein abundance and the 
downstream consequence in terms of gene 
expression changes (Cox and Mann 2007). In 
plants, proteomics was first used to annotate 
individual proteins to supplement genome and 
transcriptome information. However, today pro- 
teomics does not only characterize proteins but 
rather tries to understand the cellular response to 
environmental, and developmental change 
(Rakwal and Agrawal 2003). 


11.5 Workflow of Proteomics 


The process of proteomics can be carried out by 
first identifying a goal which could be to isolate, 
identify, characterize and/or measure the differen- 
tial expression of a protein. This is followed by 
determining the tools for identification, character- 
ization, analysis and validation. An attribute of a 
proteome study commences by extracting proteins 
from a sample followed by the classical method of 
2D PAGE to separate proteins followed by high- 
performance liquid chromatography (HPLC) for 
amino acid analysis, peptide mass fingerprinting 
using mass spectrometry (matrix-assisted laser 
desorption/ionization, MALDI-MS, or electro- 
spray ionization, ESI-MS) or sequence and 
sequence tags using post-source decay (PSD) for 
post-translational modifications as shown in 
Fig. 11.1. 
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Fig.11.1 Workflow of proteomics studies (Source: www.cn.mbnet.org) 


11.6 Proteomic Avenues 


The common approach towards a proteomic 
study was a focus-based approach that aimed 
to understand a specific protein from among 
the set of proteins found to be differentially 
expressed. However, with the swift progress in 
this field, a more global perspective now pervades 
proteomic studies. This now entails the need for a 
broader-based approach where the proteome as a 
whole is analysed and interpreted for its biologi- 
cal significance. The critical aspect in this strategy 
is to ask the correct biological question, choose 
the correct tissue at the correct stage in the life 
cycle and/or an environment of/for an organism 
and make use of the most advanced technologies. 
Table 11.1 shows the difference between the two 
approaches. 


Frontiers and Methods Used 
in Crop Proteomics 


11.7 


After the first publication on proteomics in the 
year 1999 by Marc Wilkins, more than 300 
papers were available at PubMed portal in the 
year 2009, covering various new techniques and 


discoveries on Arabidopsis, rice, maize, soybean 
and other major crops. After 4 years, about eight 
times (2,361 published papers) more publications 
are available on crop proteome studies. 

Current crop proteomic methods in most labs 
still rely on gel and mass spectrometry (MS)-based 
methods. The gel-based method includes the 
classical strategy of conventional two-dimensional 
gel electrophoresis (2DE) and two-dimensional 
differential fluorescence-based gel electrophoresis 
followed by enzymatic digestion and identifica- 
tion of the proteins by mass spectrometric analysis. 
The improvements and results gained over time 
in proteomics research have shown that the 
behaviour and variability of proteins are more 
complex than had ever been imagined. Recent 
advances have led to gel-free technologies. The 
process now starts with direct digestion of the 
proteins of a complex mixture followed by protein 
identification also called as bottoms-up approach. 
A new top-down approach has also been used in 
some cases where the liquid chromatography is 
used directly on the complex mixture for the 
separation of the protein species followed by the 
identification of the protein species by mass 
spectrometry. The MS-based quantification 
methods consist of label-based and label-free 
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Table 11.1 Comparative description of broad- and focused-based approaches in proteomic study 


Basic steps 


Broad-based approach 
Identify an organism 
Identify sample type and preparation 


Utilize a technology that is compatible with 
the same type 


Bioinformatic analysis of the proteomic 
sample 


Build a proteomic model 


Focused-based approach 

Identify protein 

Understand sample type and preparation 
Isolate protein 


Utilize analytical technology that is 
compatible 


Bioinformatic analysis of the protein 
sample 


Model protein’s function 


Inexpensive and results can be generated 
much quicker than a whole proteome study 


Functional information about a protein can 


Pros Proteomic information about a specific tissue 
under certain condition can be gained 
Relationships between many protein can be 
understood 

Cons Extensive upfront planning 


The study will cost more and last longer than 


a focused study 


be determined 

Protein information may not be valuable in 
a global proteomic perspective 

Hard to deduce any cross networks or other 
protein-protein interactions 


Proteomic data will be generated in the end 


Common technologies 
implemented 


SDS-PAGE gel electrophoresis 
2DE-DIGE 

HLPC 

Mass spectrometry 


SDS-PAGE gel electrophoresis 

HLPC 

Mass spectrometry 

Molecular modelling tools (bioinformatics) 


Proteomic modelling tools (bioinformatics) 


protein quantification approaches (Nanjo et al. 
2011) including MALDI-TOF (Sommerer et al. 
2007), 1TRAQ (isobaric tag for relative and 
absolute quantitation; Wiese et al. 2007), tandem 
mass tag approach (Thompson et al. 2003), 
MudPIT (multidimensional protein identification 
technology; Whitelegge 2002), SILAC (stable 
isotope labelling by amino acids in cell culture; 
Gruhler and Jensen 2008), ICAT (isotope-coded 
affinity tags; Lilley and Dupree 2005) and LOPIT 
(localization of organelle proteins by isotope tag- 
ging; Dunkley et al. 2004). Liquid 
chromatography-mass spectrometry (LC-MS) 
and label-free LC-MS are powerful tools that 
have facilitated the quantification and functional 
proteomic analyses of proteins. These tools and 
methods are used solely or in combination with 
other tools or with other “omics” techniques. 
However, no matter how much technological 
advancement improves or replaces the ones avail- 
able, there will always be unique methods for 
each environmental stress the crops encounter 
because their utility depends on the objective of 


the researcher and each of these methods has their 
own advantages and disadvantages. 

The readily available technologies are contin- 
uously improved for superior, more dependable 
results and afford deeper understanding on 
the phenomenon being studied. An example is 
the evaluation of 2D PAGE in rice caryopsis 
which demonstrated superior staining effect 
when silver staining protocol containing glacial 
acetic acid, sodium thiosulphate and Coomassie 
Brilliant Blue staining method was used. The 
other example was achieving higher resolution 
with IPG gel strip of pH 5—8 having an optimal 
loading of 130 wg and silver nitrate staining 
protocol (Liao and Huang 2011). Labelled 
LC-MS/MS has been used to establish cellular or 
subcellular proteome maps, characterize plant- 
pathogen interaction or stress response and profile 
protein patterns during developmental processes 
(Matros et al. 2010). The application of an 
automated approach for the derivatization of 
peptides for facilitated MS/MS de novo sequence 
determination in banana was reported, which 
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further concludes that the proteome does not always 
correspond to the presumed genome formula and 
that proteomics is a powerful tool to characterize 
varieties (Carpentier et al. 2010). 


11.8 Crop Proteomics 
11.8.1 Rice and Wheat 


For more than a decade, rice has been the model 
crop plant for progress in establishing techniques 
for proteome studies of different tissues and 
organelles. The initial proteomic studies in rice 
progressed slowly. Research endeavours were 
mainly on constructing protein data files. 
Recently, dedicated groups have treaded the 
difficult path of systematically analysing the rice 
proteome at the cellular, tissue and whole-plant 
level (Rakwal and Agrawal 2009). From there on, 
research on rice tissues, organs and organelle 
under various abiotic stresses (drought, salt, 
oxidative, temperature, nutrient, hormone, metal 
ions, UV radiation and ozone) as well as various 
biotic stresses were performed. Dependable 
protein identification was facilitated by the 
combined power of MS technology and high- 
resolution 2D PAGE. The rice proteome was 
probed intensively through mapping PTMs, 
performing inter- and intraspecies comparisons, 
integrating proteomics with other “omics” 
technology-generated data and probing func- 
tional aspect of individual proteins (Agrawal 
et al. 2009). 

Different abiotic stresses have been observed 
to yield different proteome responses in rice. 
Quantitative label-free shotgun proteomic analy- 
sis showed that 300 out of 850 proteins were 
found to be responsive to extreme temperatures 
and have unique expression either at 12 or 
44 °C. In addition, there were novel response 
proteins that use an alternate sucrose pathway in 
response to extreme temperature (Gamulla et al. 
2010, 2011). In terms of proteomic response 
of rice during salt stress, nano-LC-MS/MS and 
gel-based shotgun proteomic analysis revealed 
that ubiquitination of proteins in rice roots affected 
the protective mechanism of rice seedlings to 
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resist salt stress (Liu et al. 2012b). For drought, 
label-free quantitative shotgun proteomics 
showed that there was a differential regulation of 
aquaporins. Similarly, proteins involved in 
signalling and transport showed differential 
regulation, suggesting a critical role in the overall 
response to drought (Mirzaei et al. 2012). These 
studies are just but few from among the many 
research highlights. 

In durum wheat, heat stress was observed to 
upregulate proteins involved in glycolysis, carbo- 
hydrate metabolism and stress-related pathway. 
Laino et al. (2010) discovered that the wheat 
seed proteins comprised a subset of allergenic 
proteins. In response to flood stress, cell wall- 
associated proteins were differentially regulated. 
In addition, wheat seedlings respond to flooding 
stress by restricting the cell growth to avoid 
energy consumption (Kong et al. 2009). Drought 
stress elicits different defence mechanisms in 
wheat. Remobilization of carbohydrate from the 
wheat stem to the grain induces differential 
expression of proteins at 20 days after anthesis 
and upregulation of several oxidative stress- 
response proteins, which might suggest more 
effective ROS scavenging (Bazargani et al. 2011). 
Using 2DE and MALDI-TOF, drought stress- 
response proteins related to glycolysis and 
gluconeogenesis were identified (Caruso et al. 
2009). Isobaric tags were used to follow changes 
in the relative protein abundance of Australian 
bread wheat that showed decrease in proteins 
involved in photosynthesis and Calvin cycle 
(Ford et al. 2011). The proteomic responses of 
wheat to winter and spring were different, as seen 
using 2D DIGE and MALDI-TOF/TOF. Proteins 
with higher abundance in the winter cultivar were 
observed to be involved in the regulation of stress 
response and development. Meanwhile in the 
spring cultivar, proteins involved in the restora- 
tion of cell division and plant growth were 
observed to be of higher abundance (Kosova 
et al. 2013). High temperature shortened the 
developmental programme of the endosperm of 
wheat without altering its sequential steps. 
Proteins active in biosynthesis and metabolism 
shifted to storage and protection against abiotic 
and biotic stresses (Hurkman et al. 2009, 2013). 
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Some important proteins of an elite Chinese 
cultivar known for having high yield, superior 
gluten quality and better biotic stress resistance, 
namely, H*-ATPases, glutathione S-transferase, 
ferritin and triose phosphate isomerase, were 
found to be upregulated in the salt-tolerant lines 
(Gao et al. 2011). The differentially expressed 
proteins of wheat somatic seedlings were 
observed to be salinity/drought responsive but 
not cultivar specific (Peng et al. 2009). All these 
studies have been providing a base from where to 
build an integrative view of rice and wheat 
responses. Such an integrative view will, sooner 
rather than later, allow a cellular level understanding 
of response to developmental and environmental 
cues. That understanding in turn will allow iden- 
tification of additional beneficial genes and 
alleles for crop improvement. 


11.8.2 Soybean 


Soybean is one of the most widely grown legumes 
in the world because it is an important source of 
vegetable oil and protein. However, it is also 
sensitive to abiotic stress. Some progress has 
been made through proteomics to understand 
how soybean can adapt to physiological and 
environmental stresses. One example of such 
proteomic studies showed that soybean roots had 
32 proteins differentially abundant under drought 
(Mohammadi et al. 2011). Irrespective of the 
severity of the water stress, soybean exhibited 
tight control over carbon metabolism to meet the 
cells’ energy demand for alleviating the effects of 
the stress (Hossain and Komatsu 2014). A com- 
parative proteome analysis in soybean leaves 
against P. sojae, a soil oomycetes that causes 
root rot, showed that proteins involved in energy 
functions were the most abundant and the pro- 
teins involved in transport and transcription 
were the least abundant (Zhao et al. 2013). In res- 
ponse to salt stress, glyceraldehyde 3-phosphate 
dehydrogenase, glutathione S-transferase (GST9 
and 10) and seed maturation protein PM36 were 
downregulated in the salt-tolerant soybean vari- 
ety but were more abundant in the salt-sensitive 
variety (Xu et al. 2011a). A database has been 
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created for soybean proteome data in response to 
stresses such as flooding, salinity and drought 
(Soybean Proteome Database; SPD; http://pro- 
teome.dc.affrc.go.jp/Soybean/). It is a repository 
for proteins from different organs, tissues and 
organelles as well as from plasma membrane, 
cell wall, chloroplast and mitochondrion. The 
data was obtained through 2D gel and non-gel 
methods (Ohyanagi et al. 2012). A recent review 
by Hossain and Komatsu (2014) highlights the 
achievements of proteomics in “untying the 
mechanism of flood and drought stress tolerance’. 


11.8.3 Maize 


Recent advancement in proteomics study of 
maize has also considered various biotic and abi- 
otic stresses. For the biotic stress of Aspergillus 
flavus pathogenesis, the proteomic analyses of 
young resistant rachis of maize were found to 
contain higher levels of abiotic stress-related 
proteins and proteins from phenylpropanoid 
metabolism, while susceptible ones contain 
pathogenesis-related proteins. When challenged 
with A. flavus, resistant rachis relied on constitu- 
tive defence systems, while susceptible rachis 
was more dependent on inducible defences 
(Pechanova et al. 2010). Proteome data combined 
with transcriptome and metabolome analysis for 
long-term nitrogen deficiency in maize at two 
developmental stages revealed major changes in 
photosynthesis and carbon metabolism. A number 
of these changes were similar to biotic and 
abiotic stress-mediated changes. However, such 
changes were different during the different 
developmental periods of N assimilation and 
grain filling. The integrated approach presented a 
useful set of results for improving the under- 
standing of whole-plant nitrogen economics, 
potentially affecting breeding and agronomy 
decisions (Amiour et al. 2012). The effect of 
growing two transgenic lines in one location over 
three seasons against the control, as well as the 
effect of genotypes grown in different location in 
one growing season, showed great difference in 
the transcriptome, proteome and metabolome 
profiles of maize (Barros et al. 2010). Through 
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2DE and iTRAQ, the physiological responses to 
drought such as early stomatal closure of drought- 
sensitive genotypes were explained. Results 
showed that drought causes upregulation of 
protective and stress-related proteins (chaper- 
ones and dehydrins; Benesova et al. 2012). Maize 
proteomics has also been carried out to under- 
stand some other important aspect commercially 
more relevant to maize per se, such as heterosis 
(Marcon et al. 2013; Guo et al. 2013), starch bio- 
synthesis and grain filling (Juarez-Garcia et al. 
2013; Jin et al. 2013), C-4 photosynthesis (Zhao 
et al. 2013; Majeran et al. 2010), ear growth 
(Liao et al. 2012) and GM/non-GM comparison 
(Balsamo et al. 2011). A recent review captures 
the role of proteomics in providing an insight into 
the biology of maize (Pechanova et al. 2013). 


11.8.4 Oilseed Crop Proteomics 


Plant seeds are an important source of renewable 
of food feed and energy, such as seed oil. In oil- 
seeds quantitative profiling of proteins expressed 
during seed filling in soybean, canola, castor and 
Arabidopsis has been studied through high-reso- 
lution 2D PAGE and MS approach. More than 
500 proteins have been identified using MS with 
an aim to identify genes or gene families of aller- 
gens and silence their expression so that they 
have minimal effect on human and animal health 
(Thelen 2009). A recent technique used in study- 
ing seed allergens requires no gel or sample 
fractionation. Isotopically labelled synthetic pep- 
tides, called AQUA peptides, serve as synthetic 
mimics to proteotypic peptides and are used as 
internal standards at set concentrations. This 
approach allowed absolute quantification of 
proteins in complex biological mixtures 
(Stevenson et al. 2010). 

Proteomic analysis of Brassica napus helped 
in revealing the dynamics of N remobilization 
through the action of several proteases that occur 
at the different stages of leaf senescence induced 
by N limitation/starvation (Desclos et al. 2009). 
Another comparative proteomic study in oilseed 
rape shed light on proteins involved in lateral root 
formation and maintenance of low phosphorus in 
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the tolerant genotypes (Yao et al. 2011). 
Proteomics also provided important new insights 
into the resistance mechanisms within B. napus 
against S. sclerotiorum and also opened the way 
for novel engineering of new B. napus varieties 
with enhanced resistance and better management 
against the most serious disease of the oilseed 
rape (Garg et al. 2013). More recently, a com- 
bined approach of proteomics with genomics was 
used to gain insights into the genetic programmes 
that operate during embryo development in 
B. napus L. (canola). This study identified 20,183 
genes and 2,607 proteins that are differentially 
expressed across different stages of embryo 
development and which share high degree of 
homology with Arabidopsis and display stage- 
specific and shared expression patterns associated 
with developmental and metabolic pathways 
(Venglat et al. 2013). Apart from oilseed rape, 
proteomics has also been used to study other 
oilseed crops. A nano-LC-MS/MS approach was 
used to study drought stress in the sunflower 
inbred lines. The productivity of sunflower, one 
of the most important oilseed crops, is severely 
affected by drought stress. Proteomics data 
indicated that changes in energy usage, water 
transport and ROS scavenging are important 
mechanisms for maintaining root growth as the 
soil dries (Ghaffari et al. 2013). Proteome analysis 
of Jatropha curcas, an important biodiesel 
crop, was performed on the seed oil bodies, and 
important differences are outlined in its oleosins 
(Popluechai et al. 2011). Oil body proteome 
determination methods were recently reviewed 
(Jolivet et al. 2013). Jatropha curcas seed oil, 
which can be utilized for biodiesel production 
upon transesterification (Kohli et al. 2009), is 
rich in phorbol esters in most genotypes 
(Popluechai et al. 2009). Phorbol esters are very 
toxic and are known to cause wide range of 
negative biological effects including cancer. An 
EASY nano-LC system coupled inline to an ESI- 
LTQ-Orbitrap Velos mass spectrometer was used 
to identify and study 1,103 proteins. This study 
led to a very important conclusion that suggested 
the plastids isolated from the endosperm of 
developing seeds do not synthesize phorbol esters 
(Pinheiro et al. 2013). The role of proteomics in 
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J. curcas was also reviewed (Raorane et al. 2013). 
Such studies provided insights into the major 
biosynthetic pathways for seed oil synthesis and 
certain unique features of the metabolic pathways 
at the whole proteome level (Hajduch et al. 2011). 


11.8.5 Roots and Tubers Proteomics 


Roots and tuber crops consist of beets, carrots, 
potatoes, sweet potatoes, cassava, yam, etc. 
Among these root crops, only potato and cassava 
are mostly studied using proteomics. Protein 
profiling in transgenic potato rich in protein sug- 
gested that proteome rebalancing might cause 
increased protein content in tubers (Chakraborty 
et al. 2010). In cassava, 2D PAGE and nano- 
electrospray quadrupole TOF tandem mass analy- 
sis identified 299 proteins, which were involved in 
processes related to energy, primary and second- 
ary metabolism, disease and defence, destination 
and storage, transport, signal transduction, protein 
synthesis, cell structure and transcription to cell 
growth and division (Sheffeld et al. 2006). Lim and 
co-workers reported the first comprehensive pro- 
teomics analysis of the indirect mode of action 
of phosphite-based fungicides (Phi), demonstrat- 
ing broad effects on plant defence and plant 
metabolism. The proteomics data and the micros- 
copy study suggested that Phi triggered a hyper- 
sensitive response that was responsible for 
induced resistance of potato leaves against 
Phytophthora infestans (Lim et al. 2013). Tuber 
development in potato consists of a series of 
biochemical and morphological processes at the 
stolon tip. Proteomics was used to study 
development-related processes and to gain further 
insights into the proteins that show altered expres- 
sion during in vitro potato tuberization (Yu et al. 
2012). Furthermore, cassava has been fortified for 
iron and protein by reducing the cyanogen levels 
in the roots (Guerrero et al. 2012). Another root 
crop, Ginseng’s root proteome analysis was done 
using the combination of internal amino acid 
sequencing using ESI QTOF MS (identified 91 
proteins) and the expressed sequence tag (EST) 
database from Panax ginseng and the protein 
database of model plants such as Arabidopsis 
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thaliana and Oryza sativa due to incomplete 
ginseng genome data (Kim et al. 2003). 


11.8.6 Vegetable Crop Proteomics 


Fruits and vegetables are extremely susceptible to 
decay and easily lose their commercial value within 
short periods of time after harvest. These post- 
harvest problems are due to various stresses such as 
attack of pathogens, heat treatment, cold storage 
and resistance mechanisms (Cook et al. 1999; Tian 
et al. 2001). Induced resistance can be caused by 
microbial, chemical and physical agents or natural 
compounds (Tian and Chan 2004). Proteomics and 
metabolomics revealed the molecular mecha- 
nisms of how fruits and vegetables respond to envi- 
ronmental stresses. However, extraction of the 
protein in fruits poses a challenge because of low 
protein content in the fruit tissues. Two successful 
extraction protocols in profiling fruit proteome 
were proposed, dubbed as the Homo and Phe meth- 
ods (Chan et al. 2007, 2008). These were found to 
be better than the TCA-acetone method in profiling 
the protein of fruits. Another emerging technique 1s 
the multiple reactions monitoring (MRM) that was 
used to detect and quantify protein biomarkers in 
complex biological samples of loquat fruit 
(Marquez et al. 2013). 

The unripe green and the ripe red tomatoes 
along with a ripening-inhibited (rin) mutant 
tomato were infected with Botrytis cinerea to 
assess postharvest reactions to the fungus at the 
proteome level. Both the fruit and fungal pro- 
teome were analysed 3 days postinfection (Shah 
et al. 2012). There were 186 proteins common 
among WT red ripe and rin red ripe-equivalent 
infected tomatoes. These proteins would thus be 
responsive to infection and not related to devel- 
opmental aspects of ripening. However, the 
limited infection of the WT mature green fruit by 
B. cinerea resulted in 25 and 33 % fewer defence- 
related tomato proteins than in red and rin fruit, 
respectively. In contrast, the ripening stage of the 
infected fruit did not affect the secreted proteomes 
of B. cinerea. The composition of the set of proteins 
and their putative functions further argued for a 
role in plant-pathogen interactions. 
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2D DIGE approach revealed 52 differentially 
expressed proteins in peach fruits exposed to heat 
treatment. These proteins were proposed to play 
a role in plant metabolism, defence and stress 
response, cytoskeleton organization, primary 
metabolism, transcription and translation reg- 
ulation and protein storage and catabolism. 
Additionally, through this study the small HSP 
family proteins in the heated peach were hypoth- 
esized to participate in the acquisition of tolerance 
against some chilling injury symptoms (Lara 
et al. 2009). Another study demonstrated that 
heat-treated peach fruit showed induction of 
ascorbate peroxidase, heat-shock proteins and an 
allergen protein (Zhang et al. 2011). Proteomic 
and metabolic profiling of mandarin citrus fruit 
using 2D PAGE (2-DE), gas chromatography 
coupled to mass spectrometry (GC-MS) and 
liquid chromatography quadrupole time-of-flight 
mass spectrometry (LC-QTOF-MS) depicted that 
reactive oxygen species (ROS) and lignin play 
important roles in heat treatment induced fruit 
resistance to pathogens and physiological disorders 
(Yun et al. 2013). In citrus fruit, 28 differentially 
expressed proteins showed that the main affected 
categories were cell rescue, defence, virulence 
and metabolism. The activity of antioxidant 
enzymes was extensively changed upon heat 
treatment, including superoxide dismutases and 
alcohol dehydrogenases (Perotti et al. 2011). 

Comparative transcriptomic and proteomic 
analysis of citrus fruit (Hirado Buntan pomelo) 
using digital gene expression profiling and 2DE 
revealed that calcineurin B-like protein (CBL), an 
interacting protein kinase complex, might be 
involved in signal transduction of low temperature 
stress. Meanwhile, fruit quality is likely regulated 
by sugar-mediated auxin and abscisic acid signal- 
ling. The accumulation of limonin, nomilin, meth- 
anol and aldehyde, together with the upregulated 
heat-shock proteins, COR15, and cold response- 
related genes have provided a comprehensive 
molecular view on the coordination of low- 
temperature stress responses in the fruits (Yun 
et al. 2012). 2DE and MS technology was used to 
identify the gene Prtcorp, which was induced by 
cold stress in citrus orange, which made it a well- 
known cold-tolerant fruit (Long et al. 2012). 
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In the fruit-ripening studies for papaya, six 
main categories such as cell wall synthesis, 
ethylene biosynthesis, climacteric respiratory 
burst, stress response, synthesis of carotenoid 
precursors and chromoplast differentiation were 
shown to be related to fruit ripening using 
2DE-DIGE (Nogueira et al. 2012). Traditional 
2D PAGE showed several cell wall-degrading 
enzymes also related to fruit ripening in papaya 
(Huerta-Ocampo et al. 2012). Another study on 
peach fruit highlighted the proteins involved in 
primary metabolism, ethylene biosynthesis 
secondary metabolism and responses to stress 
(Prinsi et al. 2011). A further study characterized 
the protein accumulation patterns in soft fruits of 
three peach and two nectarine varieties of melting 
flesh. This revealed that 164 of the 621 protein 
spots analysed displayed a differential accumulation 
associated with the softening process. Among 
them, only 14 proteins changed their accumula- 
tion in all the varieties assessed, including 
proteins mostly involved in carbohydrates and 
cell wall metabolism as well as fruit senescence 
(Nilo et al. 2012). In kiwifruit, proteomic analysis 
using 1D-SDS-PAGE and mass spectrometry 
identified 102 proteins during ripening. These 
were mainly involved in energy, protein metabolism, 
defence and cell structure (Minas et al. 2012). 


11.8.7 Fibre Crop Proteomics 


Almost all textiles are derived from cotton fibres 
and other natural fibres such as hemp, silk and 
wool. Like other crops, these plants are also 
subjected to physiological and environmental 
stresses, which is why proteomic studies were 
undertaken to understand adaptive mechanisms 
to abiotic and biotic stresses. An example of such 
studies is the comparative proteomic analysis of 
cotton, which revealed that certain proteins had 
putative functions for fibre differentiation and 
that ROS homeostasis may be the central regula- 
tory mechanism for cotton fibre morphogenesis 
through post-transcriptional and post-translational 
regulation (Kang et al. 2012). Proteomic analysis 
of cotton at three developmental stages (5, 10 and 
15 days post-anthesis) showed that 37 protein 
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spots changed significantly under low temperature 
stress. These proteins were involved in malate 
metabolism, cell wall loosening, cellulose syn- 
thesis, cytoskeleton, cellular response and redox 
homeostasis (Zheng et al. 2011). A global protein 
analysis using 2DE and tandem MS on flax seeds 
grown in an irradiated area identified 318 seed 
storage proteins, and out of it, 82 proteins were 
identified to be involved in central metabolism 
(Klubicoba et al. 2010). Proteins in hempseed 
which were suspected to have an important role 
in development, germination and storage were 
profiled using 2DE and LC-MS/MS. Results 
showed that 168 unique proteins are involved 
in regulation, metabolism, stress response, cyto- 
skeleton, binding function and protein synthesis 
(Park et al. 2012). Recently, a proteomic approach 
with MALDI-TOF/TOF MS was used to study 
the proteome of 21 days post-anthesis green 
cotton fibre to understand the molecular mecha- 
nisms related to pigment biosynthesis (Li et al. 
2013). Another similar comparative study with 
brown and white cotton, at different days post- 
anthesis, illustrated several aspects of pigment 
synthesis and fibre development (Li et al. 2013). 
Such studies demonstrated the presence of a 
complicated metabolic network and advanced the 
understanding of the molecular mechanisms of 
pigment biosynthesis in coloured cotton, which 
may provide new insights into the development 
of new cotton colour types by genetic manipulation. 


11.8.8 Forage Crop Proteomics 


The improvement of forage crops translates into 
the improvement of livestock. These crops are 
mainly used in food supply for grazing livestock 
such as cattle and sheep. Few examples of these 
include members of the Poaceae family, such as 
Lolium, Poa, Cynodon and Bromus. Herbaceous 
legumes such as Medicago are also included in 
forage crops. Sorghum and maize are also 
included in forage crops in some countries, 
through their use as silage for livestock feeds. 
Being used as fodder, these crops are of impor- 
tance and their improvement adds to the food 
security issue through healthy livestock. The 
focus on studies with forage crops centres on 
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the improvement of stress tolerance in different 
climates. A study on the effects of cold and frost 
on the proteome of Lolium perenne showed 
differences in the leaf proteomes of high and low 
frost tolerance (Bociana et al. 2011). Forty-two 
differentially abundant proteins were identified 
of which 35 were found to be chloroplast proteins, 
suggesting resistance to photo-inhibition and 
survival during cold acclimatization through 
continued food manufacture. Cold acclimatiza- 
tion was shown to be effective in increasing 
freezing tolerance in velvet bentgrass (Agrostis 
canina) by Espevig et al. (2012). Allowing the 
plants to experience a more tolerable cold climate 
enhanced their cold tolerance and increased 
survival under more stringent cold conditions. 
Additionally, when the proteomes of these accli- 
matized plants were observed, proteins involved 
in amino acid metabolism, energy production, 
stress defence and secondary metabolism were 
differentially abundant. Xu et al. (2011b) also 
studied heat-shock proteins (HSPs) and _ their 
relation to heat tolerance in grasses, focusing on 
HSP role in heat tolerance, allowing essential 
cellular processes such as amino acid and protein 
metabolism, photosynthesis and carbon allocation 
to continue, thus allowing the survival of the plant. 


11.9 Plant Proteomics Update: 
2009-2013 


Proteomics is a relatively young science consid- 
ering the term per se was only coined by Marc 
Wilkins in 1994. However, in almost two decades, 
tremendous headway has been made both in 
plants and animals. There has been appreciable 
progress in plant proteomics because of the fol- 
lowing reasons: (1) the completion of the genomic 
sequence of model plants such as Arabidopsis 
and rice, (2) the swift dynamic growth of mass 
spectrometry having provided immense opportu- 
nities for high-throughput proteomic studies that 
have gone from simple protein identification to 
analysing functional proteomics, (3) database 
construction and mining through bioinformatics 
to deal with large and complex set of proteome 
data and (4) the innovative and diverse ways in 
which the methodologies in protein and peptide 
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analysis have been applied. This has led to 
modern proteomics which required a multidisci- 
plinary expertise from various fields such as soft- 
ware engineering, statistics, protein chemistry, 
cell biology, biophysics and mass spectrometry 
(Have et al. 2011). 

The updates in plant proteomics made in the 
year 2004—2006 (Rossignol et al. 2006) surveyed 
about 200 original articles in plant proteomics. 
By the time the next review on the plant pro- 
teomics was published in the year 2008 (Jorrin- 
Novo et al. 2009), 50 more papers had been 
published. However, by December 2013 there 
were 3,806 plant proteomics research articles. 
Rapid advancement in and availability of MS 
instrumentation tools and techniques, and data- 
bases mining software, has made it possible to 
identify thousands of proteins within a single day 
(Cox and Mann 2007). 

In terms of plant species being studied, thale 
cress (Arabidopsis thaliana) and rice (Oryza 
sativa) are still the most popular models. 
However, by 2009 an advent into studying the 
proteome of other major crops such as maize, 
soybeans, sorghum, citrus fruits, cotton, grasses, 
oilseeds, etc., had already commenced. Most of 
the published papers have focused on profiling 
organs, tissues, subcellular proteomes, develop- 
mental processes and responses to biotic and 
abiotic stresses using differential expression 
strategy (Jorrin-Novo et al. 2009). If the strength 
of proteomics studies in the past was more on 
targeted and focused analyses of proteomes, 
nowadays, a shift on the global functional 
approaches is starting to roll on. Many studies 
have engaged in the subcellular localizations of 
protein, molecular interaction networks and large 
signalling complexes such as PTMs with the aim 
of a full understanding of the functions of protein 
in relation to biological functions. 

The technological platform whereon plant 
proteomics is based depends still on the classical 
2DE for protein separation and enhanced with 
differential staining via DIGE. The use of gel-free 
and second-generation quantitative proteomics 
techniques [i.e. shotgun proteomics (MudPiT, 
ICAT, ITRAQ, LC-MS/MS, MALDI-TOF, etc.)] 
to quantify and analyse differential abundance of 
protein has increased. In addition, proteomic data 
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is also verified and enhanced using other omics 
complements such as transcriptomics, metabolo- 
mics, ionomics, etc. Moreover, proteomics nowa- 
days is pushing the boundaries of computation, 
database generation, curation and mining, with 
advanced statistics for true positives to handle 
large and complex data sets efficiently. 

Advanced and sophisticated proteomic studies 
have been undertaken in rice and Arabidopsis due 
to the availability of data on most of the subcellu- 
lar localizations of proteins in different tissues. 
Other crops, particularly citrus fruits and oilseed, 
manifest rapid progress as well. One major high- 
light in fruit proteomics is the discovery of two 
new successful protein extraction methods which 
for some time had been a problem (Marquez et al. 
2013). Another highlight is the novel discovery of a 
software tool in oilseed which can now access 
large-scale predictions of both general and specific 
kinase and phosphorylation sites (Gao et al. 2010). 
However, the progress made in roots, tubers, for- 
age and condiment type of crops is slow. Efforts 
have been put to initiate and successfully accom- 
plish complete genome sequencing of other spe- 
cies such as M. truncatula and Brachypodium 
distachyon. With even more pace gathering in the 
coming years on proteome characterization, a 
wider understanding on the biological processes in 
plants may be rapidly achieved. 


11.10 Gaps in General Proteomics 
Versus Plant Proteomics 


Plant proteomics in general is lagging as com- 
pared to mammalian and microbial proteomics. 
Others have described plant proteomics as in its 
infancy (Agrawal et al. 2006; Annan, http://www. 
thecatalyst.org/proteomics/). Such disparity may 
be attributed to several factors. Firstly the micro- 
array facilities for mammalian and microbes have 
already been well established and accessible for 
postgenomic activities such as the proteomics and 
other “omics” technology as compared to plants, 
where there is lack of access in most plant 
laboratories. Another is the lack of information, 
of close cooperation among proteomic researchers 
and most importantly of suitable platform to 
globally discuss updates, successes and challenges 
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encountered in plant proteomics (Agrawal et al. 
2009). The organization that is supposed to serve 
as globally recognized platform with  user- 
friendly and centralized data base called the 
International Plant Proteomics Organization 
(INPPO) is new. The 1st INPPO World Congress 
was held in Germany in 2014 and brought 
together leading experts in plant proteomics from 
all over the world. This should provide an ideal 
venue for researchers in proteomics technolo- 
gies and applications on all aspects of plant 
proteomics to share and discuss their new find- 
ings and ideas. On the contrary, the Human 
Proteome Organization (HUPO) organized its 
13th Annual World Congress in Madrid in 
October 2014. It possesses an established data- 
base for a single species (human) and encourages 
the spread of proteomics technologies. Plant pro- 
teomics will have a long mile to tread before the 
proteome databases of important crops are gath- 
ered from among other nations to facilitate 
exchange of resources and information from 
among plant proteomic researchers. This sce- 
nario would be somewhat like an insurmountable 
challenge due to variations in individual plants 
(genus and species) being so vast. Moreover, the 
influence and role of industries who serve as 
sponsors or users of the technology play an 
important role. This is evident by the fact that 
research effort on disease treatment, drug discov- 
ery and medical diagnostics and forensics are 
more appealing to private sectors and/or investors 
who exploit much of the results of various pro- 
teomic initiatives (Kenyon 2002). On the con- 
trary the issue of food security 1s a public or 
government thrust with limited support from the 
private sector. Nonetheless, the impact of plant 
proteomics will be substantial if proteome data- 
bases can be brought to bear upon agriculture per 
se (Agrawal et al. 2009). 


11.11 Pitfalls, Challenges 
and Future Perspectives 


Proteomics techniques have unique strengths but 
also have several challenges and considerable 
limitations. Proteomics is very much a technology- 
driven field. The enormous range of exciting new 
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technology for separation and quantification of 
proteins has been applied to various intriguing 
biological queries. However, the stringency to 
validate proteomic data generation and analysis 
has somewhat been lenient, and as a result, 
numerous published findings with questionable 
quality and reproducibility are now available 
(Wilkins 2006). The challenge now is to establish 
standardized protocols and regulations to validate 
methods, statistical designs, computations and 
results before publications are made. At present, 
there is no common gel matrix that would enable 
researchers to reproducibly align protein patterns. 
No matter how much 2DE has been improved, 
the technology is still tedious and difficult. It did 
not develop and mature with the same breathtaking 
speed as that of the other downstream MS 
technologies (Huber 2003). Nevertheless the 
enthusiasm to perform proteomic studies has no 
signs of slowing down; that is why voluminous 
proteomic data are swamped incredibly fast in a 
database which mainly contains abundant and 
common proteins, whereas the rare regulatory 
proteins of low abundance (e.g. GTPases, kinases 
and phosphatases) or “difficult” proteins remained 
dawdling. As a challenge, technological improve- 
ments are considered and are needed to enhance 
quantitative and comparative large-scale proteomic 
studies (Zhang and Riechers 2008). The pro- 
teome is in a State of flux and is thus extremely 
complex. The analysis of the proteome is exac- 
erbated by the huge dynamic concentration range 
of proteins in the cellular environment. The chal- 
lenge to map plant proteomes is quite formidable 
due to the myriad post-translational modifica- 
tions of proteins in each subcellular location and 
tissue. The future development of more high- 
throughput tools for fractionation and suitable 
MS/MS approach, the discovery of more bio- 
markers and the link with “omics” and bioinfor- 
matics will surely make a difference (http://www. 
thecatalyst.org/proteomics/). Furthermore, if the 
collection of proteome data becomes systematic, 
it can help build a baseline information or data- 
bank that can be accessed globally and can be 
utilized in finding patterns and trends across mul- 
tiple experiments. The sophisticated, highly sensi- 
tive throughput technologies in plant proteomics 
need to shed light in the multifaceted, complex 
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nature of global proteins. This will provide a clear, 
fast dimensional analytics, as well as a clear 
understanding of how a plant biological system 
works (Have et al. 2011). This may sound a wish- 
ful thinking at present, but were not the sequenc- 
ing of human genome, Arabidopsis and the like, 
once in this formidable situation? 
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Abstract 

Plant epigenetics has become one of the hottest topics of research not only 
as a subject of basic research but also as a possible new source of beneficial 
traits for plant breeding. In addition, epigenetic mechanisms are also cru- 
cial to appropriate plant reactions to stress. Given the sessile lifestyle and 
the late differentiation of the germ line, plants can perceive stresses during 
vegetative growth and also memorize them, possibly by epigenetic mecha- 
nisms. Plants use three systems to initiate and regulate epigenetic gene 
regulation, like other higher organisms, which include DNA methylation, 
histone modifications, and RNA interference. New concepts are being 
evolved to show how these epigenetic components interact and stabilize 
each other. The role of epigenetic mechanisms in hybrid vigor and epigen- 
etic transgene silencing is also being explored. In this chapter, we have tried 
to highlight the epigenetic mechanisms that play key roles in plants. 


Keywords 
Epigenetics e DNA methylation ¢ Histone modification ¢ RNA interference 
e Transgene 


12.1 Introduction 


The term epigenetics was first introduced by 
Conrad Waddington (1942) which is referred to 
as the intermediate factor between the genotype 
and phenotype. The redefined term “epigenetics” 


Sciences, Central University of Punjab, 
Bathinda, Punjab 151 O01, India 
e-mail: anjanadurani @ yahoo.co.in 


Y.R. Ahuja B. Bahadur 
Department of Molecular Medicine, Vasavi Hospital Sri Biotech Laboratories India Limited, 
and Research Centre, Hyderabad, Plot No.: 21, Street No.: 2, Sagara Society, Banjara 


Telangana 500 004, India 


Hills, Hyderabad, Telangana 500 034, India 


Bir Bahadur et al. (eds.), Plant Biology and Biotechnology: Volume II; Plant Genomics 265 
and Biotechnology, DOI 10.1007/978-8 1-322-2283-5_12, © Springer India 2015 


266 


includes all mitotically and meiotically heritable 
changes in gene expression which are not coded 
in the DNA sequence itself (Tsaftaris et al. 2005). 
Epigenetics now refers to the heritable changes 
which are not related to the DNA sequence but 
can be reliably inherited from one generation to 
another (Chen et al. 2010). However, this 
definition is still under debate among epigeneti- 
cists and biologists (Bird 2007). Over the last 
decade, epigenetic studies have focused mainly 
on mammals, and plants have received much less 
attention. At present, the epigenetic research is at 
forefront of plant biology and molecular genet- 
ics. Plants are excellent models for studying epi- 
genetics since epigenetics has been suggested to 
play a role in both plant development and epigen- 
etic response. There is a fair amount of epigenetic 
information on certain model plants such as 
Arabidopsis, Oryza sativa, and Zea mays. Several 
mechanisms such as DNA methylation, histone 
modifications, siRNAs, paramutations, nucleo- 
some arrangements, and others have been identi- 
fied. Various enzymes and homologous factors 
participating in the regulation of plant epigenetics 
have been identified. Depending upon analytical 
resolution and precise data interpretation, epi- 
genetic code of the DNA sequence in plants 
might be determined in the near future. The three 
main hallmarks of the process of epigenetics in 
plants as well as in higher organisms are DNA 
methylation, histone modification, and RNA 
interference. These three molecular epigenetic 
mechanisms, their effects on plant development, 
and particularly their multiple implications have 
been discussed in this chapter. 


12.2 DNA Methylation 


DNA methylation was the established epigenetic 
mark to be studied. Methylation is a heritable 
enzymatic modification which results from the 
addition of a methyl group in the cyclic carbon-5 
of cytosine. In plants, DNA methylation is species, 
tissue, organelle, and age specific and is controlled 
by phytohormones and changes such as seed 
germination and flowering (Vanyushin 2005). It is 
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also influenced by various pathogens (viral, bacte- 
rial, fungal). In mammals, it is restricted to sym- 
metrical CG sequences (Bird 2002). However, in 
plants, DNA methylation is found at CG sites, 
CHG sites (H indicates A, C, or T), and CHH sites 
(asymmetrical site). DNA methylation is catalyzed 
by a family of conserved enzymes, DNA methy]I- 
transferases (MTases). Different types of DNA 
MTases include (a) maintenance methylases, 
which maintain stable cytosine methylation pat- 
terns through successive cell generations; (b) de 
novo methylases, which are able to transfer methyl 
groups to cytosines of unmethylated DNA; and (c) 
domain-rearranged methylases (DRMs) that are 
directed from short RNAs and specifically methyl- 
ate homologous genes in a process termed as 
RNA-directed DNA methylation (RdDM). 
Arabidopsis has been found to possess ten dif- 
ferent members of MTases, and mutations of 
each have been reported to affect normal devel- 
opment or other features of the plant (Tariq and 
Paszkowski 2004). Methylation can be removed 
from DNA by either passive or active mecha- 
nisms. Replacement of methylated cytosine with 
non-modified cytosine results in passive demeth- 
ylation. The participation of specific proteins that 
demethylate DNA _ sequences cause active 
demethylation. Passive demethylation can occur 
when methylated cytosines are replaced with 
non-modified cytosines during DNA replication 
(Vairapandi and Duker 1993; Jost et al. 1995; 
Tsaftaris and Polidoros 2000). Two Arabidopsis 
mutations, one interfering with maternal expres- 
sion of an imprinted gene (Choi et al. 2002) and 
the other inducing transcriptional gene silencing 
(TGS) of transgenes and endogenous homolo- 
gous loci (Gong et al. 2002), were mapped to the 
genes DEMETER (DME1) and ROSI, respec- 
tively, which encode DNA glycosylases that 
remove 5mC from DNA. The prominent roles 
proposed for methylation are (1) to provide a heri- 
table epigenetic mark directing the developmen- 
tal program of the organism (Holliday and Pugh 
1975; Regev et al. 1998; Wolffe and Matzke 
1999), (41) to provide genome defense against the 
activity of parasitic mobile elements (Yoder et al. 
1997), (411) to reduce background transcriptional 
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noise in organisms possessing a large number of 
genes (Bird 1995), and (iv) to “memorize” pat- 
tern activity of the gene by stabilizing gene 
silencing brought about by other mechanisms 
(Bird 2002). 

DNA methylation has been found to exert its 
effects through repression of gene expression. 
Methylation inhibitor studies have been reported 
to induce developmental abnormalities in several 
plant species including Oryza sativa, Triticale, 
Linum $usitatissimum, and Nicotiana tabacum 
(Richards 1997). A clear correlation between 
methylation status and gene activity has been dem- 
onstrated by numerous studies including tissue- 
specific genes, using different techniques. In fact, 
abundant evidence supporting a role of methyla- 
tion in suppression of gene expression comes from 
studies on regulation of transposable elements 
(Fedoroff 1996; Martienssen 1996) and transgene 
silencing in genetically engineered plants (Morel 
et al. 2000; Paszkowski and Whitham 2001; 
Fojtova et al. 2003; Matzke et al. 2004). 


12.2.1 DNA Methylation and Stress 
Response 


Studies have shown that under stress, DNA meth- 
ylation and RNA interference (RNA1) rapidly 
and reversibly modify plant genomic DNA, 
which further avoids the need for excessive 
genetic recombination and population diversity 
(Goll and Bestor 2005; Lukens and Zhan 2007; 
Boyko and Kovalchuk 2008). The plant somatic 
cells can “memorize” the experience of stress by 
epigenetic mechanisms (Bruce et al. 2007). 
Studies have shown that methylation of GC-rich 
sequences acts as a major coordinating point for 
many epigenetic mechanisms and ensures that 
the “memory” is transferred faithfully to the 
progeny (Mathieu et al. 2007). A large number of 
the genes modified by methylation possess a 
close relationship with the stress response in 
plants. It has been reported that the silencing of 
type I DNA methyltransferase NtMET1 in 
tobacco (Nicotiana tabacum) plant using an anti- 
sense RNA technique resulted in the upregulated 
expression of approximately 30 genes, 62.5 % of 
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which are related to responses to biotic or abiotic 
stress (Wada et al. 2004). Another example is of 
rice (Oryza sativa) variety Wase Aikoku 3, which 
is not resistant to the blight pathogen 
Xanthomonas oryzae pv. oryzae in the seedling 
stage but develops resistance in the adult plant. 
After artificial inoculation of seedlings and adult 
plants, methylation-sensitive amplified polymor- 
phism (MSAP) analysis showed that most poly- 
morphic loci possessed a higher level of 
methylation in adult plants than in seedlings (Sha 
et al. 2005). This may contribute to the develop- 
ment of adult plant resistance (APR) in rice 
plants. One of the demethylating reagent 
5-azadeoxycytidine can also induce resistance to 
the blight pathogen in Line-2 rice (Akimoto et al. 
2007). The MSAP technique was used to screen 
DNA fragments that were methylated differently 
between Line-2 and wild-type rice, and one 
clone, Xa21G (similar to the rice blight resis- 
tance gene Xa21), was selected. The promoter of 
Xa21G was not methylated at any cytosine resi- 
due in Line-2 rice but was fully methylated in the 
wild type, which was consistent with the high 
expression of Xa21G in Line-2 (Akimoto et al. 
2007). Therefore, demethylation of the Xa21G 
promoter enhances the expression of this gene 
and contributes to the resistance of the plants to 
pathogen. It was suggested that as a rule, rice 
blight pathogen resistance was related to genomic 
hypermethylation and hypomethylation of 
resistance-related genes (Akimoto et al. 2007). 
This rule is explained in more detail via TMV- 
infected tobacco. After infection with the virus, 
an overall increase in genomic methylation was 
accompanied by the hypomethylation of 
resistance-related leucine-rich repeat (LRR)- 
containing loci, and this was found to be consis- 
tent with the increased amount of recombination 
observed within the LLR-containing resistance 
gene (N-gene) (Kovalchuk et al. 2003; Boyko 
et al. 2007). It was found that the overall increase 
in methylation might favor genomic stability 
when the plant is attacked by the virus, whereas 
reduced methylation of the resistance genes may 
accelerate genetic recombination and thus result 
in the development of novel resistance genes 
(Engler et al. 1993). Therefore, both phenomena 
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are crucial for plant resistance. TMV resistance 
can be transferred to the progeny, and blight 
pathogen resistance and the related methylation 
status are inherited for at least nine generations 
(Boyko et al. 2007; Akimoto et al. 2007). 
Classical Mendelian genetics cannot explain the 
inheritance of — stress-derived phenotypic 
variations, which reminds us of the “acquired” 
genetics theory that was proposed by Lamarck 
two centuries ago. 


12.2.2 Metal lons and Plant Genomic 
DNA Methylation 


High levels of methylation protect DNA against 
endonuclease cutting and multi-copy transposi- 
tion favoring resistance to heavy metals. For 
example, cadmium (CdCl12) has been reported to 
change the methylation status of both oilseed rape 
and radish (Raphanus sativus L.) DNA (Yang 
et al. 2007; Filek et al. 2008). MSAP ratio of rad- 
ish genomic DNA increased in proportion to the 
intensity of the stress (Yang et al. 2007). The 
methylation level of rape genomic DNA has been 
suggested to be altered by chromium. Using 
MSAP and immuno-labeling techniques, the 
analysis revealed that the level of hypermethyl- 
ation in the rape genome was correlated positively 
with the stress dosage of chromium, which sug- 
gested de novo synthesis of methylated cytosine 
(Labra et al. 2004). However, in some reports, a 
contrary trend was observed. For instance, nickel, 
cadmium, and chromium stress can reduce cyto- 
sine methylation levels in clover and hemp by 
20—40 %, and the decrease is proportional to the 
stress dosage of the heavy-metal ions (Aina et al. 
2004). Chromium stress can therefore promote 
methylation in rape but induces a decrease in 
methylation in clover and hemp, suggesting that 
different methylation mechanisms for heavy- 
metal resistance exist in different plant species 
(Labra et al. 2004). MSAP has revealed that the 
methylation patterns of different individual plants 
within CCGG sites were similar, before and after 
heavy-metal stress, which indicates that the 
changes in methylation that were induced by stress 
were not distributed randomly (Aina et al. 2004). 
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Except for a few genes such as NtGPDL, we 
still know very little about methylation modifica- 
tion at specific genomic loci under heavy-metal 
stress. Bisulfite mapping has revealed that CG 
sites in the NtGPDL coding region were demeth- 
ylated rapidly in detached leaves, within | h, 
after aluminum stress. However, the methylation 
pattern of the promoter did not change. NtGPDL 
transcripts appeared within 6 h which is in agree- 
ment with the theory that DNA methylation lev- 
els are inversely related to the levels of gene 
expression (Choi and Sano 2007). The study also 
shows that the regulation of gene expression 
under stress is not limited to the promoter region, 
but can be achieved by changing the methylation 
status of the coding region (body methylation) as 
well. Zhang et al. (2006) also reported that genes 
that are methylated in their coding and promoter 
regions are generally expressed at high and low 
levels, respectively (Zhang et al. 2006). In addi- 
tion to aluminum, both salt and low temperature 
can induce demethylation of NtGPDL leading to 
upregulation of its expression (Choi and Sano 
2007). Furthermore, both aluminum and herbi- 
cides cause overproduction of reactive oxygen 
species (ROS), suggesting that oxidative damage 
as a result of stress might cause variations in 
DNA methylation patterns ( Cerda and Weitzman 
1997; Boscolo et al. 2003; Apel and Hirt 2004). 
Cadmium stress activates superoxide dismutase 
and catalase, altering the methylation status of 
genomic DNA. Selenium prevents the change in 
methylation levels and reduces the amount of 
ROS, suggesting that this is one of the causes of 
the change in methylation status (Filek et al. 
2008). It is interesting that pathogens do not 
affect the levels of NtGPDL methylation and 
expression (Choi and Sano 2007). Hence, it is 
apparent that DNA methylation is affected differ- 
ently by biotic and abiotic stresses. 


12.2.3 Epigenetics and Plant 
Genetic Engineering 


Silencing of the introduced transgenes in plants 
constitutes a major commercial risk which ham- 
pers the general economic exploitation of 
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transgenic plants. The transgenes are silenced at 
both transcriptional and _ translational level 
(Matzke and Matzke 2004). Transcriptional gene 
silencing is achieved by methylation of transgene 
promoter, whereas the posttranscriptional gene 
silencing is associated with the methylation of 
the coding sequence. There is a lot of commercial 
interest to avoid the epigenetically imposed 
transgene silencing. Of course, the most efficient 
way to avoid the silencing of transgenes is to 
carefully construct and analyze the transformants 
thoroughly at the molecular level. Studies of 
silenced transgenes and of silenced endogenous 
sequences have shown that repeated sequence 
arrays, particularly inverted repeats, are most 
prone to silencing. Selection of transgenic plants 
with single copy transgene insertions is the first 
line of defense against silencing. Even with this 
safeguard, the transgene might still be silenced if 
it inserts near heterochromatin or if it expresses 
an RNA trigger for silencing. But with an 
increased understanding of epigenetic patterning 
across the plant genome, and the identification of 
novel gene products that control gene silencing, 
the tools are now in place to manipulate more 
effectively plant transgene expression with the 
goal of improving agriculture. 


12.2.4 DNA Methylation 
and Somaclonal Variation 


Somaclonal variation is generally attributed to tis- 
sue culture-induced heritable changes manifested 
as cytological abnormalities, frequent qualitative 
and quantitative phenotypic mutations, sequence 
change, and gene activation and _ silencing. 
Quiescent transposable elements and retrotrans- 
posons may also activate, indicating that epigen- 
etic changes occur through the culture process. It 
has been observed that DNA methylation patterns 
are highly variable among regenerated plants and 
their progeny. DNA modifications are less stable 
in culture than in seed-grown plants. 

A somatic cell or small group of cells, bearing 
a unique genetic constitution or epigenetic modi- 
fication, may reproduce preferentially in culture. 
Any plant eventually regenerated from these cells 
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would probably be labeled as a variant acquired 
during the culture period. No doubt in reality it 
comprised a minuscule, possibly even undetect- 
able, proportion of the cells in the original explant 
(Matzke and Matzke 1995). As it is becoming 
more apparent that somatically acquired epigen- 
etic modifications in plants can be mitotically 
stable and meiotically heritable, more emphasis 
is given to variation in DNA methylation as a 
source of somaclonal variation. The first such 
evidence came from studies of regenerants from 
crown-gall tumor lines where changes in thymi- 
dine methylation as well as from demethylation 
were found to be associated with phenotypic 
variation (John and Amasino 1989), showing 
reactivation of silent Ac elements following tis- 
sue culture (Brettell and Dennis 1991; Peschke 
et al. 1991). Neuhuber et al. (1994) demonstrated 
that variation in antibiotic resistance markers in 
regenerated double transformants was due to the 
presence of the transgene complex in the original 
line as an epigenetic mosaic resulting from par- 
tial methylation. 

Sun et al. (2013) studied the phenotypic varia- 
tion in pollen viability of regenerants of Torenia 
after subculturing for one to nine generations 
(Sun et al. 2013). They found that pollen viability 
of regenerants continuously decreased with 
increasing subculture time. High concentrations 
of plant growth regulators applied to the 
Murashige and Skoog (MS) medium also resulted 
in diminished pollen viability. Furthermore, anti- 
biotic application during gene transformation 
also decreased pollen viability of the transfor- 
mants. However, the process of long-term culture 
did not significantly change pollen viability. The 
mean methylation level of regenerants showed a 
0.28-3.95 % decrease in seedlings subcultured 
in vitro for nine generations. Moreover, when the 
ninth subcultured regenerants with reduced pol- 
len viability were recovered in soil to get seeds, 
the pollen viability of seed-derived plants was 
similar to that of the wild type. They concluded 
that plant growth regulators, antibiotics, and the 
number of subculture generations influence 
somaclonal variations in Jorenia sp., and these 
variations in Torenia might have resulted from 
epigenetic changes. 
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12.2.5 DNA Methylation, Heterosis, 
and Hybrid Breeding 


Heterosis refers to the increased vigor of crosses 
between inbreds, species, or distantly related 
variants within a species in comparison with the 
parents. Although heterosis, or hybrid vigor, has 
been recognized for well over a century, its 
molecular basis is not been understood clearly 
(Birchler et al. 2010). However, recent progress 
has pointed that epigenetics has a role to play in 
heterosis (Groszmann et al. 2011). Shen et al. 
(2012) have provided valuable information about 
the underpinnings of heterosis by comparing 
global DNA methylation, small RNA (sRNA) 
expression, and gene expression patterns in 
hybrids in comparison with their parents. 
Variability in gene expression can be assessed 
through polymorphism of individual RNA 
amount (RAP) as well as the polymorphism of 
individual protein amount (PAP). Significant cor- 
relations between PAP indices and hybrid vigor 
for agronomic traits have been found (Leonardi 
et al. 1988, 1991). Studies have shown that genes 
controlling protein amounts and enzyme activi- 
ties, and particularly those with multiple effects, 
directly affect the expression of hybrid vigor. 
Examining the deviation in quantity of each RNA 
of every one of the two hybrids at the same 
mRNA in the two parental hybrids at the same 
developmental stage, the heterotic hybrid was 
found to have significant number of genes 
expressed over the quantity of the better parent at 
three stages. The work by Song and Messing 
(2003) provides evidence for altered regulatory 
effects in hybrids. DNA methylation could be 
considered as a genome-wide general regulatory 
mechanism that affects the expression of many 
genes playing an important role in the manifesta- 
tion of heterosis (Tsaftaris and Kafka 1998). 
Tsaftaris and Polidoros (1993) found a signifi- 
cant negative correlation between genome activ- 
ity and total DNA methylation. Studies have also 
been carried out to monitor the pattern of DNA 
methylation in maize parental inbreds and their 
hybrids using different techniques. The results 
indicated that (1) hybrids in general are less 
methylated than their parental inbreds; (2) heter- 
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otic hybrids are less methylated than related non- 
heterotic hybrids; and (3) old, low-yielding 
inbreds are highly methylated. Shen et al. (2012) 
established reciprocal crosses between 
Arabidopsis thaliana C24 and Landsberg erecta 
ecotypes. The Fl hybrids exhibited hybrid vigor, 
with increased growth and more siliques than 
either parent. Then, methyl Cseg was used to 
sequence the entire cytosine methylome of both 
parents and the reciprocal hybrids. It was found 
that both hybrids had increased cytosine methyl- 
ation compared with the parents. In addition, the 
growth of both hybrids was more sensitive to a 
chemical inhibitor of methylation in comparison 
with the parents, supporting a role for DNA 
methylation in growth vigor. 

Does methylation have a role in a hybrid’s 
genomic DNA, especially in tolerating stress, 
leading to more stable yield? Studies have shown 
that Fl hybrids are in general less methylated 
than their parental inbreds. The possible role of 
methylation in the expression of maize genes 
and performance of hybrids under different 
growth conditions has been examined in experi- 
ments with maize inbreds and hybrids grown 
under different plant densities (Tsaftaris and 
Kafka 1998; Tani et al. 2005). It has been con- 
cluded that the genotype, the developmental 
stage, and conditions of growth do affect the 
methylation status of genomic DNA. Stressful 
growth conditions result in more methylated 
DNA (less expressed), and in general, vigorous 
hybrids are more resistant to such density- 
induced methylation and suppression of genome 
activity in their genomic DNA. The resistance of 
the hybrid genome to methylation understress 
and, consequently, avoidance of suppression of 
many of its genes could be the reason underlying 
the high F1 yield and, more importantly, F, sta- 
ble yield. Shen et al. (2012) examined the tran- 
scriptomes of the parents as well as hybrids. 
They found that more genes were downregulated 
in the hybrids than were upregulated. Integration 
of the DNA methylome, the sRNAome, and the 
transcriptome data supports the idea _ that 
increased methylation of the circadian clock 
genes CIRCADIAN CLOCK ASSOCIATED] 
and LATE ELONGATED HYPOCOTYL, which 
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have been reported to be involved in heterosis, 
leads to their downregulated expression in the 
hybrids. Further, the authors found DNA meth- 
ylation and repression of flavonoid biosynthesis 
genes, with concomitantly increased auxin 
transport and signaling activity in the hybrids 
(Shen et al. 2012). 


12.3. Histone Modifications 


Histone modifications have emerged as vital epi- 
genetic modifiers that regulate the genetic infor- 
mation encoded in DNA. The basic unit of 
chromatin, the nucleosome, consists of 147 bp of 
DNA wrapped nearly twice around the octamer, 
containing two copies of each of core histones 
H2A, H2B, H3, and H4. Each core histone within 
the nucleosome contains a globular domain 
which mediates histone-histone interactions and 
also bears a highly dynamic amino acid terminal 
tail around 20-35 residues in length and is rich in 
basic amino acids. These tails extend from the 
surface of the nucleosome. Histone H2A has an 
additional ~37 amino acid carboxy-terminal 
domain protruding from the nucleosome. All his- 
tone proteins are modified inside the nucleus of 
the cell, but so far only few modifications have 
been studied. Nature uses several modifications 
in proteins to complement and expand its chem1- 
cal repertoire. In fact, more than 200 distinct 
posttranslational modifications (PTMs) have 
been identified to date, and the number and vari- 
ety of modifications continue to grow. Histones 
are subject to enormous number of posttransla- 
tional modifications including acetylation and 
methylation of lysines (k) and arginines (R), 
phosphorylation of serine (S) and threonines (T), 
ubiquitination, sumoylation, and biotinylation of 
lysines as well as ADP ribosylation. 

Generally, the acetylation of histones marks 
active, transcriptionally competent regions, 
whereas hypoacetylated histones are seen in tran- 
scriptionally inactive euchromatic or heterochro- 
matic regions. Histone methylation can be a 
marker for both active and inactive regions of 
chromatin. Two important targets for epigenetic 
modifications in plants are histone H3 lysine 9 
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(H3K9) and H3 lysine 27 (H3K27). H3K9 meth- 
ylation has been associated with heterochromatin 
formation or gene repression, while H3K27 
methylation profile is related to gene expression 
required for developmental decisions (Zhou 
2009). Methylation of lysine 4 of histone H3 
(H3K4) leads to activity and has been found pre- 
dominantly at promoters of active genes (Lachner 
et al. 2003). Methylation H3K9 marks silent 
DNA and is found throughout heterochromatic 
regions including centromeres and telomeres. 
Lysine methylation can be monomeric, dimeric, 
or trimeric. These observations have led to the 
idea of a “histone code,” although the degree of 
specificity of these codes may vary as particular 
combinations of histone marks do not always dic- 
tate the same biological function (Strahl and Allis 
2000). Important links between histone modifica- 
tions on one hand and DNA methylation on the 
other have been found in H3K9 methylation, a 
prerequisite for DNA methylation (Tamaru and 
Selker 2001; Jackson et al. 2002; Malagnac et al. 
2002), while DNA methylation can also trigger 
H3K9 methylation (Johnson et al. 2002; Soppe 
et al. 2002; Lehnertz et al. 2003; Tariq et al. 
2003). The histone deacetylases (HDACs), his- 
tone methyltransferases, and methylcytosine- 
binding proteins (Nan et al. 1998; Fuks et al. 
2003) have been reported to lead to the recruit- 
ment of DNA methyltransferases (Fuks et al. 
2000), although it is not yet clear what initiates 
the recruitment of the different epigenetic mod1i- 
fiers to their specific target sequences. 
Multimeric protein complexes have been 
reported to function as “cellular memory keys” 
that “lock” expression of genes and enable their 
inheritance over many cell division cycles both 
mitotic and meiotic (Narlikar et al. 2002). The 
multiprotein complexes involved in the regulation 
of developmental stages have been identified and 
studied in Drosophila. Two such main com- 
plexes, the polycomb (PcG) and the trithorax 
groups (trxG), were originally described in 
Drosophila (Mahmoudi and Verrijzer 2001). 
Both of these are known for their role in control- 
ling the expression of homeotic genes but with 
opposite effect: Polycomb group maintains them 
in an inactive state, whereas trithorax group 
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maintains them in active state. Similar multipro- 
tein complexes have recently been identified to 
execute conserved functions in plants. The PcG 
protein complex is the most studied in plants. 
This complex is involved in the control of the 
expression of homeotic genes that are essential 
transcriptional factors for the proper develop- 
ment of the plants . Although there is no struc- 
tural relevance between the PcG target genes in 
Drosophila and Arabidopsis, their functions are 
similar in terms of controlling the developmental 
processes. A main component of the PcG com- 
plex in plants is, again, methyltransferases like 
the MEDEA protein of Arabidopsis that specifi- 
cally methylates H3K27 leading to a silent state 
of chromatin (Reyes and Grossniklaus 2003). 
The regulatory mechanism of histone H3K9 and 
H3K27 modification provides important clues to 
unravel the molecular basis of establishment, 
inheritance, and erasure of epigenetic informa- 
tion regarding plant cell division and 
differentiation. 

Chromatin modifications and remodeling in 
plants regulate abiotic stress responses (Luo 
et al. 2012). Histone modifications may also 
have a decisive function in regulating plant 
responses to abiotic stresses. Hypoxia in rice 
seedlings leads to the acetylation of histone H3 
and to the conversion of dimethyl H3K4 to tri- 
methyl H3K4 in stress-responsive genes ADH1 
and PDC1 (Tsuji et al. 2006). These modifica- 
tions associated with active transcription were 
reversible after the removal of the stress. Tobacco 
and Arabidopsis cells have been observed to 
show dynamic changes in histone modifications 
in response to high salinity and cold stress, man- 
ifested by transient upregulation of H3 phospho- 
acetylation and histone H4 acetylation (Sokol 
et al. 2007). Enrichment of histone acetylation at 
H3K23 and H3K27 has been reported to occur in 
response to drought stress on the coding regions 
of drought stress-responsive genes, RD29B, 
RD20, and At2¢20880 (Kim et al. 2008). Histone 
H3K4 methylation patterns in response to dehy- 
dration stress in Arabidopsis has also been 
observed (van Dijk et al. 2010). Abscisic acid 
(ABA) and salt stress induce histone H3K9K14 
acetylation and H3K4 _ trimethylation but 
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decrease H3K9 dimethylation of some ABA and 
abiotic stress-responsive genes. This suggests 
that functionally related gene groups are regu- 
lated coordinately through histone modifications 
in response to abiotic stress in plant cells. All 
these studies show that epigenetic modifications 
of chromatin are implicated in plant stress 
responses. 


12.4 RNA Interference (RNAi) 


The discovery that double-stranded RNAs (dsR- 
NAs) can robustly silence genes in Caenorhabditis 
elegans and plants, RNA interference (RNAj1) has 
become a new paradigm for understanding the 
regulation of gene expression. RNAi is a gene 
downregulation mechanism in which small RNAs 
(of about 21—24 nucleotides) function to direct 
specific effector proteins (members of the 
Argonaute protein family) to a target nucleotide 
sequence by complementary base pairing. Then, 
the effector protein complex downregulates the 
expression of the targeted RNA or DNA. Small 
RNA-directed gene regulation systems have been 
discovered and named in plants, fungi, worms, 
flies, and mammalian cells (Lindbo 2012). 
Depending on the protein composition of the 
effector complex and the nature of the target 
sequence, the outcome can be either degradation 
of mRNA, translational repression leading to 
PTGS (posttranscriptional gene silencing) and 
mRNA alternative splicing, or genome modifica- 
tion leading to TGS, all of which result in silenc- 
ing the gene expression. In most of the organisms, 
it has been observed that nuclear RNAi predomi- 
nantly operates at heterochromatic loci, where it 
facilitates sequence-specific silencing through 
the direction of histone H3K9 methylation and/or 
cytosine methylation. However, differences are 
found in small RNA biogenesis, particularly in 
the subcellular localization of small RNA pro- 
cessing and loading of Argonaute proteins, and 
this could represent alternative approaches to 
regulating nuclear RNAi. As far as the co- 
transcriptional model is concerned, it is 
mechanistically unclear as to how the RNAi 
complexes regulate transcriptional machinery. 
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Outside constitutive heterochromatin, RNAi co- 
transcriptionally regulates some genes, and research 
is being carried out to determine whether this is a 
widespread phenomenon across organisms. 

TGS was observed when two _ different 
transgene complexes were introduced in sequen- 
tial steps into the tobacco genome (Matzke and 
Matzke 2004). Each complex encoded different 
proteins but had identical gene regulatory regions. 
The first transgene complex, which was stably 
active on its own, often became silenced in the 
presence of the second. The silenced transgenes 
acquired the most common genome modification 
associated with silencing, the DNA methylation. 
The silencing and methylation were abolished 
when the transgene complexes segregated from 
each other in progeny (Matzke et al. 1989; Park 
et al. 1996). As far as PTGS is concerned, it was 
discovered in two ways. One involved experi- 
ments to evaluate antisense suppression, a prom- 
ising approach at the time for selectively silencing 
plant gene expression. Antisense RNA encoded 
by a transgene should base pair to the comple- 
mentary mRNA of a plant gene, preventing its 
translation into protein. Although the control 
“sense” transgene RNAs are unable to base pair 
to mRNA and therefore should not induce silenc- 
ing, they often inexplicably did (Smith et al. 
1990). In another experiment, efforts to enhance 
floral coloration in petunia by overexpressing a 
transgene encoding a protein involved in pigment 
synthesis led to partial or complete loss of color. 
This resulted from coordinate silencing (“‘co- 
suppression”) of both the transgene and the 
homologous plant gene (Napoli et al. 1990; Van 
der Krol et al. 1990) which was later shown to 
occur at the posttranscriptional level (de Carvalho 
et al. 1992; van Blokland et al. 1994). PTGS has 
also been associated with DNA methylation of 
transgene sequences (Ingelbrecht et al. 1994). 
RNA-mediated gene silencing pathways have 
been reported to play essential roles in plant 
development, chromosome structure, and virus 
resistance. 

RNA has found its application in biotechnol- 
ogy, especially in the engineering of food plants 
which produce lower levels of natural plant 
toxins. RNAi has been used for the reduction of 
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the cyanogenic natural product linamarin in cas- 
sava plants (Siritunga and Sayre 2003). Another 
example is of cotton seeds which are rich in 
dietary protein but naturally contain the toxic ter- 
penoid product gossypol which is known to make 
them unsuitable for human consumption. Cotton 
stocks have been produced using RNAi with 
seeds containing lower levels of delta-cadinene 
synthase, a key enzyme in gossypol production, 
without affecting the enzyme’s production in 
other parts of the plant, where it is important in 
preventing damage from pests (Kumar et al. 
2006). The levels of allergens in tomato plants 
have successfully been reduced, and the precur- 
sors decreased of likely carcinogens in tobacco 
plants (Le et al. 2006; Gavilano et al. 2006). 
Other engineered plant traits include the produc- 
tion of non-narcotic natural products by the 
opium poppy resistance to common plant viruses 
and fortification of plants such as tomatoes with 
dietary antioxidants (Allen et al. 2004; Zadeh 
and Foster 2004). RNAi/PTGS provides protection 
from viral infections, prevents transposon mobi- 
lization, and regulates endogenous genes. No 
doubt the errors are on; no plant products using 
RNAi-based genetic engineering have yet passed 
the experimental stage. 


12.5 Genomic Imprinting 


Genomic imprinting is an epigenetic mechanism 
that produces functional differences between the 
paternal and maternal genomes and plays an 
essential role in the development and growth of 
organisms. There are approximately 156 genes in 
human genome that are subject to genomic 
imprinting where one parent copy of the gene is 
expressed while the other 1s silent (Luedi et al. 2007). 
There is a rapid and possibly active genome-wide 
demethylation in both male and female primor- 
dial germ cells resulting in the erasure of imprints 
(Mann 2001). The same mechanism also erases 
any aberrant epigenetic modification, preventing 
the inheritance of epimutations (Tam et al. 1994). 
The precise mechanisms responsible for epigen- 
etic erasure and demethylation in primordial 
germ cells are not yet clear. The initiation of new 
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imprints occurs subsequently during gametogen- 
esis and primarily during oogenesis (Murphy and 
Jirtle 2003). Imprinting is controlled by methyla- 
tion of stretches of regulatory DNA, the imprint- 
ing control regions (ICRs) (Feil and Berger 
2007). Interestingly, in order to get ready for the 
establishment of imprints, methyl marks on his- 
tones need to be removed (Ciccone et al. 2009). 

It has been evolved in flowering plants and 
mammals (Feil and Berger 2007; Meaney and 
Ferguson-Smith 2010) . In case of flowering 
plants, a double fertilization event in reproduc- 
tion takes place in which one sperm fertilizes the 
egg cell to form an embryo and a second sperm 
fuse with the central cell to give rise to endo- 
sperm. It is the endosperm tissue, where imprint- 
ing occurs in plants (Gehring et al. 2009). The 
first plant gene shown to be imprinted in endo- 
sperm was A SET domain polycomb group of 
genes MEDEA (MEA) . Its maternal expression 
is controlled by DNA methylation and demethyl- 
ation. Recently, there has been significant prog- 
ress in identifying imprinted genes as well as 
understanding molecular mechanisms of imprint- 
ing in plants. Up to date, approximately 350 
genes have been reported to have differential 
parent-of-origin expression in the plant endo- 
sperm (Arabidopsis, Zea, and Oryza). In 
Arabidopsis, many imprinted genes are regulated 
by the DNA METHYLTRANSFERASE] (MET1) 
and the DNA-demethylating = glycosylase 
DEMETER (DME) and/or their chromatin states 
regulated by polycomb group proteins (PRC2). 
There have been also some maternally expressed 
genes regulated by unknown mechanisms in the 
endosperm. 

Many researchers are currently interested in 
manipulating the development of endosperm 
through the epigenetic mechanisms that underlie 
the parental imprinting. Studies in Arabidopsis 
have shown that removal of the parental imprint- 
ing leads to over proliferation of the endosperm, 
which is a desirable trait for seed crops (Berger 
2003). Exploring the epigenetic mechanisms of 
seed development will eventually reveal the 
mysteries behind apomixis in which the fertile 
plant progeny is produced without double fertil- 
ization (Koltunow and Grossniklaus 2003). If 
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this mechanism can be applied to commercial 
crops, hybrids can be regenerated indefinitely, 
therefore overcoming the current limitations of 
plant breeding to maintain hybrid vigor for more 
than one single generation. 


12.6 Epigenetic Inheritance 
and Plant Breeding 


Phenotypic variation is directed by an interaction 
between genetic and environmental variation. 
However, the line between these two components 
is currently blurred by inherited epigenetic varia- 
tion, which is quite sensitive to environmental 
inputs. Careful study of epigenetic plant pheno- 
types directed plant geneticists to recognize the 
role of epigenetic inheritance systems in plant 
evolution (Kalisz and Purugganan 2004), as well 
as domestication and breeding (Tsaftaris and 
Polidoros 2000). The significance of epigenetic 
inheritance in plants has been gradually recog- 
nized over the past 50 years, and the researchers 
have found that the epigenetic phenotypes can 
result from (1) activation, excision, and translo- 
cation of transposon elements (McClintock 
1951); (2) allelic interactions known as paramu- 
tations (Brink 1956); (3) transgene silencing in 
plants (concurrently observed from _ several 
groups working with transgenic plants in the late 
1980s (Matzke and Matzke 2004); and (4) epial- 
leles of endogenous plant genes that control flo- 
ral induction and morphogenesis, seed 
development, and parental imprinting (Jacobsen 
and Meyerowitz 1997; Luff et al. 1999; Melquist 
et al. 1999). 

Epigenetic states in plants can be inherited 
through the transmission of epigenetic alleles 
(epialleles) over many generations (Kakutani 
2002). These heritable epigenetic alleles can be 
considered as a new source of variation that may 
produce novel phenotypes. One of the examples 
of the stability and heritability of alleles and 
epigenetic variation is found in the morphologi- 
cal variant of toad flax Linaria_ vulgaris 
described 250 years ago by Linnaeus. This vari- 
ant has played such a significant role in the 
history of botany. But ironically, it turned out to 
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be neither a new species (as Linnaeus thought) 
nor a mutation (as de Vries and others thought) 
but a fairly stable epimutation due to an altera- 
tion in DNA methylation of the Lcyc gene. What 
caused the methylation change in the first place 
is unclear, but once formed, it seems to have 
been transmitted, more or less steadily (although 
there is residual instability), for many genera- 
tions . Such epigenetic variants might have sig- 
nificant implications in plant breeding because 
the selection and breeding in these cases are 
based on heritable phenotypic variation within 
populations. 


12.7. Conclusions and Future 
Perspectives 


As it is evident from the above discussion that 
epigenetics has a lot of role to play in plant biol- 
ogy, however, this has not been fully appreciated. 
In contrast to the ease with which the DNA 
sequence can be analyzed, studying the complex 
patterns inherent in epigenetics poses many dif- 
ficulties. Greater knowledge of patterns of epi- 
genetic variation is expected to be highly 
informative in taxonomy and systematics, as well 
as population biology and conservation. 
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Introduction 


Abstract 

In this chapter, the basic concepts of bioinformatics and some of the 
important tasks in biological sequence and structure analysis are described. 
In biological sequence analysis, (1) pairwise and multiple sequence align- 
ments, (11) database search against a query sequence and (111) phylogenetic 
analysis are fundamental tasks. Similarly, in the structure analysis of bio- 
logical molecules, the most important tasks are (1) visualisation of 3D 
structures, (11) prediction of three-dimensional structures of proteins, (111) 
docking and (iv) molecular dynamics simulation. Further, the recent 
developments in bioinformatics, namely, ‘omes’ and ‘omics’ concepts 
along with their integration in systems biology, which are emerging areas 
in the postgenomic era, are also briefly described. Finally, some of the 
important applications of bioinformatics in various areas like agricultural, 
pharmaceutical and medical sciences are provided. 
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In the field of molecular biology, high-throughput 
deoxyribonucleic acid (DNA) sequencing meth- 


During the past two decades or so, biological data 
explosion has occurred, and the accumulation of 
biological data has got accelerated at a faster rate. 
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ods generate the genome sequence data of differ- 
ent organisms continuously in an exponential rate 
resulting in biological sequence databases of 
unimaginable sizes. These sequence databases 
along with the structure databases play an impor- 
tant role in life sciences, as they are needed for 
understanding more about the biological func- 
tions of an organism. They also have a lot of 
potential applications in biotechnology, agricul- 
ture and pharmaceutical industries. These 
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developments really triggered off a new field in 
biological sciences, namely, bioinformatics. 

Bioinformatics is an offshoot of biotechnol- 
ogy, and it is the application of computational 
methods to analyse biological data to obtain use- 
ful information (Lesk 2005; Baxevanis and 
Ouellette 2005; Mount 2003; Pevsner 2009). 
There are several bioinformatics tasks and appli- 
cations on sequence and structural analysis of 
biological data. In biological sequence analysis 
(Durbin et al. 1998; Gromiha 2010), (1) pairwise 
and multiple sequence alignments, (11) database 
search against a query sequence and (111) phylo- 
genetic analysis are fundamental tasks. Similarly, 
in structure analysis of biological sequences 
(Higgins and Taylor 2000; Orengo et al. 2003), 
the most important tasks are (1) visualisation of 
3D structures, (11) prediction of three-dimensional 
structures of proteins, (111) docking and (iv) 
molecular dynamics simulation. There are many 
popular and standard tools available for the 
above-mentioned biological sequence and struc- 
tural analysis. 

In the postgenomic era, the high-throughput 
studies like ‘omes’ and ‘omics’ concepts along 
with their integration in systems biology have 
emerged as the recent developments in bioinfor- 
matics. For example, the complete DNA content 
of an organism is called the genome, and the study 
of genomes becomes genomics. Similarly, there 
are many “omes’ such as transcriptome, proteome 
and metabolome, and their corresponding ‘omics’ 
studies are known as transcriptomics, proteomics 
and metabolomics, respectively. These high- 
throughput studies are possible because of the 
technological advancements that provide high- 
precision equipments for making the biotechno- 
logical measurements and analysis leading to the 
large amount of data of ‘omes’ and their analysis 
‘omics’. The integration of the entire high- 
throughput ‘omes’ and ‘omics’ data provides a 
system-oriented approach to understand biologi- 
cal systems, which forms the subject matter of the 
new field called ‘systems biology’. 

Bioinformatics has a lot of potential applica- 
tions in various fields including agricultural, 
pharmaceutical, clinical and medical sciences. It 
has been used directly/indirectly for computer- 
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aided drug/inhibitor designing, genetically modi- 
fied plants and animals, gene therapy, molecular 
evolution and so on. 

In this chapter, we describe about the basic 
concepts of bioinformatics and some of the 
important tasks in biological sequence and struc- 
tural analysis. Further, we describe briefly about 
the recent developments in bioinformatics, espe- 
cially the high-throughput ‘omes’ and ‘omics’ 
concepts along with their integration in systems 
biology, which are emerging in the postgenomic 
era. Finally, we provide a few important applica- 
tions of bioinformatics. 


13.2 Bioinformatics: Definition 
and Basic Concepts 


It is interesting to note that Krishnamurthy (2003) 
in his book entitled An Advanced Textbook on 
Biodiversity has already provided a general defi- 
nition to informatics. Krishnamurthy had quoted 
the following three lines from the poem entitled 
‘Choruses from the rock’ written by the famous 
poet T.S. Eliot in 1934, which reads as 

Where is the Life we have lost in living? 

Where is the Wisdom we have lost in knowledge? 


Where is the Knowledge we have lost in 
information? 


To these lines, Krishnamurthy had added two 
more lines as 


Where is the Information we have lost in data? 
Where is the Data we have lost in database? 


And here, I wish to add one more line as 


Where is the Database we have lost in the Internet? 


The above lines clearly indicate the definition of 
informatics of any kind, whether it is bioinfor- 
matics, cheminformatics, pharmacoinformatics, 
immunoinformatics, medical informatics and so 
on. In any subject, informatics begins its role 
when the amount of data becomes large and the 
size of the data grows in a rapid manner. In infor- 
matics, the first step is to classify the data and 
store them in organised and integrated databases 
for easy retrieval and use with the help of com- 
puters. Next step is to analyse the data using any 
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suitable computational methods to obtain useful 
and pertinent information. From the information, 
one has to derive knowledge, and from the knowl- 
edge, one has to attain wisdom by taking wise 
and proper decisions resulting in efficient actions. 
So, the definition of informatics is not only to 
classify, store and retrieve the data but also to 
analyse the data to get useful information, and 
from that, the knowledge has to be derived and 
also to attain the wisdom. 


13.2.1 Need for Bioinformatics 


Before we define bioinformatics, let us know 
about the major reasons behind the need for bio- 
informatics. The first and foremost reason for any 
kind of informatics is the existence of large 
amount of data and its rapid growth. In biological 
sciences, there has been a data explosion that 
occurred due to the high-throughput DNA 
sequencing projects that generate the genome 
sequence data of different organisms, which has 
started around the year 1990. The draft genome 
of humans was completed in the millennium year 
2000 and was announced (Lesk 2005) on 26 June 
2000. The importance of this discovery becomes 
evident from the fact that the top journals like 
Nature and Science had published a special issue 
on “The human genome’ in their 15th February 
2001 and 16th February 2001 issues, respec- 
tively. However, many other genome sequences 
data of microbes, plants and animals are also 
available for analysis to date (Lesk 2005; 
Baxevanis and Ouellette 2005; Gregory 2005; 
Pevsner 2009). This is reflected in the size of the 
GenBank database, which is one of the popular 
DNA sequence databases maintained by the 
National Centre for Biotechnology Information 
(NCBI), USA. There is an exponential growth of 
the size of the GenBank database, which is evi- 
dent from its statistics (http://www.ncbi.nlm.nih. 
gov/genbank/statistics) from 1990 to date. The 
current release #201 of GenBank database con- 
tains 159,813,411,760 (.e. approximately 159 
billion) bases corresponding to 171,744,486 (.e. 
approximately 171 million) sequences, as of 
April 2014. 
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On the other hand, the experimentally determined 
three-dimensional (3D) structures of biological 
macromolecules available in structure databases are 
very less in number compared to the sequence data- 
bases. For example, Protein Data Bank (PDB) 
(http://www.rcsb.org/pdb/home/home.do), which is 
one of the most popular 3D structure databases of 
biological macromolecules including proteins, 
nucleic acids and complexes, contains 99,293 struc- 
tures only as of April 2014. This clearly indicates 
that the sizes of the sequence databases are huge 
compared to structure databases. The reason for this 
disparity is mainly because the experimental deter- 
mination of 3D structures of biological macromol- 
ecules (Higgins and Taylor 2000; Gautham 2006) is 
tedious and time consuming compared to that of the 
experimental high-throughput sequencing tech- 
niques available at present. Hence, the biological 
data explosion and the disparity in the size of the 
sequence and structure databases may be regarded 
as the two major reasons behind the need for 
bioinformatics. 


13.2.2 What Is Bioinformatics? 


From the broad definition of informatics and the 
reasons for the need of bioinformatics, one can 
easily define bioinformatics (Lesk 2005; Mount 
2003). Bioinformatics is an interdisciplinary sub- 
ject, which requires the knowledge/expertise 
from diverse backgrounds, including all branches 
of biological sciences, computer science, mathe- 
matics, statistics, physics and chemistry. In a 
popular way, Arthur Lesk (2005) in his book on 
Bioinformatics provided a simple definition as 
‘The marriage of biology and computer science 
has created a new field called bioinformatics’. 

In general, bioinformatics is the application of 
various computational methods from mathemati- 
cal, statistical and computer sciences to analyse 
biological data to obtain useful information. We 
use computer programmes to make inference 
from the biological data, to make connections 
among them and to derive useful predictions. 
From these predictions, one has to derive 
knowledge and attain wisdom by taking wise and 
proper decisions. 
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13.2.3 Understanding Gene 
Function: Importance of 3D 
Structures 


From bioinformatics’ point of view, we look at an 
organism in the form of data. An organism must 
have one genome, fixed number of chromosomes 
in the nucleus of each cell, the total number of 
nucleotide bases in all the chromosomes and so 
on. For example, in the case of humans (Homo 
sapiens), which is the complex eukaryotic organ- 
ism, there exists one genome, 24 haploid chro- 
mosomes (22 autosomes, X and Y_ sex 
chromosomes) of nuclear DNA along with the 
mitochondrial organelle DNA in each cell, a total 
of approximately 3.3 x 10° base pairs and approx- 
imately 21,000 genes predicted by gene predic- 
tion methods. In the case of plants, in addition to 
nuclear and mitochondrial DNA, genome 
includes plastid DNA, which is responsible for 
photosynthesis. For example, in the case of rice 
(Oryza sativa) genome, there are 12 chromo- 
somes, plastid and mitochondrial DNAs. In fact, 
immediately after the experimental determina- 
tion of the genome sequence of an organism, the 
very first result predicted is the possible number 
of coding genes, which is obtained only from the 
gene prediction algorithms of bioinformatics. 

It is also interesting to note that most of the 
mapping and sequencing technologies were 
developed from studies of simpler organisms 
such as microbes, plants and animals. Some of 
the representative model organisms of each major 
type along with their genome sizes are given in 
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Table 13.1. Itis interesting to note from Table 13.1 
that the genome size of the mouse and human is 
of the same order of 2.8 Gbp and 3.3 Gbp, respec- 
tively. This clearly indicates that for any organ- 
ism, the organism size does not have any relation 
to its genome size. For any organism, the total 
amount of DNA of the haploid genome measured 
in picograms (pg) is called its “C-value’ (Gregory 
2005). The lack of a consistent relationship 
between the C-value and the complexity of an 
organism is called the “C-value paradox’. Further, 
we know that the mouse has been used as a model 
organism for humans in carrying out many 
in vivo experiments for several decades without 
knowing that they do have the same order of 
genome size. 

In order to understand the various functions of 
an organism, we need to understand the functions 
of their coding genes. In order to find the function 
of a gene, first, we have to obtain the correspond- 
ing protein expressed by the gene, as coded by it, 
using the appropriate standard genetic code. The 
function of the protein depends on the 3D or 
folded structure of it, assumed in typical biologi- 
cal environments surrounding it. Hence, under- 
standing protein structure will be essential in 
determining gene function. Once the function of 
the protein is known, then it is assigned as the 
function of the associated gene. 

This is the reason, why we have compared the 
size of PDB, the 3D structure database, with that 
of the GenBank, the DNA sequence database in 
Sect. 13.2.1. Since the size of the 3D structure 
database is small and also the experimental 


Table 13.1 Genome of sizes of non-human model organisms 


SI. no. Organism type Organism name Chromosomes! (haploid) Genome size* 
1. Bacterium Escherichia coli — 4.6 Mbp’ 
2. Yeast Saccharomyces cerevisiae 16 (1-16) 12.4 Mbp 

3. Fruit Fly Drosophila melanogaster 5 (2,3,4,X, Y) 122.6 Mbp 

4. Roundworm Caenorhabditis elegans 6 (1-5,X) 100.2 Mbp 

5. Plant Arabidopsis thaliana 5 (1-5) 135.0 Mbp 

6. Animal Mus musculus (mouse) 21 (1-19,X,Y) 2.8 Gbp? 


Source: ‘http://users.rcn.com/jkimball.ma.ultranet/BiologyPages/C/Chromosomes.html 
*http://users.rcn.com/jkimball.ma.ultranet/BiologyPages/G/GenomeSizes.html 


“1 Mbp (Mega base pairs) = 10° base pairs 
>] Gbp (Giga base pairs) = 10° base pairs 
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determination of 3D structures of biological mac- 
romolecules is tedious and time consuming, 
computational algorithms for 3D structure pre- 
diction play an important role in predicting 3D 
structures from the sequences of proteins for 
which experimentally determined 3D structures 
are not available. 


13.3. Bioinformatics Tasks: 
Biological Sequence 
and Structure Analysis 


In this section, some of the basic bioinformatics 
tasks in sequence and structure analysis of bio- 
logical data are described. The important tools 
used for various tasks related to biological 
sequence and structural analysis are also dis- 
cussed. Before that, it is necessary to know about 
how to download DNA and protein sequences, 
structures and scientific articles from the respec- 
tive databases, required for any analysis. DNA 
sequences can be downloaded either directly 
from GenBank (http://www.ncbi.nlm.nih.gov/ 
genbank/), EMBL (http://www.embl.de/) and 
DDBJ (http://www.ddbj.nig.ac.jp/) databases or 
from the genome databases. Protein sequences 
can also be downloaded from the Protein-NCBI 
(http://www.ncbi.nlm.nih.gov/protein), PIR 
(http://pir.georgetown.edu/) and UniProt (http:// 
www.uniprot.org/). Scientific Literature can be 
accessed from PubMed (http://www.ncbi.nIm. 
nih.gov/pubmed), PubMed Central — PMC 
(http://www.ncbi.nlm.nih.gov/pmc/) and also 
directly from the home pages of many journals 
publishing articles with open access. 


13.3.1 Sequence Analysis Tasks 


In biological sequence analysis, the two basic 
tasks are sequence alignment and database search 
(Lesk 2005; Pevsner 2009). Suppose we get a 
DNA or protein sequence for our analysis, either 
the new sequence may be isolated, purified and 
extracted from a source organism in the lab or 
may be downloaded from a database. We know 
that in nature, new sequences are adapted from 
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the pre-existing sequences rather than invented 
de novo (Durbin et al. 1998). There exists a sig- 
nificant similarity between a new sequence and 
already known sequences. This is very fortunate 
for computational sequence analysis, as any new 
sequence can be studied by comparing it with the 
existing sequences. 

Now, if we have some clues about the new 
sequence, then the basic question is whether this 
DNA or protein sequence is related to any other 
DNA or protein sequence of our choice. The 
relatedness of a pair of sequences suggests that 
they may be homologous (Durbin et al. 1998; 
Pevsner 2009). By the homology of sequences, 
we mean that the two sequences and the organ- 
isms in which they occur are descended from a 
common ancestor, with the implication that they 
do have shared ancestral characteristics. If two 
related sequences are homologous, then we can 
transfer information about structure and may also 
function through homology. It 1s important to 
note that the ‘similarity’ of two sequences simply 
means that the measurement of resemblance and 
difference is independent of the source (..e. 
organism) of resemblance. Homology implies 
similarity and the converse is not true always. 
Further, if two amino acid residues in the same 
position of both the sequences are the same (i.e. 
D vs D), then it is known as ‘identity’ and have 
maximum score in the scoring matrices, such as 
BLOcks SUBstitution Matrix (BLOSUM), used 
for sequence alignment. For example, the score 
for D vs D in BLOSUMSO scoring matrix is 8. 
Suppose if two amino acid residues in the same 
position of both the sequences are conserved (i.e. 
D vs E), then they possess ‘similarity’ and have 
positive score in the scoring matrices. In the case 
of D vs E, the score in BLOSUMSO scoring 
matrix is 2. Again, identity implies similarity and 
converse 1s not true. 

The question now becomes how to find the 
relatedness of two sequences, and the answer is 
simply by aligning the sequences, which is 
known as pairwise sequence alignment (Mount 
2003; Pevsner 2009). Since genome sequences of 
many organisms become available, the task of 
finding out how DNA/proteins are related within 
an organism and between organisms becomes 
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increasingly fundamental to our understanding of 
life. Comparison of DNA (or nucleotide) 
sequences is important in (1) confirming the iden- 
tity of a DNA sequence in a database, (11) search- 
ing for polymorphisms (SNPs) and (111) analysing 
the identity of a cloned cDNA fragment. More 
often, protein sequence alignment is more infor- 
mative than DNA alignment. Protein sequence 
comparisons can identify homologous sequences 
from organisms that last shared a common ances- 
tor over 1 billion years ago (BYA). However, 
DNA _ sequence comparisons typically allow 
look-back time up to about 600 million years ago 
(MYA) only. Further, with the whole genome 
sequences of many organisms, it becomes the 
fundamental task to find out how proteins are 
related within an organism (known as paralogues) 
and between organisms (known as orthologues) 
for our understanding of life. 

Some of the important types of sequence com- 
parisons used for biological sequence analysis 
(Lesk 2005) are given below: 

(1) Pairwise sequence comparison is the assign- 
ment of character by character match 
between two sequences. Two types of pair- 
wise alignments are useful. (a) Global align- 
ment aligns end to end of two sequences as 
a whole, and (b) local alignment aligns one 
or more portions or subsequences of two 
sequences. 

Motif search finds matches of a short 
sequence in one or more regions internal to 
a long sequence. 

In multiple sequence alignment, the algo- 
rithm tries to align all the sequences (more 
than two in number) that are given as query. 
This type of alignment is useful to find out 
conserved regions among proteins. For 
example, the multiple sequence alignment of 
the insulin protein sequences of various 
organisms may provide the regions of resi- 
dues in the given set of query sequences 
which are conserved, while the other resi- 
dues are not. Finding these regions 1s helpful 
to determine the function of the proteins for 
which sequence similarity is not significant. 

Apart from these, it may also be interesting to 
search for similar sequences through databases 
for any given new sequence, which is known as 


(11) 


(iii) 
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database search. But comparing the sequences 
with other sequences or through sequence data- 
bases is not a simple process that can be done 
manually, as the size of the sequence databases is 
very huge and hence computational algorithms 
are used to perform the analysis. Biological 
sequence analysis algorithms are used in aligning 
a new sequence with the millions of available 
sequences, and by using similarity between the 
sequences, one can predict the structure and 
function. Let us describe few sequence analysis 
tasks mentioned above with suitable examples. 
13.3.1.1 Pairwise Sequence 
Alignment 
The most fundamental sequence analysis task is 
aligning two sequences (either DNA or protein) 
in a pairwise manner and to find whether the two 
sequences are related or not. If there is any simi- 
larity, we can be sure of the structure and func- 
tion of the new protein. The key issues involved 
in pairwise sequence alignments are: 
i) Types of alignment needed and gap 
penalties 
Global alignment: It is an end-to-end 
matching of two entire sequences using the 
Needleman—Wunsch algorithm (Needleman 
and Wunsch 1970; Durbin et al. 1998). This 
kind of alignment is used when the sequence 
sizes are close to each other and the algo- 
rithm tries to align all the residues (letters) 
end to end. Gaps are introduced to make the 
alignment better and are denoted by 


hyphens. 
Local alignment: It is the matching of 
portions or subsequences from _ two 


sequences using the Smith—Waterman algo- 
rithm (Smith and Waterman 1981; Durbin 
et al. 1998). This is used when the sequences 
are distantly related or partial sequence 
matching is required for analysis. 

Gap penalties: While aligning, if a gap is 
to be introduced in any one of the sequences 
for optimal alignment, a penalty is given 
(Durbin et al. 1998). The standard penalty 
associated with a gap of length g is given 
either by a linear score r(g) = —gd or an 
affine score a(g) = —d — (g—/)e, where d 1s 
the gap open penalty (gap is introduced) and 
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e is the gap extension penalty (gap is 
extended). 

Scoring System used to rank alignments 

A score is needed as a measure of similarity 
when a residue from the first sequence is 
aligned with a residue in the second sequence. 
The residues may be identical (same amino 
acids), similar (amino acids with similar 
properties) or different (amino acids with 
entirely different properties), and the score is 
given accordingly using appropriate substitu- 
tion matrices. The values in these matrices 
are derived using the probability of occur- 
rence derived from the statistics based on 
known sequences. Point accepted mutation 
(PAM) and BLOcks SUBstitution Matrix 
(BLOSUM) are two popularly used scoring 
matrices. They are given different numbers to 
denote the different levels of similarity or 
relatedness. For example, PAM40 and 
BLOSUM90 are equivalent which are 
obtained from _ closely _ related/similar 
sequences. PAM500 and BLOSUM30 are 
obtained from distantly related sequences. 
When we have no idea of similarity before- 
hand, PAM250 and BLOSUM62 are used as 
default substitution matrices. 

(iii) Algorithms used to find optimal (or good) 
scoring alignment 

Dynamic programming algorithms are 
used for pairwise alignment, in which the 
alignment score is dynamically given as an 
additive score based on the previous align- 
ment. These types of algorithms are guaran- 
teed to find the optimal scoring alignment or 
set of alignments. 

Statistical methods used to evaluate the sig- 
nificance of an alignment score 

The statistical quantity, namely, E-value, is 
used commonly to evaluate the significance 
of the scores obtained. 

Let us consider the sequence of myoglobin of 
humans as query, and the question is to find how 
much similar is the query sequence with haemo- 
globin subunit beta sequence of human. One has 
to perform pairwise alignment of these two 
sequences using any pairwise sequence align- 
ment tools such as EMBOSS Needle, EMBOSS 
Water and LALIGN available at http://www.ebi. 


(il) 


(iv) 
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ac.uk/Tools/psa/, BLAST (http://blast.ncbi.nlm. 
nih.gov/Blast.cgi) and so on. First, the two 
sequences have to be downloaded from the rele- 
vant databases and are given below: 
>gil4885477|refINP_005359. 1| 
[Homo sapiens | 
MGLSDGEWQLVLNVWGKVEADIPGHG 
QEVLIRLFKGHPETLEKFDKFKHLKSED 
EMKASEDLKKHGATVLTALGGILKK 
KGHHEAEIKPLAQSHATKHK 
IPVK YLEFISECHQVLQSKHPGDFGAD 
AQGAMNKALELFRKDMASN 
YKELGFQG 
>2114504349|lrefINP_OOOS09. 11 
subunit beta [Homo sapiens] 
MVHLTPEEKSAVTALWGKVNVDEVG 
GEALGRLLVVYPWTQRFFESF 
GDLSTPDAVMGNPKVKAHGKK 
VLGAFSDGLAHLDNLKGTFATL 
SELHCDKLHVDPENFRLLGNVL 
VCVLAHHFGKEFTPPVQAAYQKV 
VAGVANALAHKYH 
The NCBI BLAST input and output of the pair- 
wise alignment of the above two sequences are 
given in Figs. 13.1 and 13.2, respectively. The 
following steps have to be carried out in giving 
the BLAST input shown in Fig. 13.1: 

(1) Go to BLAST website at http://blast.ncbi.nlm. 
nih.gov/Blast.cgi and choose ‘protein blast’. 

(11) Copy the ‘myoglobin’ sequence into the 
‘Enter Query Sequence’ box. 

(ii) Check the ‘Align two or more sequences’ box. 

(iv) Copy the ‘hemoglobin subunit beta’ sequence 
into the “Enter Subject Sequence’ box. 

(v) Choose ‘BLOSUMS0O’ matrix in ‘Scoring 
parameters’ box in the ‘Algorithm parame- 
ters’ section. 

(vi) Click ‘BLAST’ button to run BLAST. 

In Fig. 13.2, the graphic summary and align- 
ments are shown as BLAST output. 


myoglobin 


haemoglobin 


13.3.1.2 Database Search 

The database search for the above-mentioned 
human myoglobin sequence as query sequence is 
carried out using BLAST, and the screenshots of 
both input and output are given in Figs. 13.3 and 
13.4, respectively. The following steps have to be 
carried out in giving the BLAST input as shown 
in Fig. 13.3: 
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> NCBI/ BLAST/ blastp suite Align Sequences Protein BLAST 


blastn | blastp | blastx | tblastn | tblastx | 


BLASTP programs search protein subjects using a protein query. more... 





Enter Query Sequence 








Enter accession number(s), gi(s), or FASTA sequence(s) Clear Query subrange | 
>gil|48s5d477|ref|NP_005359.1| myoglobin [Homo sapiens) 
MGLS DGEWOLV LNVWGERVEADI PGHGOEV LIRLFEGHPET LEKF DKFKHLESEDEMEASEDLEKHGAT From 
YLTALGGI LAKKGHHEAEI KF LAQOSHATERHEIPYRYLEFISECIIOVLOSEHPGDFGADAGGAMNEAL 
ELFRKDMASNYKELGFOQS ; 
To 
Or, upload file No file selected. oe 
Job Title Myoglobin Vs Hemoglobin Sequences 
Enter a descriptive title for your BLAST search @# 
= Align two or more sequences & 
Enter Subject Sequence 
Enter accession number(s), gi(s), or FASTA sequence(s) & Clear Subject subrange &) 
>gil acne stot chert bioalee 000509. al shindig ees cebitealtat peat, inane gapaens] 
IVDEV PEVE From 
DERE EEC EEE TERT ICoT EA LEECH ELLEN EE RIC ELLCHE LECULAR EEN: AAYOKVVA 
GVANALAHK YH +. 
°o 


Or, upload file 





_| No file selected. 


Program Selection 
Algorithm 


Choose a BLAST algorithm @ 





= Show results in anew window 


(Algorithm parameters 


© blastp (protein-protein BLAST) 


Search protein sequence using Blastp (protein-protein BLAST) 


Note: Parameter values that differ from the default are highlig) 


Fig. 13.1 Pairwise sequence alignment using BLAST — Input 


(1) Go to BLAST website at http://blast.ncbi. 
nlm.nih.gov/Blast.cgi and choose ‘protein 
blast’. 

(ii) The human myoglobin sequence is copied 
into the ‘Enter Query Sequence’ box. 

(111) Choose ‘No Adjustment’ in “Compositional 
Adjustments’ option in ‘Scoring parameters’ 
box in the ‘Algorithm parameters’ section. 

(iv) Click “BLAST? button to run blast. 

In Fig. 13.4, the graphic summary and descrip- 

tions are shown as BLAST output. 


13.3.1.3 Multiple Sequence 
Alignment 
The insulin sequences from five different organ- 
isms are downloaded from the NCBI database 
and are given below: 
>Hamster gil693 1 6lpirllINHY insulin — hamster 
FVNQHLCGSHLVEALYLVCGERG 
FFYTPKSGIVDQCCTSICSLYQLENYCN 
>Monkey g1I538566lpirllINMKSQ insulin — com- 
mon squirrel monkey 
FVNQHLCGPHLVEALYLVCGERGFFY 
APKTGVVDQCCTSICSLY QLQONYCN 
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Blast 2 sequences 
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Query ID lelj/37939 Subject ID lel|/37941 
Description 9/|4385477|ref|/NP_005359.1| myoglobin [Homo Description g/|4504349|ref|/NP_000509.1| hemoglobin subunit 
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Molecule type amino acid Molecule type amino acid 
Query Length 154 Subject Length 147 


Program BLASTP 2.2.29+ > Citation 


Other reports: & Search Summary [Taxonomy reports] [Multiple alignment] 
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Distribution of 1 Blast Hits on the Query Sequence &@ 
[Mouse over to see the defline. click to show alignments 


Color key for alignment scores 
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Query 63 KKHGATVLTALGGI LKKKGHHEAEI KPLAQSHATKHRKIPVKYLEFISECIIQVLOSKHPG 122 
K HG VL A+ L + + L++ H K + + + $4 ++ VL 
Sbjct 62 KAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGK 121 


Query 123 DFGADAQGAMNKALELFRKDMASNY 147 


+F QA K + +A 6Y 
Sbjct 122 EFTPPVQAAYQKVVAGVANALAHKY 146 


Fig.13.2 Pairwise sequence alignment using BLAST — Output. (a) Graphic summary and (b) Alignments 
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© DELT4A-BLAST (Domain Enhanced Lookup Time Accelerated BLAST) 
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i Show results in a new window 


Fig. 13.3 Database search of a query using BLAST — Input 


>Ostrich g1169327IpirllINOS insulin — ostrich 
AANQHLCGSHLVEALYLVCGERGFFY SPK 
AGIVEQCCHNTCSLY QLENYCN 
>Elephant gil69307|pirllINEL insulin — elephant 
FVNQHLCGSHLVEALYLVCGERGFFYTPKT 
GIVEQCCTGVCSLY QLENYCN 
>Turkey g1l69326lpirllIINTK insulin — turkey 
(tentative sequence) 
AANQHLCGSHLVEALYLVCGERGFFY 
SPKAGIVEQCCHNTCSLY QLENYCN 
The multiple sequence alignment of the above 
insulin sequences of five different organisms are 
carried out using Clustal Omega tool available at 
http://www.ebi.ac.uk/Tools/msa/clustalo/. The 
screenshots of both input and output are given in 
Fig. 13.5. The following steps have to be carried 
out in giving the ‘Clustal Omega’ input as shown 
in Fig. 13.5: 
(1) Go to ‘Clustal omega’ website at http:// 
www.ebi.ac.uk/Tools/msa/clustalo/. 


Search database Non-redundant protein sequences (nr) using Blastp (protein-protein BLAST) 


(11) Copy all the above five ‘insulin’ sequences 
into the ‘STEP | Enter into input sequences’ 
box. 

(ii) Click ‘Submit’ 
Omega’. 

The multiple sequence alignment alone is shown 
as output. 


button to run ‘Clustal 


13.3.1.4 Other Sequence Analysis 
Tasks 

Apart from the above-mentioned basic sequence 
analysis tasks, other computational sequence 
analysis tools are also available to predict struc- 
tural and functional features of proteins from the 
sequence information alone. Secondary structure 
prediction is an important structural feature pre- 
diction where the computational tools can predict 
which portion of the sequence may form helix, 
sheet and coil. Similarly, protein fold recognition 
tools may predict the possible fold the sequence 
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Fig. 13.4 











Database search of a query using BLAST — Output. (a) Graphic summary and (b) Descriptions 
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Tools > Multiple Sequence Alignment > Clustal Omega 


Multiple Sequence Alignment 
Clustal Omega is a new multiple sequence alignment program that uses seeded guide trees and HMM profile-profile 
techniques to generate alignments. 


STEP 1 - Enter your input sequences 
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Output: 
CLUSTAL O(1.2.1) multiple sequence alignment 


Ostrich AANOHLCGSHLVEALYLVCGERGFFYS PKAGIVEOCCHNTCSLYOLENYCN 
Turkey AANOHLCGSHLVEALYLVCGERGFFYS PKAGI VEOQCCHNTCSLYOLENYCN 
Monkey FVNOHLCGPHLVEALYLVCGERGFFYAPKTGVVDOCCTS ICSLYOLONYCN 
Hamster FVNOHLCGSHLVEALYLVCGERGFFYTPKSGIVDOCCTSICSLYQOLENYCN 
Elephant Rina Seno ane eee a eee Ene See eee ee mn aren 


Fig.13.5 Multiple sequence alignment of insulin protein sequences using Clustal Omega — Input & Output 
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Table 13.2 List of sequence analysis tools 


SIL. no. Task(s) 
1. Pair-wise sequence alignment 
a Multiple sequence alignment 
and phylogenetic analysis 
2 Database search 
4. Secondary structure prediction 
a: Protein fold recognition 
6. Amyloidogenic regions prediction 


can adopt by using the structural features of 
known folds. Further, functional features like 
amyloidogenic regions of proteins may also be 
predicted. Some of the important sequence analy- 
sis tools for the above-mentioned tasks are pro- 
vided in Table 13.2. 


13.3.2 Structure Analysis Tasks 


In structure analysis, some of the important bio- 
informatics tasks are (i) visualisation of three- 
dimensional (3D) structures of biological 
macromolecules; (11) computing bond lengths, 
bond angles, torsion angles and other structural 
parameters; (111) superposition of protein struc- 
tures and to compute root-mean-square devia- 
tion (RMSD) between them; (iv) 3D structure 
prediction of proteins; (v) molecular docking and 
(vi) molecular dynamic simulations (Higgins and 
Taylor 2000; Orengo et al. 2003; Bourne 
and Weissig 2003). As we have mentioned in 
Sect. 13.2.3, structure is responsible for the func- 
tion of a protein; structural analysis tools are very 
important in understanding the functions of bio- 
logical molecules. 

The major experimental techniques available 
for the determination of 3D structure of biologi- 
cal macromolecules (Higgins and Taylor 2000; 
Gautham 2006) are (1) X-ray diffraction and (11) 
NMR methods. However, both these methods are 
tedious and time consuming. Even the latest 
high-throughput crystallographic structure deter- 
mination method using high-flux synchrotron 
radiation sources for data collection and employ- 
ing multiple anomalous diffraction methods for 
data interpretation is not keeping the pace at 
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Free online tools 
EMBOSS Needle, EMBOSS Water, LALIGN, BLAST 


ClustalW & Tree View, Clustal Omega, PHY LIP, 
PAUP, MEGA 


BLAST, FASTA 

GOR, PHD, DSSP, SOPMA, NPS@ 
3D-PSSM/PHYRE, nFOLD3, PredicFold PSS-3D1D 
AGGRESCAN, FoldAmyloid 


which DNA sequences are determined using the 
modern high-throughput sequencing methods. 
Hence, computational 3D structure prediction 
algorithms of bioinformatics become useful in 
structural analysis of biological 
macromolecules. 

The experimentally determined 3D structures of 
biological macromolecules are made available in 
the structure databases like (1) Protein Data Bank 
(PDB) maintained by Research Collaboratory for 
Structural Bioinformatics (RCSB), USA (http:// 
www.trcsb.org/pdb/home/home.do); (11) Molecular 
Modelling Database (MMDB) of NCBI, USA 
(http://www.ncbi.nim.nih.gov/Structure/MMDB/ 
mmdb.shtml); and (411) Nucleic Acid Database 
(NDB) (http://ndbserver.rutgers.edu/). There are 
many tools available for 3D structure visualisation 
such as (1) RasMol (http://rasmol.org/), (11) Swiss- 
PDBViewer Deep View (http://spdbv.vital-it.ch/), 
(ii) VMD — visual molecular dynamics (http:// 
www.ks.uiuc.edu/Research/vmd/) and (iv) PPMOL 
(http://www.pymol.org/). 

Further, three types of methods are available 
for 3D structure prediction of proteins, namely, 
(1) homology modelling, (1) threading and (111) 
‘ab initio’ methods. In order to perform homol- 
ogy modelling, a template 3D structure is 
required whose sequence identity must be more 
than 40 % with the query sequence for which the 
3D structure has to be predicted. Some of the free 
online tools, which can perform homology mod- 
elling, are Modeller (https://salilab.org/mod- 
eller/) and SWISS-MODEL (http://swissmodel. 
expasy.org/). If such a template structure is not 
available, then one may have to use threading 
method performed by typical tool such as 
I-TASSER _ (http://zhanglab.ccmb.med.umich. 
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edu/I-TASSER/). There exists another method, 
namely, ‘ab initio’ method, which can predict the 
three-dimensional structure from the first princi- 
ples using the sequence information alone, and 
hence, it is computationally intensive. The 
Robetta server (http://robetta.bakerlab.org/) is 
used to predict protein structure maintained by 
the Baker laboratory for noncommercial ab initio 
and comparative modelling. 

Once it is possible to predict  three- 
dimensional structure of proteins and visualise 
it, one can also perform many other structure 
analysis tasks, namely, docking of small mole- 
cules with target protein structures, protein—pro- 
tein docking and protein-DNA interactions. 
Further, one can also perform computational 
protein engineering experiments like (i) muta- 
tion experiments where one alters any particular 
amino acid or base for a desired effect and (11) 
site-directed mutagenesis experiments in which 
one identifies potential sites where one can do 
alterations. All these computational experiments 
have a lot of potential applications in various 
fields including agriculture, pharmaceutical and 
medical sciences. Let us describe few structure 
analysis tasks mentioned above with suitable 
examples. 





(alpha-1) 


Fig. 13.6 Visualisation of 3D structure of human hemo- 
globin molecule using RasMol (PDB ID: 1A3N) with 
indivudal chains A, B, C and D are depicted in different 
colours with heme molecule of each chain in purple 
colour 


S. Parthasarathy 


Visualisation of 3D Structure 

of Proteins Using RasMol 

The 3D structure of haemoglobin molecule of 
humans (PDB ID: 1A3N) is downloaded from 
PDB. Visualisation of this molecule using 
RasMol is shown in Fig. 13.6. Using the visuali- 
sation tool, we can identify the chains A, B, C 
and D and indicate them with different colours. 
In addition, the heme molecules in each chain are 
also shown in ball and stick display in purple 
colour. 


13.3.2.1 


13.3.2.2 Structure Function 
Relationship Using Swiss- 
PDBViewer Deep View 
In order to exhibit the structure—function rela- 
tionship, let us consider the single point mutation 
in haemoglobin gene that has resulted in sickle- 
cell anaemia in human population. It is evident 
that a single nucleotide change has led to the sub- 
stitution of glutamic acid (E) in the 6th position 
of normal to valine (V) of mutant, which resulted 
in the structural change in the protein which in 
turn has altered the cell shape. So, let us compare 
the structure of the B chain of haemoglobin of 
normal (PDB ID: 1A3N_B) and mutant (PDB 
ID: 2HBS_B) forms of humans after 


Normal Hemoglobin 
6" residue Glutamic 
acid (E) shown in Blue 









Mutant Hemoglobin 
6" residue Valine (V) 
shown in Red 


Fig.13.7 Structure function relationship—Superimposing 
the first 20 amino acids of the 3D structures of chain B of 
normal (1LA3N_B) and mutant (2ZHBS_B) hemoglobin 
molecule of human using Swiss-PDB Viewer Deep View 
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superimposing their structures using Swiss- 
PDBViewer Deep View tool and is shown in 
Fig. 13.7. In order to recognise the structural 
change in the 6th position (E to V) of normal and 
mutant haemoglobin structures, the first 20 amino 
acids are shown in Fig. 13.7. 


13.3.2.3 Structural Classification 

of Proteins 

Proteins are classified based on their three- 

dimensional structures. The following are the 

important databases on structural classification: 
1. SCOP — Structural Classification of Proteins 
(New SCOP2 database is available at http:// 
scop2.mrc-lmb.cam.ac.uk/.) 

i. CATH —- Class, Architecture, Topology, 
Homologous superfamily 
(CATH database is available at http://www. 
cathdb.info/.) 

i. FSSP—Fold classification based on Structure— 
Structure alignment of Proteins — obtained 
by Distance-matrix alignment (DALI) 
(FSSP database is available at http://protein. 
hbu.cn/fssp/.) 


13.3.2.4 Other Structural Analysis 
Tasks 

One of the major important structural bioinfor- 
matics tasks is docking. Docking is the process of 
predicting (or checking) the fitness of a small 
molecule (drug or ligand) into the active site (or a 
binding pocket) of a target protein molecule. 
After docking, the resulting docked structure is a 
protein—ligand complex. Computational docking 
algorithms are used to compute the binding of a 
drug to the target protein and their interactions in 
a computer-simulated environment using special- 
ised software. The other structural bioinformatics 
tasks are (1) active site prediction, (11) molecular 


Table 13.3 List of structure analysis tools 


SI. no. Task 

1. 3D structure visualization 

a 3D structure prediction 

a Docking 

4. Active site prediction 

5. Molecular dynamics simulations 
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dynamics simulation, (111) quantitative structure 
activity relationship (QSAR), (iv) virtual screen- 
ing, (v) pharmacophore modelling and so on. 
Some of the important structural analysis tools 
for the above-mentioned tasks are provided in 
Table 13.3. 


13.4 Recent Developments 
in Bioinformatics 


The tasks and tools mentioned above are used to 
analyse single sequence or structure only. In the 
postgenomic era, the availability of genome 
sequences of many organisms and the data avail- 
able through the modern high-throughput experi- 
ments require more comprehensive tools to 
analyse them. Recent developments in bioinfor- 
matics include (i) genomics, (11) transcriptomics, 
(111) proteomics, (iv) metabolomics and (v) sys- 
tems biology. Let us briefly explain about these 
recent concepts. 


13.4.1 Genomics 


As we know, the total DNA content (i.e. nuclear, 
organelle and plastid DNA) is known as genome. 
Genomics is the study of the structure, content, 
evolution and functions of genes in genomes 
(Lesk 2007). The major aims of genomics are to 
(1) establish an integrated web-based database 
and research interface; (11) assemble physical, 
genetic and cytological maps of the genome; (111) 
identify and annotate the complete set of genes 
encoded within a genome and (iv) provide the 
resources for comparison with other genomes. 
There are many computational tools available 
to analyse the genomes. Genomes of many 


Free online tools 

RasMol, Swiss-PDB Viewer, VMD, PyYMOL 
Modeller, SwissModel, I-TASSER, Robetta 
AutoDock, Argus Lab, Hex 

Q-SiteFinder, CASTp 

AMBER, GROMACS 
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organisms and plants are available in NCBI 
genome database (http://www.ncbi.nlm.nih.gov/ 
genome). Further, NCBI genome Map Viewer 
(http://www.ncbi.nlm.nih.gov/mapview/) is a very 
useful tool in viewing genomes and also to down- 
load the sequence of any particular gene or set of 
genes. A typical screenshot of NCBI Map Viewer 
of the Homo sapiens (human) genome is shown in 
Fig. 13.8. All the 22 autosomes, X and Y chromo- 
somes and mitochondrial DNA are depicted in 
Fig. 13.8. Similarly, the screenshot of NCBI Map 
Viewer showing the genes on sequence of Homo 
sapiens (human) chromosome 1 is shown in 
Fig. 13.9. The other popular genome browsers are 
(i) UCSC genome browser (http://genome.ucsc. 
edu/) and (11) Ensembl human genome browser 
(http://www.ensembl.org/index.html). 

One of the important applications of genomics 
is the prediction of coding genes using the gene 
finding algorithms from the genomes of prokary- 
otes and eukaryotes. Once the genome sequence 
is obtained through experimental high-throughput 
sequencing projects, the coding genes are pre- 
dicted using computational gene prediction tools 
such as (1) GeneMark (http://www.ncbi.nlm.nih. 
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gov/genomes/MICROBES/genemark.cgi), — (41) 
GENSCAN _ (http://genes.mit.edu/GENSCAN. 
html), G1) GLIMMER (http://www.ncbi.nimnih. 
gov/genomes/MICROBES/glimmer_3.cgi) _ for 
prokaryotic and eukaryotic genomes, (iv) GRAIL 
(http://www. broadinstitute.org/mpg/grail/) and 
(v) ORF Finder (http://www.bioinformatics.org/ 
sms2/orf_find.html) for eukaryotic genomes. 

Single nucleotide polymorphism (SNP) analy- 
sis is another application of genomics. SNP is a 
genetic variation between individuals, limited to 
a single base pair, which can be substituted, 
inserted or deleted. The dbSNP is the database of 
SNPs available at http://www.ncbi.nlm.nih.gov/ 
SNP/. SNPs are useful in genetic disorders and 
identification of evolutionary traits of an organ- 
ism or a plant across geographic variations. One 
of the examples of SNPs is a genetic disorder, 
namely, sickle-cell anaemia disease, which is 
caused by a specific SNP, an adenine to thymine 
mutation in the B-globin gene that changes a glu- 
tamic acid to valine in the 6th residue (c.f. 
Fig. 13.7), creating a sticky surface on the hae- 
moglobin molecule that leads to the polymerisa- 
tion of the deoxy form. 
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Fig. 13.8 Screen shot of NCBI Map Viewer of the Homo sapiens (human) genome view 
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Fig. 13.9 Screen shot of NCBI Map Viewer showing the genes on sequence of Homo sapiens (human) chromosome 1 


Comparative genomics is another important 
application of genomics, and its tools like TaxPlot 
(http://www.ncbi.nlm.nih.gov/sutils/taxik2.cgi) 
and MUMmer (http://mummer.sourceforge.net/) 
are useful in comparing genomes of related 
organisms. Further, genomics has applications in 
many other areas including genetically modified 
plants and animals, treating genetic disorder dis- 
eases, personalised medicine and so on. 


13.4.2 Transcriptomics 


Transcriptome is the complete set of mRNA 
trasncripts of an organism. Transcriptomics per- 
forms global analyses of gene expression with 
the help of high-throughput techniques, such as 
DNA microarrays. 

Some of the important techniques used in 
transcriptomics are (1) CDNA arrays, (11) DNA 
microarrays, (111) photolithography — 250,000 oli- 
gonucleotide spots per sq.cm. and (iv) serial anal- 
ysis of gene expression (SAGE). The ExPASy 


transcriptomics server (http://www.expasy.org/ 
transcriptomics) is one of the major servers con- 
taining many transcriptomics tools. 


13.4.3 Proteomics 


Cellular activities like metabolic processes are 
mediated by proteins of the ‘proteome’ and not 
by the genes of the ‘genome’ or mRNA of the 
‘transcriptome’. Proteome is the complete collec- 
tion of proteins 1n a cell/tissue/organism at a par- 
ticular time. Unlike genomes which are stable 
over the lifetime of the organism, proteomes 
change rapidly as each cell responds to its chang- 
ing environment and produces new proteins at 
different quantities. 

It is interesting to note that genome is a more 
stable entity. An organism has only one genome 
but many proteomes. In the case of human, there 
may be (1) one body-wide proteome, (11) about 
200 tissue proteomes and about a trillion (~10'’) 
individual cell proteomes. 
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The study of proteomes that includes deter- 
mining the 3D shapes of proteins, their roles 
inside cells and the molecules with which they 
interact and defining which proteins are present 
and how much of each is present at a given time. 
The important techniques used in proteomics are 
(1) two-dimensional polyacrylamide gel electro- 
phoresis (2D-PAGE) used for separation by 
charge and mass, (11) matrix-assisted laser 
desorption/ionisation—time-of-flight (MALDI- 
TOF) mass spectrometry and (111) protein micro- 
arrays. The ExPASy Proteomics Server (http:// 
www.expasy.org/proteomics) is one of the major 
servers containing many proteomics tools. 

Some of the important applications of pro- 
teomics are (1) to correlate proteins on the basis 
of their expression profiles, (11) to observe pat- 
terns in protein synthesis and this observed pat- 
tern changes can be used as an indicator of the 
state of cell and its gene expression, (111) to char- 
acterise bacterial pathogens and to develop novel 
antimicrobials, (v) to identify regions of the bac- 
terial genome that encode pathogenic determi- 
nants and (vi) to develop drugs. Further, 
proteomics can also be used as a tool for plant 
genetics and breeding. 


13.4.4 Metabolomics 


Metabolites are the small molecules, which are 
the intermediates and products of metabolism. 
The total metabolite pool of a cell is called the 
metabolome of an organism. The research area 
dealing with metabolic profiling 1s termed metab- 
olomics. Cells can be profiled with mass spectro- 
scopic techniques to quantify cellular metabolites. 
In the Human Metabolome Database (http:// 
www.hmdb.ca/), approximately 3,000 known 
metabolites are deposited. 


13.4.5 Systems Biology 


Systems biology (Kitano 2001, 2002; Szallasi 
et al. 2006) is a new perspective and emerging 
field for research in the postgenomic era. It aims 
at system level understanding of biological sys- 
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tems. Systems biology is an approach to quantita- 
tive study of biological systems that attempts to 
embrace the complexity of life as a fact of life. 
Systems biology studies whole cells/tissues/ 
organisms not by a traditional reductionist’s 
approach but by holistic means in a reiterative 
attempt to model quantitatively the complete cell/ 
tissue/organism. It is an integrated and interact- 
ing network of genes, proteins and biochemical 
reactions, which give rise to life. 

The latest high-throughput procedures, 
namely, genomics, transcriptomics, proteomics, 
metabolomics and phenomics, have become 
important technologies that facilitate the func- 
tional determination of gene products. The inte- 
gration of all available data from all these 
high-throughput ‘omics’ procedures and analyse 
them together improves the reliability and the 
generation of more reliable hypotheses. Systems 
biology is the new emerging research field that 
focuses on the integration of various high- 
throughput ‘omics’ data because it analyses an 
entire biological system. 

Systems biology aims to produce as accurate a 
picture as possible of all the regulatory processes 
within a cell or organism by analysing the inter- 
actions between component parts of the biologi- 
cal system, e.g. metabolic pathways, organelles, 
cells and tissues. Further, the ambitious aim of 
systems biology is to develop computer models 
that simulate biological systems and predict con- 
sequences upon changing parameters (e.g. chang- 
ing the concentration of a specific metabolite). 

Some of the important software tools and 
projects in systems biology are (i) Systems 
Biology Markup Language (SBML), (11) Systems 
Biology Workbench (SBW), (iu) E-Cell, (iv) 
Virtual Cell, (v) Cytoscape and (vi) Virtual Rice 
Project. Let us briefly explain about them. 
13.4.5.1 Systems Biology Markup 
Language (SBML) 

Systems Biology Markup Language (SBML) is 
an Extensible Markup Language (XML)-based 
format for representing computational models in 
a way that can be used by more than 80 software 
tools worldwide to communicate and exchange 
those models. Therefore, each calculated model 
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can be tested in different software environments 
without additional effort. The home page of 
SBML is _http://sbml.org/Main_Page. The 
BioModels Database contains several biological 
models, and it is available at http://www.ebi.ac. 
uk/biomodels-main/. 


13.4.5.2 Systems Biology Workbench 
Systems Biology Workbench (SBW) is the soft- 
ware that allows communications between 
diverse software modules using SBML. SBW 
2.7.9 1s available at http://sys-bio.org/. Some of 
the important SBW components are (1) Jarnac, 
which is a programme for interactive systems 
biology and (11) JDesigner 2 is used for modelling 
biochemical networks. 


13.4.5.3 E-Cell 

E-Cell project aims to make precise computa- 
tional whole-cell simulation at the molecular 
level possible; E-Cell System is an_ object- 
oriented software suite for modelling, simulation 
and analysis of large-scale complex systems such 
as biological cells, architected by Koichi 
Takahashi team. The core part of the system, 
E-Cell Simulation Environment version 3, allows 
many components driven by multiple algorithms 
with different timescales to coexist. E-Cell 
System consists of the four major parts, namely, 
(1) E-Cell Console, (41) E-Cell Model Editor, (111) 
E-Cell Sessions Monitor and (iv) E-Cell Tool 
launcher. E-Cell is available at http://www.e-cell. 
org/ecell. 


13.4.5.4 Virtual Cell 

The Natural Resource for Cell Analysis and 
Modelling has developed the Virtual Cell 
Modelling and Simulation Software to model cell 
biological processes. The VCell 4.7 Server is 
available at http://vcell.org/. This new technology 
associates biochemical and electrophysiological 
data describing individual reactions with experi- 
mental microscopic image data describing their 
subcellular locations. Cell physiological events 
can then be simulated within the empirically 
derived geometries, thus facilitating the direct 
comparison of model predictions with experi- 
ment. The simulations can be run over the 
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Internet. Models developed by VCell can be 
reused, updated and published so that they are 
available to the scientific community. 


13.4.5.5 Cytoscape 

Cytoscape is an open-source bioinformatics soft- 
ware platform for visualising molecular interac- 
tion networks and integrating these interactions 
with gene expression profiles and other state 
data. Cytoscape 2.6.3 is available at http://cyto- 
scape.org/. 


13.4.5.6 Virtual Rice Project 

Japan had set up the Rice Simulator Project 
(Harris 2002) to create an in silico rice plant. This 
is an experimental scheme funded by MAFF, 
Japan, to the tune of 1.2 billion Yen. The two 
main aims of the project are to (1) create a net- 
work of databases from sequence, expression and 
microarray data and (11) create in silico models of 
the plant at the level of single cells, tissues and 
the whole plant. Mathematical models for use in 
prediction studies will be developed in four areas, 
namely, (1) knockout simulation, (11) stress 
response simulation, (111) growth simulation and 
(iv) pollination or breeding simulation. The sim- 
ulation software being developed under the proj- 
ect will also allow researchers to set up a model 
for viewing, for example, the effects of a drop in 
temperature on a plant at various stages of growth 
such as sprouting, flowering or heading. The Rice 
Simulator Project will create the technological 
engine to drive future development of crop 
varieties. 


13.4.5.7. Research in Systems Biology 
Research in systems biology requires the collab- 
oration of researchers from diverse backgrounds, 
including biology, computer science, mathemat- 
ics, Statistics, physics and biochemistry. These 
collaborations are necessary because of the enor- 
mous breadth of background needed for research 
in this field. These collaborations can be hindered 
by differing understandings of the limitations and 
applicability of techniques and concerns from 
different disciplines. 

Systems biology will necessarily be a leading 
academic and industrial thrust in the years to 
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come. Its impact on agriculture, medicine, bio- 
logical energy production and many other areas 
will make biotechnology a powerful driving force 
in the near future. Systems biology requires sci- 
entists able to cross boundaries between many 
disciplines. So far, biology has traditionally been 
an observational science rather than a predictive 
science. Systems biology is the only hope of 
understanding biological systems at the predic- 
tive level required for disease detection, preven- 
tion or cure. 


13.4.6 The ENCODE Project 


The draft human genome was published in 2000 
and the finished human genome was published in 
2003. Human genome consists of approximately 
three billion base pairs of genetic sequence, and 
it codes for around 20,000 genes that amount to 
little more than | % of the genome only. More 
than 98 % of the genetic sequence may not be 
simply ‘junk’ DNA. The ENCODE project 
known as Encyclopedia of DNA Elements, which 
Started in 2003, is a massive data collection effort 
designed to identify the information responsible 
for the ‘complexity’ of humans lay somewhere in 
the ‘deserts’ between the genes. The aims of 
ENCODE project are to (i) catalogue the func- 
tional DNA sequences in the regions between the 
genes, (11) learn when and in which cells they are 
active and (ili) trace their effects on how the 
genome is packaged, regulated and read. 

After an initial pilot phase, the scientists of 
ENCODE project consortium started applying 
their methods to the entire genome in 2007, and 
the results of that phase were completed in 2012. 
The ENCODE project consortium (2012) had 
recently published these results in 30 papers in 
three major journals, namely, (4) Nature (6 
papers), (11) Genome Biology (© papers) and (111) 
Genome Research (18 papers). The consortium 
had assigned some sort of function to roughly 
80 % of the genome, including (4) more than 
70,000 ‘promoter’ regions, which are the sites, 
just upstream of genes, where proteins bind to 
control gene expression, and (11) nearly 400,000 
‘enhancer’ regions that regulate expression of 
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distant genes. These results of the present and 
future phases of the ENCODE project will prove 
useful in many areas of science including person- 
alised medicine, in the decades to come. 


13.5 Applications 
of Bioinformatics 


There are many applications of bioinformatics in 
various branches of science including agricul- 
ture, pharmaceutical, clinical and biomedical sci- 
ences. The major application of bioinformatics is 
in the computer-aided drug discovery process in 
the pharmaceutical sciences. The tasks include 
molecular modelling; molecular dynamic simu- 
lation; molecular docking; virtual screening; 
pharmacophore modelling; absorption, distribu- 
tion, metabolism, excretion and _ toxicity 
(ADMET) properties and so on involved in 
structure-based and ligand-based drug/inhibitor 
designing processes. 

Bioinformatics is also used in the develop- 
ment and analysis of genetically modified plants 
and animals. It is useful in identifying the loca- 
tion of genes to be modified, computing various 
possible gene modifications, the computational 
analysis on the structure and functions of the 
modified genes and so on. In addition to 
computer-aided drug designing, bioinformatics 
tools can be used in clinical and biomedical sci- 
ences especially in the computational investiga- 
tions of genetic disorders, gene _ therapy, 
personalised medicine and so on. 

More recently, “DNA Barcode of Life’ (Hebert 
et al. 2003; Janzen and CBOL Plant Working 
Group 2009) is an application of phylogenetic 
analysis of bioinformatics for taxonomical clas- 
sification of biodiversity. Microgenomic identifi- 
cation systems use a small segment of the genome 
for taxonomic classification of organisms such as 
viruses, bacteria and Protista, which are unable to 
be classified using morphological features. Such 
molecular phylogeny has been extended from 
microorganisms to all organisms. Recently, 
DNA-based identification systems have been 
applied not only to microorganisms but also to 
higher organisms. DNA barcoding is to provide a 
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standardised approach to identify animals and 
plants by using small fragment of DNA sequences 
from the same segment of all the genomes, called 
DNA barcodes. The major goal of DNA barcod- 
ing is that anyone, anywhere and anytime will be 
able to identify quickly and accurately the spe- 
cies of a specimen, whatever its condition and 
stage of life. 

Recently, Hebert et al. (2003) had established 
that the mitochondrial cytochrome c oxidase I 
(COI or COX I) gene can serve as a core of a 
global bio-identification system for animals. 
However, for plants, mitochondrial genes do not 
differ sufficiently to distinguish among closely 
related species. In 2005, at the first international 
barcode meeting, Kress and his colleagues nomi- 
nated chloroplast genes of about 450 bases, part 
of a ‘spacer’ sequence between two genes as bar- 
code for the plants, trnH—psbA. More recently, 
the consortium for the Barcode of Life (CBOL) 
plant working group (Janzen and CBOL Plant 
Working Group 2009) had identified that seven 
leading plastid DNA regions (atpF—atpH spacer, 
matK gene, rbcL gene, rpoB gene, rpoCl gene, 
psbK-—psbI spacer and trnH—psbA spacer) can be 
used as barcodes for plants. Once fully devel- 
oped, DNA barcoding has the potential to com- 
pletely revolutionise our knowledge of 
biodiversity and our relationship to nature. There 
are many other applications of bioinformatics 
including the new areas, namely, genomics, pro- 
teomics and systems biology in various branches 
of science. 


13.6 Conclusions 


Biological data is growing rapidly and becomes 
so voluminous. Because of its size, it requires 
lots of information technology skills to develop 
more efficient algorithms to analyse the biologi- 
cal data. In this chapter, we explained the need 
for bioinformatics and also the basic concepts of 
bioinformatics. Some of the fundamental biolog- 
ical sequence and structure analysis tasks and the 
bioinformatics tools used are described. The 
recent developments through the high-throughput 
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experiments, the ‘omes’ and ‘omics’ concepts, 
namely, genomics, transcriptomics, proteomics, 
metabolomics and systems biology, are also 
briefly explained. Finally, some of the important 
applications of bioinformatics are mentioned. 

Bioinformatics requires novel methods and 
algorithms to handle genome sequences as such 
and also to compare genomes of related 
organisms. It also requires lots of IT skills not 
only to develop many new bioinformatics soft- 
ware tools but also to use the existing tools more 
efficiently. The molecular modelling and docking 
of drug molecules need tremendous computing 
skills for designing novel drugs required to con- 
trol or cure various diseases, which results in 
huge cost cuts in pharmaceutical industries. 
Many of the biotechnology and pharmaceutical 
industries have a separate bioinformatics divi- 
sion. Hence, bioinformatics including biotech- 
nology (BT) have already taken on information 
technology (IT) for its genomics, proteomics and 
systems biology projects. Many prominent IT 
companies like IBM, WIPRO and TCS had 
already started a separate division for either life 
sciences or health care. Information technology 
professionals with some basic knowledge in any 
branch of life sciences will find bioinformatics as 
a quite interesting and challenging job. In gen- 
eral, bioinformatics provides lot of job and 
research opportunities. 
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Abstract 

Systems biology is an emerging field with the potential of making 
significant contribution to human life and other living organisms. It helps 
to understand the entirety of life and elucidates basic principles behind the 
biological life. It captures the complexity of complex biological system 
and explains the relationship between every gene, transcript, protein, and 
phenotype. Advancements in functional and structural genomics have 
enabled the molecular biologists to come a long way toward understand- 
ing molecular constituents of the cell. Yet, we fail to understand how 
organisms function. To date, we face lot of problems in complete under- 
stating of several diseases caused to plants, animals, and other productive 
organisms. To understand underlying biological processes and to find out 
potential new drug targets, we need to understand complete molecular net- 
work systems. Systems biology is an approach based on interdisciplinary 
fields which focuses on systematic study of complex biological systems 
using new perspective of holism instead of conventional reductionism. 
Systems biology follows a holistic approach to understand life by using 
interactomics, genomics, transcriptomics, metabolomics, proteomics, and 
informational science. In this chapter, we explained different components 
of systems biology and its connections with other disciplines. In the future, 
this integrated science can answer several devastating diseases and mys- 
teries in biological systems. They can be readily tested by computer based 
models. 
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Cellular networks 


14.1 Introduction 

Although systems biology is known for years, it 
is still relatively new compared to other fields of 
science. The developments in the different fields 
of biological sciences have made possible shift 
from combined molecular and system-level 
approaches to biological research under the name 
of systems biology. It integrates molecular and 
structural knowledge, which can best be achieved 
by the combinatorial use of models and experi- 
mental data. In fact, systems biology has several 
definitions: Systems biology has been described 
as a comprehensive quantitative analysis of the 
manner in which all the components of a 
biological system interact functionally over time 
(Aderem 2005); this area of work can integrate 
behavior of complex biological organization and 
process in terms of molecular constituents 
(Kirschner 2005); and “systems biology” aims at 
a quantitative understanding of biological sys- 
tems to the extent that one is able to predict sys- 


Fig. 14.1 Causal model of 
cancer: union of all three 
networks activated by 
epidermal growth factor 
receptor (EGFR) drugs — 
(blue) drug, (yellow) 
drug2, (green) drug3, 
(black) two of the above, 
and (pink) all of the three 


temic features (Bork and Serrano 2005). Different 
types of modeling approaches are useful depend- 
ing on the amount and quality of the molecular 
data available and the purpose of the model. Two 
main approaches of systems biology can be dis- 
tinguished as top-down and bottom-up. In top- 
down systems biology method, characterization 
of cells can be achieved by combinative model of 
system-wide data that is obtained from genomics 
and metabolomics. Those models are often phe- 
nomenal but serve to discover new insights into 
the molecular networks. In bottom-up systems 
biology method, the study does not start with 
data, but detailed model of a molecular network 
on the basis of its molecular properties. In this 
approach, molecular networks can be quantita- 
tively studied leading to predictive models; they 
can be applied in drug design and optimization of 
product formation in bioengineering (Bruggeman 
et al. 2007; Bruggeman and Westerhoff 2007) as 
shown in Fig. 14.1. Based on these two 
approaches in systems biology, we have solved 
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attained number of successes. The most impor- 
tant consequence of Human Genome Project 
(HGP) and structural genomics is that these are 
pushing scientists toward new view of biology. 
Systems biology investigates neither individual 
gene nor protein one at a time, rather investigates 
in a network manner. This data can be integrated, 
graphically displayed, and used for computa- 
tions. In this respect, complete genome sequence 
is crucial in systems biology; the complete 
genome sequences of human (International 
Human Genome Sequencing 2004; Lander 2011; 
Schmutz et al. 2004), nematode (Harris et al. 
2014; van der Velde et al. 2014), fly (Levine and 
Malik 2011), Arabidopsis (Arabidopsis Genome 
2000; Meinke et al. 1998), yeast (Bassett et al. 
1996; Levy 1994; Zagulski et al. 1998), and sev- 
eral microbes and parasites are in the pipeline. 
HGP has improved the ability of the cells system- 
atically; it is also facilitated new technologies for 
systematically characterizing the cellular 
response: DNA sequences, microarrays, and 
high-throughput proteomics. Alternatively, gene- 
expression data could be combined with either 
information on protein-protein interactions and 
protein phylogenetic profiles (Chen and Xu 2003; 
Rao et al. 2014) or with the genomic location of 
each gene to find shared regulatory elements 
(Barrera and Ren 2006; Bischof et al. 2013). 
Finally, it is difficult to decide significant changes 
in expression level and is also an active area of 
research. Gene-expression clustering and its 
extensions, method of expression data analysis, 
have been reviewed elsewhere (Brazma and Vilo 
2000; Claverie 1999; Heyer et al. 1999; Sherlock 
2000). Although this analysis has been certainly 
informative, global data sets undoubtedly pro- 
vide additional information that remains 
untapped. Ultimately, it would be highly desir- 
able to analyze expression levels and other global 
measurements in a way that it validates our cur- 
rent knowledge of a cellular process and isolates 
discrepancies between expected and observed 
levels. To achieve this level of analysis, the model 
of biological process of interest is well defined 
with comparative and integrated global data. 
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14.2. The Increasing Importance 
of Computer Models 


Conventionally, a biological model begins in the 
mind of the individual researcher as a proposed 
mechanism to account for some experimental 
observations. Often, the researcher represents 
his/her ideas by sketching a diagram using pen 
and paper. This diagram is an incredible aid in 
thinking evidently about the center of the idea by 
framing a model to predict possible experimental 
outcomes. To convey this model, not surprisingly, 
diagrammatic representation of models is dis- 
cussed in journals, textbooks, presentations, and 
lectures. 

Despite the use of these conventional models is 
simple, advances in systems biology are prompt- 
ing some biologists to forego mental models or 
pen-and-paper diagrams to more sophisticated 
computer representations. The notion of modeling 
a biological process computationally is almost as 
old as the computer; such models are becoming 
popular for several reasons. First, it is apparent 
that the comprehensive interactions in a cell are 
too complicated and unpredictable to process and 
organize the interactions (Bhalla 2003; 
Eungdamrong and Iyengar 2004; Gianchandani 
et al. 2006; Weng et al. 1999). Second, the tradi- 
tion of whole genome sequencing, high-through- 
put microarray data, and_ state-of-the art 
technology begin to generate a vast amount of 
biological data; now, the idea of descriptive sci- 
ence is shifting toward predictive science (Bhalla 
et al. 2005; Gilbert 1991; Sakaki 2005; Weatherall 
1999; Yu et al. 2008). Computer systems play an 
important role in storing, sorting, and condensing 
the accumulating mass of data rapidly; in addi- 
tion, automated tools are assimilating these data 
into a network model; it can predict network 
behaviors and outcomes that may be tested exper- 
imentally. It is encouraging that recent computer 
simulations of partial or whole genetic networks 
have demonstrated network behaviors, commonly 
called systems properties or emergent properties, 
that were not apparent from examination of a few 
isolated interactions alone (Cleary et al. 2014; 
Dall’ Olio et al. 2014; Weckwerth 2011). 
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The molecular biosciences deliver lots of data 
that require systems approaches to understand 
their implications for cell function. The more 
systems-oriented sciences avail of several sys- 
tems approaches that have worked well for inani- 
mate systems. Not all molecular bioscientists are 
familiar with current mathematical methods, nor 
are all mathematicians aware of acceptable bio- 
logical mechanisms. Of course, implementation 
of mathematics to large data sets will deliver 
results that might look remarkable enough to be 
published. However, unless systems biology is 
carried out correctly, it might not deliver an 
understanding of cell function that is based on 
molecular interactions. 


14.3 Genetic Networks 


To understand biology at the system level, we 
must examine the structure and dynamics of cel- 
lular and organismal function, rather than the 
characteristics of isolated parts of a cell or organ- 
ism (Kitano 2002). Cell is not just an assembly of 
genes and proteins. Systems biology comple- 
ments molecular biology. Genomics has provided 
vast amount of information, which links gene 
activity with diseases. However, there are a num- 
ber of reasons why gene sequence information 
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does not provide a complete and all properties of 
proteins and their function. Because of piecewise 
nature, the current approach of predicting each 
gene function based on sequence similarity 
searches often fails to reconstruct cellular 
functions. 


14.3.1 Complete Pathway 
Simulations in the Cell-E-Cell 


The complete software environment for simula- 
tion of activities of a whole cell (Fig. 14.2) is 
called an E-cell (electronic cell). In general, sci- 
entific work on biochemical and genetic simula- 
tions has helped in understanding individual 
pathways, but methods for building integrative 
models of gene regulation, metabolism, signal- 
ing, and phenotypic expression have not been 
established. Hence, scientists have evolved a new 
approach, involving the development of software 
environment for building such integrative models 
based on gene sets and coordinated network of 
genes to conduct experiments in silico. E-cell is 
hence an approach for modeling and simulating 
environment for understanding biochemical and 
genetic processes. This system allows a user to 
define functions of proteins, protein-nucleic acid, 
and protein-protein interactions. The E-cell is 
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Fig.14.2 Diagrammatic representation of E-cell 
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majorly modeled using three classes of objects: 

genes, products, and reaction rules. 

This new field requires a strong combination 
of computational approach with biological data. 
E-cell modeling is an art of converting biology 
into numbers and entities. A model is an optimal 
mix of hypothesis, evidence, and abstraction to 
explain a phenomenon. 

The following biological activities would be 
important to model a complete virtual cell: 

1. Metabolic pathways 

. Cell membrane dynamic 

. Intracellular molecular trafficking 

. Signaling pathways 

Membrane transport and ion channels 

Cell division 

DNA replication and repair 

. Transcription 

. Translation 

. Aging 

A cell represents a dynamic environment of 

interactions among nucleic acids, proteins, car- 

bohydrates, ions, pH, temperature, stress, and 
environmental signals. Three elements are neces- 
sary to make a good model: (a) precise knowI- 
edge of biological phenomenon, (b) an accurate 
and appropriate conversion of biological data 
into mathematical data, and (c) good simulation 

tools. Mapping cellular networks is always a 

challenging job, especially in the presence of a 

large number of cross-link pathways. 

(a) Linear chains: represent a unidirectional flow 
of information or energy. 

(b) Branched chains: one enzyme acts on one 
substrate and results on two products. 

(c) Loops: two or three branches unite, giving 
rise to inherent dependencies between 
them. 

(d) Cycles: larger loops composed of many 
intermediates, having one overall entry and 
one exit point in the system. 

The resultant model structure will ultimately 
be custom built to the user’s needs and not 
according to the software abilities. The goal is 
that by combining model building with algorithm 
implementation, the modeling of any system 
within the E-cell simulation environment should 
be easy and open ended. Large-scale complex 
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models can be best created and composed in arbi- 
trary algorithms. 


14.3.2 Applications of E-Cell 


(a) A model library of bacterial chemotaxis: 
Bacterial organisms like Escherichia coli 
have developed stress signaling mechanism 
to detect and direct cell movement toward the 
substrate when stared or exposed to unfavor- 
able conditions. Such behavior is known as 
chemotaxis. This behaviour of chemotaxis is 
demonstrated in http://www.e-cell.org initial 
amount of substances, kinetic parameters are 
taken from the original models and published 
papers (Bray et al. 1993; Mello and Tu 2003; 
Morton-Firth et al. 1999). Both deterministic 
and stochastic models of chemotaxis have 
been successfully implemented. 
Electrophysiological simulations on cardio- 
myocytes: The cardiac membrane contains 
various ionic channels and pumps that allow 
specific ions to travel as well as exchanged 
through the membrane. Models from the 
action potential simulation at different devel- 
opmental stages were constructed on the 
basis of Kyoto and Luo-Rudy models. The 
latest version of the Luo-Rudy model was 
implemented in the E-cell version 3.1. 

(c) Model and simulations of human erythrocyte 
metabolism: The simulation studies enabled 
us to see the rational design of metabolic 
pathways in human erythrocytes as an evolu- 
tionarily optimized system. Mathematical 
and logical models can be applied not only 
for straightforward simulations, but also for 
the nature of explanations of the nature of 
network properties complying nonlinear 
dynamics in the cell. The metabolic model of 
human erythrocyte on E-cell system, which 
consists of glycolysis, pentose phosphate 
pathways, nucleotide metabolism, simple 
membrane transport systems, and the ATP- 
dependent Na*/K* pump, carried out simula- 
tion analysis of G6PDH. This basic model 
was developed by Joshi and Palsson (1990). 
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Negative Control 





Positive Control 


Fig.14.3 Diagrammatic representation of negative and positive control in feedback-dependent signaling system 


(d) Modeling of mitochondrial — energy 
metabolism: This model includes 58 enzy- 
matic reactions and 117 metabolites to repre- 
sent the respiratory chain, TCA cycle, fatty 
acid f-oxidation, and inner membrane 
metabolite transporters. In this model spe- 
cially inner membrane metabolite transport- 
ers were included to allow simulation of 
metabolite insertion from outside mitochon- 
drion (Yugi and Tomita 2004). This model 
was based on published kinetic parameters. 
With this model, it is possible to compare 
mitochondrial ATP production in vivo and in 
silico and also comprehensive profiling of 
intracellular metabolite concentrations. 


14.3.3 Genetic Network Analysis 


Living cells can be viewed as signal processors 
also. They will have metabolic and genetic net- 
work systems. The purpose of the network is to 
control the life processes so that the organism can 
grow, survive, and multiply in a constantly chang- 
ing, unfavorable, and challenging environment. 
The cells receive signals from the environment in 
the form of hormones, nod factors, nutrient lev- 
els, and/or stresses (cold, heat drought, toxins). 
The cells have intra- and intercell signaling, e.g., 
events such as DNA damage, cell division, and 


chromosomal duplication. These cell signals may 
start repair or begin apoptosis. 

An essential mechanism in cell regulation is 
feedback (Fig. 14.3). Negative feedback is a sys- 
tem where the product or output downregulates 
the process or system. This is very common in 
metabolic reactions and amino acid synthesis 
pathways. Positive feedback is a signaling loop 
where the product reinforces the process. Cyclical 
processes are processes that never reach a steady 
state but move between stages and states in an 
ordered fashion. The cell cycle is a prime exam- 
ple and another one circadian clock in flowering 
plants. 





14.4 Metabolic Networks 


Metabolism is the totality of all the chemical 
reactions that operate in a living organism. It per- 
forms two types of reactions: (a) catabolic reac- 
tions where breakdown of larger molecules 
produces energy and (b) anabolic reactions which 
involve the use of energy and build up essential 
cell components. The current approach of pre- 
dicting each gene function based on sequence 
similarity searches often fails to reconstruct cel- 
lular functions with all necessary components. 
Metabolic pathway data is used in systems biol- 
ogy to predict the biological function of a partic- 
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ular genome sequence either by activation or 
interference of pathway intermediates. For 
instance, the metabolic pathway diagram from 
KEGG (Kyoto Encyclopedia of Genes and 
Genomes) database (Fig. 14.4) organizes and 
digitalizes all current knowledge of molecular 
and genetic pathways from experimental obser- 
vations. It maintains the catalog of every organ- 
ism that has been sequenced and the mapping of 
each gene product onto a component in the path- 
way. The main aim of KEGG database is to 
describe, predict, and possibly design systems 
behavior of living organisms. The KEGG data- 
base consists of two sections: (a) metabolic path- 
way section and (b) regulatory section. The first 
one is well organized in KEGG database, whereas 
regulatory pathways are still rudimentary. KEGG 
is based on binary relations, which help in draw- 
ing the pathway diagrams based on successive 
conversion of various states. The substrate- 
product binary relations have been utilized in 
computing possible reaction paths, and the 
enzyme-enzyme binary relations have been cor- 
related with other types of molecular relations. In 
metabolic networking systems, three types of 
links are used in path calculations: 
1. Factual link — cross-references in databases — 
links between genes and functions 
2. Similarity link — sequence homology (orthol- 
ogous/paralogous) -— 3D _ similarity and 
complementarity 
3. Biological link — substrate—product relations 
in enzymatic reactions — interacting molecules 
in a cell neighboring in the genome 
The WIT (what is there) (http://wit.mcs.anl. 
gov/WITs2) system has been designed to support 
comparative analysis of sequenced genomes and 
to generate metabolic reconstructions. WIT 
works based on modules from EMP/MPW fam- 
ily of databases. The Enzymes and Metabolic 
Pathways (EMP) database is an encoding of the 
contents of more than 10,000 original research 
publications on the topics of enzymology and 
metabolism; a project, which was _ started 
November 2006, uses the model bacterium 
Bacillus subtilis (B. subtilis) to gain insight into 
the global structure of the regulatory networks 
that control bacterial metabolism. 
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14.4.1 Applications of Metabolic 
Network Models 


1. Predict drug targets for metabolic disorders 

2. Dispensability of metabolic genes 

3. Robustness and evolution of metabolic 
networks 

4. Bioengineering and overproduction of desired 
compounds 

5. Knockout lethality 

6. Nutrient uptake/secretion rates 


14.5 Transcriptional Networks 


Transcriptional control is embedded into a hierar- 
chical flow of information from gene to pheno- 
type, in which many regulatory steps occur 
(Fig. 14.5). Data need to be generated about the 
network components at all the levels of the infor- 
mation flow, in order to understand, at the system 
level, the global regulation of gene transcription 
in bacteria. The modeling experimental strategy 
developed in B. subtilis model, when validated, 
can lead to an understanding of regulatory net- 
works controlling pathogenesis in disease- 
causing bacteria. The BaSysBio has started on 
November 2006 and has been developing and 
implementing high-throughput technologies for 
the quantitative determination of the cellular 
transcriptional responses. To discover and moni- 
tor the relevance of each protein to a biological 
process in the cellular networks, it 1s important to 
identify where, when, and to what extent a pro- 
tein 1s expressed. Two of the tools which are 
extensively used for protein characterization are 
(a) microarrays and (b) mass_ spectrometry 
approaches. A microarray is a set of short 
expressed sequence tags (EST) made from a 
cDNA library of a set of known gene loci or par- 
tial known loci. The ESTs are spotted onto a 
cover-slip-sized glass plate. A complete set of 
mRNA transcripts is prepared from the tissue of 
an experimental treatment or an individual with 
breast cancer. Complementary DNA reverse tran- 
scripts are prepared and labeled with red fluores- 
cent. A control library is prepared from an 
untreated source or noncancerous breast tissues; 
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Fig.14.5 Schematic representation of transcriptional signaling 


this library is labeled with different fluorescents. 

The experimental and control libraries are hybrid- 

ized to the microarray. Relative gene expression 

is measured as the ratio of the two fluorescence 
wavelengths. 

(a) Upregulation: Genes in the experimental 
transcriptome relative to the control will be 
visualized as a hotter red. 

(b) Downregulation: Genes in the experimental 
transcriptome show less green fluorescence 
than control. 

(c) Constituent expression: Genes in the experi- 
mental transcriptome exhibit equal expres- 
sion to that of control set, genes to control 
ratio is 1:1; neutral black fluorescence will 
be observed. 

DNA microarray technology can accomplish 
this because mRNA and protein concentration 
are often correlated. In mass _ spectroscopy 
approach method, mixture of proteins in cellular 
extract 1s resolved, and individual proteins are 
identified using MALDI-TOF and MS peptide 
finger printing. Although in theory mass spectro- 
scopic approaches have the potential to charac- 
terize the entire protein complement of a cell, in 
practice it has proved difficult to identify proteins 
of low abundance. Systems biology plays a major 
role in providing connecting links in whole cell 
protein networks. 


14.6 Cellular Signaling Networks 


Cell signaling is a complex system of communi- 
cation that governs the basic cellular activities 
and actions. The cell converts one signal to 
another in a network fashion. In these networks, 
proteins are nodes and edges between them are 
directed. Building complex models of signaling 
networks is best accomplished in a stepwise man- 
ner (Fig. 14.6). Systems biology plays a major 
role in understanding complex networks with 
limited experimental data also. In this case 1t may 
help in identifying the types of data that are still 
unknown and are needed for better understanding 
the system. For example, when modeling growth 
factor activation of MAP kinase, one may use 
input of activated RAS. Thus, the output may be 
dually phosphorylated MAPK. Several modeling 
environments are available that can be used to 
develop kinetic simulation out of the three 
computer-based programs, which help model the 
cell signaling networks with ease. 

(a) GENESIS/Kinetikit: It makes cellular net- 
works based on assumption that “well- 
stirred” reactions where every molecule has 
equal access to any other molecule. It runs 
with large-scale General Neural Simulation 
System (GENESIS). 

(b) Gepasi 3: Gepasi 3 is a modeling platform 
that allows the simulation of signaling path- 
ways. It runs on Windows operating system 
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Fig.14.6 Depiction of wiring diagram for the cell signal- 
ing information network 


and it performs steady state and time-course 
simulations. It has parameter scan capability. 
The user can select parameters that will be 
varied, and the range and extent of variation 
can be compared with experimentally 
observed data. This feature can be used to 
optimize unknown parameters. 

(c) Virtual Cell: The model is built by using a 
graphical interface that allows the user to 
determine the geometry of the model. This 
usually involves a collection of molecules, 
with their respective parameters and subcel- 
lular localizations. It gives the output in dif- 
ferent formats: spreadsheets, pseudocolor 
images, and QuickTime movies. 


14.7 Systems Biology 
and Personalized Medicine 


in the New Era 


Starting from the last century, advances in high- 
throughput biological data capturing instruments 
such as DNA sequencers, microarrays, mass 
spectroscopy, electron microscopy, X-ray crys- 
tallography, and NMR made it possible to rou- 
tinely perform structural and functional genomics 
on complex biological samples. In isolation the 
study of genes, transcripts, proteins, and metabo- 
lites in the context of human physiology and 
pathology has been challenging, because these 


S.R. Sagurthi et al. 


molecular entities must ensemble into molecular 
machinery and coordinate in molecular pathways 
to play their role inside the cells. Systems biology 
studies can help interpret molecular functions and 
bridge molecular understanding of human diseases. 
Electronic databases play a central role in storing 
and analyzing clinical data, which has been useful 
for conventional clinical decision-making. The 
establishment of connections between clinical data 
and molecular data requires integrative models to 
help enhance medical decision-making. In the near 
future, development of systems biology may help 
physicians to prescribe a drug to their patients 
based on “proteomics profiling” and “genomics 
profiling.’ This may help in determining correct 
subtype of the disease. The “genome-wide geno- 
typing” test of patients would help to look for pos- 
sible genetic variability that helps in determining 
the right dosage. Personalized medicine has been 
adapting to a new economic reality (Osnabrugge 
et al. 2014). Given that ~60 % of the prescriptions 
written produced desired benefits, 30-40 % of pre- 
scribed medicine do not help and up to 7 % have 
hard or negative impact on patients. Now recent 
advances in systems biology and ensemble of 
omics (genomics, proteomics, transcriptomics, and 
metabolomics) made physicians to predict accurate 
drug targets which can virtually guarantee patient 
that the drug they were prescribed would help them 
without causing any side effects. 


14.8 Reverse Vaccinology 
and Pan-genomics 


Classical approaches to vaccine development are 
time consuming and economically costly, biased 
toward the identification of several antigens 
which may not lead to immunity. It is also 
dependent on the ability to cultivate the pathogen 
under laboratory conditions. The recent advances 
of systems biology have led us into a new era of 
vaccinology. The “omics” field has provided us 
with a continuous flow of data to look at host- 
pathogen interactions in a holistic way. Genome 
sequences of many pathogens including viruses 
are available. In the future, many vaccines previ- 
ously thought impossible can be designed by 
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using reverse vaccinology (based on pathogen 
genome information) method (Fig. 14.7). In addi- 
tion, the comparison of the genomes of multiple 
strains of a single pathogen, known as “pan- 
genomics,” has expanded our understanding of 
virulence (Tettelin 2009). In the coming years, 
systems biology will play a major role in solving 
deadly diseases by integrating information from 
other disciplines. 


14.9 Single Nucleotide 
Polymorphism (SNP) 
and HapMap Applications 


in Mapping Disease Loci 


SNP is a genetic variation in a person’s DNA 
sequence that occurs when a single nucleotide is 
replaced by any one of the other three nucleo- 
tides. Identification of genes that confer disease 
susceptibility can be found by studying DNA 
markers such as SNP associated with a disease 
trait. Genome-wide association study (GWAS) 
offers a systematic analysis of the linkage of 
thousands of SNPs with a quantitative complex 
trait. The methodology was successfully applied 
to a wide variety of common diseases and traits 
and has generated valuable findings that have 
improved the understanding of the genetic basis 
of many complex disease traits. For example, in 
recent studies on lipid metabolism by genome 
wide analysis, found 19 loci are being associated 
with LDL/HDL or triglyceride levels. 

The HapMap 1s a catalog of common genetic 
variants that occur in human beings. The elucida- 
tion of the entire human genome has made pos- 
sible our current efforts to develop a haplotype 
map (Fig. 14.8). The “HapMap” is a tool that 
allows researcher to find genes and genetic varia- 
tions that affect human health. The HapMap is 
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valuable for it reduced the number of SNPs 
required to examine the entire genome for asso- 
ciation with a phenotype from the ten million 
SNPs that exist to roughly 500,000 tag SNPs. 
This allows the development of test to predict 
which drugs or vaccines would be most effective 
in individuals with particular genotypes. 


14.10 Conclusions 


Computational biology and experimental biology 
for many decades have been separate disciplines. 
The dramatic progress of molecular biology and 
completion of the sequences of many genomes 
have provided us with a lot of data to be inter- 
preted. This required an integration of theoretical 
and experimental approaches at the highest lev- 
els. This integrated field rapidly evolved into sys- 
tems biology. The present dominant research 
paradigm of one laboratory-one gene-one 
function-one pathogen has reached its limits in 
many areas, as disease arises from a combination 
of several factors. A systems approach to life sci- 
ences is often necessary for better qualitative and 
quantitative understanding of biological systems. 
Systematic understanding of complex biological 
processes will allow taking many real-world 
problems. This is not achieved by better data 
alone. Rather, the way the model is built and soft- 
ware used makes the calculations critical. 
Systems biology might therefore be one of the 
key approaches of the twenty-first century in pro- 
viding solutions to cellular biology, physiology, 
and medicine and also for pharmaceutical and 
biotechnology industries. 
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Fig.14.7 Diagrammatic representation of conventional, reverse, and pan-genome reverse vaccinology 
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Abstract 

In addition to natural embryogenesis termed “‘apomixis” in some species, 
plant cells can also be induced to form embryos in plant tissue culture. 
Such embryos are called somatic embryos and can be used to generate 
plants from single cells. Plants possess cellular totipotency which makes it 
feasible for individual somatic cells to regenerate into a whole plant 
in vitro, a process termed “somatic embryogenesis” (SE). Development of 
somatic embryos closely resembles that of zygotic embryos, both morpho- 
logically and physiologically. Since the first reports of this phenomenon in 
1958 in carrot, SE has been reported in innumerable plant species. SE with 
a low frequency of chimeras, high efficiency of regenerates, and limited 
occurrence of somaclonal variation is more attractive than organogenesis 
as a plant regeneration system. This can be successfully applied to plant 
transformation, in vitro mutagenesis, and selection. Several factors, 
however, are known to influence initiation and progression of somatic 
embryogenesis in plants. 
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through a series of morphological and biochemical 
changes that result in the production of a bipolar 
structure without vascular connection with the 
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regenerants, and a limited level of somaclonal 
variation 1s more attractive than organogenesis as 
a plant regeneration system (e.g., in genetic trans- 
formation, in vitro mutagenesis, and selection). 
However, several factors influence initiation of 
SE in plants. Most of the important crops and 
grasses are recalcitrant to in vitro culture, which 
hampers the development of reliable regeneration 
techniques in these plant groups. This phenomenon 
was first reported in carrot (Steward et al. 1958), 
and ever since, SE has been reported in innumer- 
able plant species. Somatic embryogenesis 
poses even greater challenges in recalcitrant 
plants; e.g., two plant species, habanero chili 
(Capsicum chinense Jacq.) and coconut palm 
(Cocos nucifera L.), have been studied in some 
detail. Several factors, however, are known to 
influence initiation and progression of somatic 
embryogenesis in plants (Yalcin Elidemir et al. 
2007; Perrin et al. 2001). 

Somatic embryos are formed from plant cells 
that are not normally involved in the develop- 
ment of (zygotic) embryos, 1.e., ordinary plant 
tissue. No endosperm or seed coat is formed 
around a somatic embryo. Applications of this 
process include clonal propagation of genetically 
uniform plant material; elimination of viruses; 
provision of source tissue for genetic transforma- 
tion; generation of whole plants from single cells 
called protoplasts; development of synthetic seed 
technology, etc. 

Cells derived from competent source tissue 
are cultured to form an undifferentiated mass of 
cells, called a callus. Plant growth regulators in 
tissue culture media can be manipulated to induce 
callus formation and, subsequently, changed to 
induce embryos to form from the callus. Ratio of 
various plant growth regulators required for 
induction of callus or embryo formation varies 
with type of the plant. Asymmetrical cell division 
also appears to be important in developing 
somatic embryos; and, while failure to form the 
suspensor cell is lethal to zygotic embryos, it is 
not lethal to the somatic embryos. 
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15.2 Plant Regeneration: Somatic 
Embryogenesis Versus 
Organogenesis 


Organogenesis and somatic embryogenesis, two 
patterns of in vitro differentiation, are distinctly 
different processes that also occur in nature. 
Adventive embryony within ovules of polyem- 
bryonic plant species, including many fruit and 
forest trees, is identical with in vitro somatic 
embryogenesis. Adventitious shoot meristems 
and adventitious roots that develop in vivo from 
many different tissues are identical with in vitro 
organogenesis. Since somatic embryogenesis and 
organogenesis reflect different developmental 
events, Ammirato (1985) suggested that it is 
likely that the two processes are mutually exclu- 
sive, so that cells can be committed to only a 
single pathway. 


15.2.1 Somatic Embryogenesis 


Somatic embryo is an embryo formed in vitro from 
somatic cells or embryogenic cells by mitotic cell 
division. “Mature somatic embryo,’ however, is a 
fully developed embryo with evidence of root and 
shoot apices and exhibits a bipolar structure. 
Preferred mature somatic embryos are those with 
well-defined cotyledons. Rangaswamy (1986) and 
Ammirato (1989) have reviewed the phenomenon 
of somatic embryogenesis. 

Development of somatic embryos directly from 
intact explants (without an intervening callus 
phase) is known as primary direct somatic embryo- 
genesis (Ammirato 1989). According to Quiroz- 
Figueroa et al. (2006), somatic embryos originate 
by two pathways: (1) unicellular and (11) multicel- 
lular. When somatic embryos have unicellular 
origin, cell division is seen to be coordinated, and 
the somatic embryo is sometimes connected to the 
maternal tissue by a suspensor-like structure. In 
contrast, somatic embryos of multicellular origin 
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are observable initially as a protuberance, showing 
no coordinated cell division, and those somatic 
embryos here that are in contact with the basal 
area are typically fused to the maternal tissue. 
Srinivasan and Mullins (1980) demonstrated SE 
from unfertilized ovules of seedless grapes, while 
Salunkhe et al. (1997) developed SE and subse- 
quent plantlet formation from tendrils of grape 
plants. Carimi et al. (2005) could induce somatic 
embryogenesis in stigmas and styles of grapevine. 
Ontogenesis, differentiation, and precocious 
germination in anther-derived somatic embryos 
of grapevine through proembryogenesis have also 
been studied (Faure et al. 1996). 


15.2.2 Organogenesis 


Cersosimo et al. (1990) reported organogenesis, 
embryogenesis, and plant regeneration from cul- 
tures of Vitis. Organogenesis has also been reported 
by Mezzetti et al. (2002) wherein they used the 
technique for genetic transformation of grape. 

Two different cultivars and two rootstocks 
were used in the study of Bayir et al. (2007) 
where they studied the effect of genotype on cal- 
lus formation and organogenesis. MS medium 
supplemented with BAP and 2, 4-D was found to 
be the best for callus formation for all grape cul- 
tivars/rootstocks used. 

Venkatachalam et al. (2006) induced direct 
shoot and cormlet regeneration from leaf explants 
of “silk” banana (AAB), while Novak et al. 
(1989) induced indirect somatic embryogenesis 
and plant regeneration in suspension cultures of 
dessert (AA and AAA) and cooking (ABB) 
bananas (Musa spp.). 

A procedure for the regeneration of Vitis 
plantlets by organogenesis (Kwon et al. 2000) 
and in rootstocks (Clog et al. 1990) from foliar 
tissues was described earlier. Leaves from mature 
plants grown in growth chambers, or from 
plantlets grown in tubes, were wounded with a 
scalpel and cultured on modified Murashige and 
Skoog liquid medium containing different 
concentrations of benzylaminopurine. Presence 
of benzylaminopurine was required for shoot 
formation. Age of the source explant, composition 
of the culture medium, and culture incubation 
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temperature were found to be important parameters 
for regeneration. 

Kwon et al. (2000) optimized medium com- 
position and culture conditions for plant regen- 
eration through organogenesis from leaf explant 
in “Kyoho” grape (Vitis labrusca x V. vinifera). 
Adventitious shoot formation from in_ vitro- 
cultured leaf segments was affected by basal 
medium, plant growth regulator, and light condi- 
tion. Within the first 7 days of culture on shoot 
regeneration medium, the petiolar stub of explants 
was swollen and formed callus. Adventitious 
structures formed on the petiolar stub after 
14 days. Among plant growth regulator treatment, 
2.0 mg/L BA and 0.01 mg/L IBA produced the 
highest shoot regeneration frequency. Adventitious 
shoots were regenerated effectively on the 
modified MS medium such as 1/2 TMS and 1/2 
MMS. Among three gelling agents, agar showed 
higher shoot regeneration than agarose and 
Phytagel. The dark incubation for 3-4 weeks at 
the initial culture periods was effective for shoot 
regeneration. In conclusion, the most excellent 
shoot regeneration from leaf explants was 
obtained with dark incubation of initial 3 weeks 
on 1/2 TMS medium with 2.0 mg/L BA, 0.01 mg/L 
IBA, and 8 g/L agar, in which the regeneration 
frequency reached up to 80 %. The regenerated 
shoots rooted successfully in MS medium with 
0.1 mg/L IAA, and plantlets were acclimated. 

In addition to the above, the “embryonic condi- 
tion” is considered to also occur in buds that form 
on stems. The buds have tissue that has differenti- 
ated but not grown into complete structures. They 
can be in a resting state and lying dormant over the 
winter or when conditions are dry and, then, 
commence growth when conditions turn favorable. 
Before they start growing into stem, leaves, or flow- 
ers, the buds are said to be in an “embryonic” state. 


15.2.3 Factors Governing Somatic 
Embryogenesis 


Several factors are known to influence induction 
of somatic embryogenesis (Gaj 2004) and matu- 
ration of the so-formed somatic embryos. Chief 
among these are: 

1. Genotype of plant 
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. Type of plant (plant species) 

. Plant/explant age 

. Developmental stage of the explant 

. Physiological status of the explant donor 

plant/explant (including endogenous levels of 
plant growth regulators) 

6. Environment external to the explant, viz., 
composition of culture medium (chemical), 
and physical culture conditions (incubation 
temperature, quantity/quality/duration of light) 
Polar transport of auxin in early globular 

embryos is essential to establish bilateral symme- 

try during plant embryogenesis. Interference 
with auxin transport causes failure of transition 
from the axial to bilateral symmetry, thereby 
resulting in embryos with fused cotyledons (Liu 

et al. 1993). 

Details of factors governing SE are elaborated 
later in the chapter under grape, wherein the crop 
has been treated as a case study. 
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15.2.4 Genes Involved in Somatic 
Embryogenesis 


Some genes have been shown to be related to 
expression of somatic embryogenesis, such as 
WUSCHEL (WUS), Baby Boom (BBM), and 
SERK (somatic embryogenesis receptor kinase). 
The last of these three, SERK, has been detected 
in early stages of the process of somatic embryo- 
genesis and forms a subgroup in the leucine- 
rich repeat receptor-like kinases (LRR-RLKs) 
comprising the largest subfamily of RLKs in 
plants. These genes are also related to key processes 
in plant growth (Sharma et al. 2008). 


15.3. Grape: A Case Study 


Somatic embryogenesis is the most common 
regeneration system for grapevine (Vitis spp.) 
(Gribaudo et al. 2007). After rice, the second 
food crop to be fully sequenced genomically is 
the grape (var. Pinot Noir). This is viticulture’s 
equivalent of the first moonwalk. The genetic 
blueprint will make it possible to develop new, 
more resistant varieties (www.biotech-weblog. 
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com). Thus, grape is the first fruit crop genome to 
be mapped. The far-reaching consequences of 
having sound regeneration systems in grape to 
plumb the wealth of genetic information, therefore, 
cannot be undermined. However, implementation 
of these strategies for grape improvement requires 
the ability to manipulate and achieve regeneration 
from cells and tissues in vitro. Somatic embryo- 
genesis being central to such manipulations 
encompasses culture initiation, maintenance, 
manipulation of somatic embryos, and plant 
development and, finally, incorporates specific 
uses of embryogenic culture systems. For perennial 
embryonic culture system (e.g., grape), several 
methods have proven effective for regeneration 
by somatic embryogenesis. 


15.3.1 Regeneration System 
in Grapes 


Somatic embryogenesis of grapevine has been 
described in several instances. The first descrip- 
tion of regeneration of plantlets by the route of 
somatic embryogenesis was reported for an 
ancient clone of grapevine (Vitis vinifera L. cv. 
Cabernet Sauvignon) in liquid culture conditions 
(Coutos-Thevenot et al. 1992). Other cultivars 
have been shown to be competent for somatic 
embryo production, and different explants like 
leaves, anthers, zygotic embryos, and ovaries 
were used for somatic embryogenesis induction 
(Rajasekaran and Mullins 1983). Somatic 
embryogenesis was observed by Salunkhe et al. 
(1997) in callus initiated from tendril explants of 
Vitis vinifera L. cvs. Thompson, Sonaka, and Tas- 
e-Ganesh on Emershad and Ramming medium 
supplemented with 1 uM 6-benzylaminopurine. 
Current methods for improving grapevines are 
time-consuming and labor intensive. For example, 
genetic improvement in grapes through conven- 
tional breeding is severely limited by a number of 
factors such as long prebearing age and varying 
ploidy levels. Cultivated grapes are also highly 
heterozygous and do not generally breed true 
from seeds. Moreover, grape breeding programs 
are expensive, long-term projects. Although plant 
biotechnology is an attractive alternative for 
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genetic improvement of grapes, in vitro genetic 
manipulation can be addressed only if there is an 
effective regeneration system. Accordingly, 
methods that reduce any of these problems would 
represent a significant advancement in the art. 

Methods for growing perennial grape embryo- 
genic cultures, and for growing large quantities 
of somatic grape embryos from such perennial 
embryogenic cultures, in a relatively short period 
using liquid suspension culture are available. 
Several advantages are provided by the present 
methods. These approaches, for example, facilitate 
an extraordinarily high frequency of somatic 
embryo formation and plant regeneration. Such 
frequencies have not been previously reported for 
grapevine regeneration of any known cultivar and 
render the method useful for large-scale production 
of clonal planting stock of grape plants. In addi- 
tion, the methods produce embryos free of such 
common abnormalities as fusion and fasciations 
of somatic embryos. The methods of the inven- 
tion also result in enhanced embryogenic culture 
initiation frequency, allowing for the production 
of highly embryogenic cultures that can then be 
successfully carried through the subsequent 
stages of the regeneration process to the whole 
plant level. Because of these advantages, the 
methods of the invention are especially useful in 
the application of biotechnology for the genetic 
improvement of this crop. 

Production of an embryogenic perennial cul- 
ture of Vitis vinifera cv. Thompson Seedless: 
mother plant-derived cultures were obtained 
from a leaf that was surface disinfected and inoc- 
ulated onto culture initiation medium. 

Subculture of embryogenic cultures typically 
results in the formation of the somatic embryos. 
Embryogenic cultures and early-stage somatic 
embryos obtained from these cultures are then 
further cultured in a suitable liquid plant growth 
medium, for example, a plant tissue culture nutrient 
medium consisting of B5 medium devised by 
Gamborg et al. in 1968 (Sigma Chemicals, St. 
Louis, MO) that has been modified as described 
by DeWald and coworkers (J Amer Soc Hort Sci 
114: 712-716, 1989) and Litz et al. 1995 
(“Somatic embryogenesis in mango,’ 1995, 
supra). This modified medium consisted of B5 
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major salts, MS minor salts and vitamins, gluta- 
mine (about 400 mg/L), and sucrose or commer- 
cial table sugar (about 60 g/L). 

In the experiments of Emershad and Ramming, 
somatic embryo formation was shown to occur 
from immature zygotic embryos within ovules of 
stenospermoearpic seedless grapes (Vitis vinifera 
L.) when cultured for 2 months on liquid 
Emershad/Ramming medium. Somatic embryos 
continued to proliferate after excision and transfer 
to Emershad/Ramming medium supplemented 
with 1 pM benzylaminopurine and 0.65 % TC 
agar. Plant development from somatic embryos 
was influenced by genotype, medium, phase 
(liquid, agar), stage (torpedo, mature), and their 
interactions. Optimal plant development occurred 
on Woody Plant Medium supplemented with 
1.5 % sucrose + 1 wM benzylaminopurine + 
0.3 % activated charcoal and 0.65 % TC agar. 

Cultures are scanned weekly for the presence 
of emerging embryogenic cells or embryogenic 
cell masses. Embryogenic cells or cell masses are 
identified based on morphology. Embryogenic 
cell masses, in general, tend to be white to pale 
yellow in color and are often hyaline. They may 
be recognized from a very early, small stage (10— 
20 cell aggregates), based on color/friable, gran- 
ular appearance. Embryogenic cultures are also 
identified by their compact nature, containing 
cells rich in cytoplasm, as seen under the micro- 
scope. A multitude of factors affect somatic 
embryogenesis including genotype, nature and 
type of explant, medium composition, and season 
of harvest. Cultures can be prompted to develop 
for enhanced embryogenesis or diminished 
embryogenesis, or more unorganized embryogenic 
cell growth, by repeated manipulation of the cul- 
ture, which entails careful selection of embryo- 
genic cells and cell masses during transfer. 

Ever since Mullins and Srinivasan (1980) and, 
later, Gray and Mortensen (1987) demonstrated 
initiation and long-term maintenance of somatic 
embryogenesis in grape, several success stories 
have been reported. Petioles, tendrils, leaf and 
nodal explants, anthers, ovaries, and unfertilized 
ovules have all been used for inducing somatic 
embryogenesis (Croce et al. 2005; Perrin et al. 2004). 
Suspension cultures too have been successfully 
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used to obtain high-efficiency somatic embryo- 
genesis and plantlet production (Jayasankar et al. 
1999). 

Lopez-Perez et al. (2006) achieved plant 
regeneration from somatic embryogenesis 1n cvs. 
Sugraone, Crimson Seedless, and Don Mariano. 
Conversion rate to plantlets was higher in medium 
with IAA and GA;. Medium-dependent response 
(e.g., use of activated charcoal, silver nitrate, etc.) 
has also been reported by other workers. Indirect 
somatic embryogenesis using stem nodal explants 
was obtained for six clones of Vitis vinifera culti- 
vars, including rootstocks (Maillot et al. 2006). 
Gribaudo et al. (2007) identified the best devel- 
opment stage to initiate anther culture and 
improve somatic embryo germination by some 
treatments and optimized two key steps of grape 
somatic embryogenesis. Goussard et al. (1991) 
elucidated the role of in vitro somatic embryo- 
genesis in eliminating fan leaf virus and leaf roll— 
associated viruses from grapevines. 

Grapevine somatic embryos have also been used 
for the selection of toxin-resistant strains (Jayasankar 
et al. 2000). Das et al. (2002) developed an efficient 
leaf-disc culture method for transforming grape via 
somatic embryogenesis. Very often, patents are 
sought from researchers successfully developing 
protocols for somatic embryogenesis in crops/ 
plants, particularly in recalcitrant/woody/diffi- 
cult-to-manipulate systems, for example, walnut. 


15.3.2 Regeneration System 
for Grape and Uses Thereof 


Method of producing a mature somatic embryo: 
The method includes the steps of (a) providing a 
liquid culture that includes an embryogenic cell, 
(b) recovering embryogenic cell from the culture, 
(c) transferring the embryogenic cell to a second 
culture, and (d) growing a mature somatic grape 
embryo from the embryogenic cell. 

Culture Initiation: Embryogenic cultures were 
initiated from anthers and ovaries of the cultivar 
“Chardonnay” (clones CH 01 and CH 02) and 
from the leaves of the cultivar “Thompson 
Seedless” according to standard methods, e.g., 
those described herein. 
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Preferably, the embryogenic cell is obtained 
from anthers, ovaries, ovules, floral tissue, vege- 
tative tissue, tendrils, leaves, roots, nucellar tis- 
sue, stems, seeds, protoplasts, pericycle, apical 
meristem tissue, embryogenic tissue, somatic 
embryos, or zygotic embryos. In still other pre- 
ferred embodiments, the liquid culture medium 
of step (a) includes a plant growth regulator (e.g., 
an auxin, NOA (naphthoxyacetic acid), indole-3- 
acetic acid (IAA), dicamba, picloram, naphthale- 
neacetic acid (NAA), 2,4-dichlorophenoxyacetic 
acid (2,4-D), a cytokinin, benzyladenine (BA), 
thidiazuron, zeatin, abscisic acid (ABA), or gib- 
berellic acid (GA)). In other preferred embodi- 
ments, the initiation culture medium includes at 
least 0.01 mg/L of a cytokinin, at least 0.1 mg/L 
of a carbohydrate, and at least 0.1 mg/L of a 
nitrogenous compound. Exemplary carbohy- 
drates include sucrose, glucose, maltose, and 
glycerol. Exemplary nitrogenous compounds 
include potassium nitrate, calcium nitrate, ammo- 
nium nitrate, and ammonium sulfate. If desired, 
the embryogenic cell or the somatic embryo can 
be transformed with DNA (e.g., DNA encoding a 
gene which is capable of conferring disease resis- 
tance). The embryogenic cell can be recovered 
from the first culture by filtration, sedimentation, 
or selection. In addition, the first culture can be 
sieved to synchronize the formation of differenti- 
ated somatic grape embryos. Preferably, the 
embryogenic cell of step (a) is obtained from a 
source selected from the group consisting of an 
embryogenic cell produced according to steps (a) 
and (b) of the first aspect; a somatic grape embryo 
produced according to steps (a) to (d) of the first 
aspect; a mature somatic embryo produced 
according to method of the first aspect; a plantlet 
according to the fifth aspect; and anthers, ovaries, 
ovules, floral tissue, vegetative tissue, tendrils, 
leaves, roots, nucellar tissue, stems, seeds, proto- 
plasts, pericycle, apical meristem tissue, embryo- 
genic tissue, somatic embryos, or zygotic 
embryos from a plant produced from a mature 
somatic embryo produced according to the 
method of the first aspect. In preferred embodi- 
ments, the method further includes transferring 
the somatic grape embryo to a germination 
medium to grow a grape plantlet. 
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15.3.3 Method for Producing 
Somatic Grape Embryos 


The method includes the general steps of 
(a) providing a liquid culture comprising an 
embryogenic cell, (b) selection of embryogenic 
cell from the culture, (c) transferring the embryo- 
genic cell to a second culture, and (d) growing a 
somatic grape embryo from the embryogenic cell. 

In a third aspect, the invention features a 
method of producing a somatic grape embryo. 
The method includes the general steps of (a) pro- 
viding a liquid culture that includes an embryo- 
genic cell and medium consisting of B-5 major 
salts and MS minor salts, (b) recovering the 
embryogenic cell from the culture, (c) transfer- 
ring the embryogenic cell to a second culture, 
and (d) growing a somatic grape embryo from the 
embryogenic cell. 


15.3.4 Plantlet Germinated 
from the Mature Somatic 
Grape Embryo 


A method for long-term storage of a mature 
somatic grape embryo: The method includes the 
general step of drying the embryo and storing the 
embryo at a temperature less than 8 °C. Preferably, 
the mature somatic grape embryo is produced by 
the method of the first aspect. In a seventh aspect, 
the invention features a method for direct seeding 
of a somatic grape embryo. The method includes 
the general step of placing the said somatic 
grape embryo in potting medium comprising of 
sand and potting mixture. In an eighth aspect, 
the invention features a method of producing a 
somatic grape embryo using a liquid culture 
medium from a perennial grape embryogenic 
culture. The method includes the steps of (a) cul- 
turing a perennial grape embryogenic culture in a 
first liquid culture medium to grow a cellular sus- 
pension culture, the first liquid culture medium 
including a plant growth regulator; (b) recovering 
an embryogenic cell or embryogenic cell mass 
from the cellular suspension culture; and (c) cul- 
turing the embryogenic cell or embryogenic cell 
mass from the cellular suspension culture of step 
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(b) in a second liquid culture medium to produce 
a somatic grape embryo. In preferred embodi- 
ments, the method further includes transferring 
the somatic grape embryo to a germination 
medium to grow a grape plantlet. In other pre- 
ferred embodiments of the eighth aspect, the 
plant growth regulator is an auxin, NOA, BA, 
zeatin, dicamba, picloram, NAA, JAA, 2, 4-D, a 
cytokinin, thidiazuron, ABA, or GA. In yet other 
preferred embodiments, the liquid cell culture of 
step (a) is subcultured, and the recovery of the 
grape cell or grape cell cluster from the cellular 
suspension culture includes filtration, sedimenta- 
tion, or selection. In other preferred embodiments, 
the second liquid culture medium further includes 
a plant growth regulator (e.g., a cytokinin). The 
liquid cell culture of step (c) can be sieved to 
synchronize the formation of differentiated 
somatic grape embryos. If desired, the somatic 
grape embryo or embryogenic cell of step (a) can 
each be transformed with DNA. The embryo- 
genic cell or embryogenic cell mass recovered 
in step (b) may also be transformed with 
DNA. Similarly, the recovered somatic embryo 
may be transformed with DNA. 

In about 6-8 weeks, a fine cell suspension 
culture is produced, which consists of highly 
vacuolated elongated cells (nonembryogenic 
cells), and also a lesser number of small, 
cytoplasm-rich, isodiametric cells (embryogenic 
cells). Repeated transfer of embryogenic cells or 
cell masses has not only been found to enrich 
the growth of embryogenic tissue but also to 
facilitate the process of somatic embryogenesis. 
The cultures are perennial in that they typically 
persist for over 2 years. 

Embryogenic suspensions of grapevine (Vitis 
vinifera L.) were initiated from somatic embryos 
of “Thompson Seedless” and “Chardonnay” by 
Jayasankar et al. (1999). Suspension cultures 
consisted of proembryonic masses (PEM) that 
proliferated without differentiation in a medium 
containing 2,4-dichlorophenoxyacetic acid (2,4- 
D). “Chardonnay” somatic embryos developed 
fully from PEMs following subculture in medium 
without 2,4-D; however, somatic embryo devel- 
opment did not advance beyond the heart stage 
in “Thompson Seedless” suspension cultures. 
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Highly synchronized development of somatic 
embryos was obtained by inoculating <960-um 
PEMs into liquid medium without 2,4-D. Somatic 
embryos were also produced in large numbers 
from suspension-derived PEMs of both cultivars 
on semisolid medium lacking 2,4-D. Somatic 
embryos matured and regenerated into plants in 
MS basal medium containing 3 % sucrose. Using 
this method, over 60 % somatic embryos regener- 
ated plants, of which more than 90 % plants were 
successfully transferred to the greenhouse. 

Somatic embryogenesis and subsequent dip- 
loid plants were obtained from anthers of Vitis 
vinifera Cabernet Sauvignon, a cultivar consid- 
ered recalcitrant to in vitro regeneration (Mauro 
et al. 1986). Anthers enclosing microspores near 
the first pollen mitosis were found to be the most 
responsive. However, from a practical point of 
view, anther length proved to be an easier criterion 
to determine the optimal physiological stage of 
the anther. Calluses derived from somatic tissues 
of the anther produced embryoids only when 
cultured on medium supplemented with casein 
hydrolysate. Glutamine and adenine were found 
to stimulate embryoid production. Early removal 
of cotyledons increases the frequency of develop- 
ment of embryoids into normal plantlets. 
Immature inflorescences of coconut belonging to 
three different genotypes were cultured by 
Jean-Luc et al. (1994) on a solid medium sup- 
plemented with activated charcoal (2 %) and 
2,4-dichlorophenoxyacetic acid (2,4-D) concen- 
trations ranging from 1.5 to 3.5 x 10-* M. Globular 
white callus is formed from immature floral 
meristems, depending on inflorescence age and 
2,4-D concentration. Acquisition of embryogenic 
competence was_ confirmed histologically. 
Somatic embryos presented a functional bipolar 
organization with a completely differentiated 
shoot meristem, reported for the first time in 
coconut tissue culture. Embryo maturation 
allowed reliable plant regeneration in this in vitro 
recalcitrant species. Details of exogenous hor- 
monal requirements were worked out for acquisi- 
tion of embryogenic competence and embryo 
maturation. 
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15.4 Other Plant Species 


A new protocol was developed by Jiahe et al. 
(2004) for a highly efficient somatic embryogen- 
esis and plant regeneration of ten recalcitrant 
Chinese cotton cultivars. Calluses and embryo- 
genic calluses were induced on MSB1 medium 
containing optimal combination of indolebutyric 
acid (IBA; 2.46 uM) and kinetin (KT; 2.32 uM). 
Up to 86.7 % of embryogenic calluses differenti- 
ated into globular somatic embryos 2 months 
after culture on MSB2 medium containing 
double the KNO; and free of growth regulators. 
Up to 38.3 % of the somatic embryos converted 
into complete plants in 8 weeks on MSB3 
medium with L-asparagine (Asn)/L-glutamine 
(Gln) (7.6/13.6 mM). The plants were success- 
fully transferred to soil and grew to maturity. 
With this protocol, they obtained hundreds of 
regenerating plantlets from ten recalcitrant culti- 
vars, which is important for the application of tis- 
sue culture to cotton breeding and 
biotechnology. 

An improved protocol was also developed for 
somatic embryogenesis and plant regeneration 
in recalcitrant cotton cultivars. High callus fre- 
quency and embryogenic tissues were developed 
on MSB medium supplemented with gradient 
concentrations of KT and 2,4-D, with concentra- 
tion decreasing from 0.1 to 0.01 mg/L. Somatic 
embryos were successfully incubated in one-half 
macronutrient MSB_ suspension supplemented 
with 0.5 g L“! glutamine and 0.5 g L“ asparagine. 
Decrease in macronutrient concentration in MSB 
significantly alleviated browning and was benefi- 
cial to suspension cells. Transformation of 
somatic embryos into plants was induced on 
MSB medium supplemented with 3 % sucrose, 
0.5 g L' glutamine, 0.5 g L" asparagine, and 
6.0 g L"' agar. Effect of sucrose as the carbohy- 
drate source was better than glucose for plant 
germination. Regenerated plantlets from 
CCRI521 and Zhongzhi86-6 could be recovered 
at the rate of as much as 19.6 and 18.5 % somatic 
embryos, respectively. 
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Rao and Bahadur (1990) induced somatic 
embryogenesis and subsequent plant formation 
in distylous — self-incompatible Oldenlandia 
umbellata (Rubiaceae), a vegetable crop valued 
for its red pigment, from primary cultures of leaf, 
axillary bud, stem, and immature fruit explants. 
This protocol is useful for self-incompatible 
species that fail to produce seeds. 

Sahiyjram and Doreswamy (1988, 1991) 
induced morphogenesis and nucellar embryog- 
eny in monoembryonate mango cultivars and 
somatic embryogenesis in embryo cultures. 
Sahijram et al. (1996) studied factors influencing 
somatic embryogenesis in this crop. Ravindra 
et al. (2000) demonstrated somatic embryogene- 
sis in nucellus cultures of polyembryonic mango 
cv. “Vellaikolumban.” Sahijram et al. (2011) and 
Mirajkar et al. (2010) gave a protocol for plant 
regeneration from petal cultures in pomegranate. 
Sahyram (2014) also observed differential 
response to MS and Gamborg’s B5 media of 
petal-derived embryogenic calluses of pome- 
granate cv. Bhagwa with/without cocultivation 
with Agrobacterium, while Sahijram and Rao 
(2012) induced high-frequency regeneration 
from segmented root explant cultures in this 
fruit crop (Punica granatum L.). They also 
regenerated plantlets from petals of pomegranate 
for Agrobacterium-mediated transformation. 
Madhusudhana Rao and Sahijram (2013) studied 
the comparative response of epicotyl, hypocotyl, 
and cotyledonary explants for the induction of 
somatic embryogenesis in pomegranate. Somatic 
embryogenesis is also reviewed by Sahijram 
(2011) in grape. Regeneration systems developed 
in horticultural crops are reviewed by 
Vageeshbabu et al. (2007). 

Litz et al. (1995) opined that in mango, it was 
preferable to remove the somatic embryos from 
liquid medium as soon as they germinated preco- 
ciously and transfer these to semisolid medium. 
For somatic embryo maturation, germination, 
and plant regeneration studies, three maturation 
media — mango maturation medium, mango 
maturation medium solidified with agar (7 g/L), 
and MS basal medium with 3 % sucrose — were 
evaluated for their ability to promote somatic 
embryo germination and plant regeneration. MS 
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basal medium containing 3 % sucrose was the 
most effective at promoting embryo maturation, 
germination, and plant regeneration, for both 
embryos derived from semisolid medium and for 
the precociously germinated embryos obtained 
from liquid medium cultures. Although good 
germination was obtained by the authors on 
mango maturation medium with agar, quality of 
the regenerants was not as good as with MS salts 
containing 3 % sucrose in the medium. 

Genetic engineering of corn, and other cereals, 
is currently limited because of the difficulty of 
regenerating plants via cell culture. Hodges et al. 
(1986) reported commercially important inbred 
lines and hybrids from several different maize 
groups to be capable of regeneration via somatic 
embryos. Inbred line A188 had a high frequency 
of plant regeneration, and, when crossed to recal- 
citrant inbreds, the resulting Fl hybrids too were 
found to be “regenerable.” These results demon- 
strated that nuclear genes exhibiting dominance 
were important information of somatic embryos 
and regeneration of corn plants. 

Hakman and von Arnold (1988) regenerated 
plantlets from long-term embryogenic cultures of 
Picea glauca (Moench) Voss. (white spruce). 
Embryogenic calluses, initiated from immature 
zygotic embryos and maintained by monthly 
subculture for 16 months, were used to establish 
suspension cultures. Small somatic embryos 
were continuously produced in liquid culture 
medium containing auxin and cytokinin, and the 
cultures showed sustained regeneration capacity 
for >6 months. Somatic embryos propagated 
in suspension cultures developed further into 
embryos bearing cotyledons about 1 month after 
transfer to solidified medium containing abscisic 
acid. Electron microscopic examination revealed 
that storage nutrients, lipids, proteins, and carbo- 
hydrates accumulated in the somatic embryos 
during treatment with abscisic acid (ABA). Upon 
subculture to medium lacking plant growth regu- 
lators, such embryos could develop into small, 
green plantlets. 

Maheshwaran and Williams (1984) reported 
direct somatic embryoid formation in immature 
embryos of Trifolium repens, T: pretense, and 
Medicago sativa. They also demonstrated the 
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effect of cytokinins on somatic embryo initiation/ 
maturation especially cotyledon development 
and subsequent plantlet formation in several 
plant species. Bhattacharya and Sen (1980) 
successfully produced plantlets through somatic 
embryogenesis in Brassica campestris. 

An efficient procedure was developed by Gill 
and Saxena (1992) for inducing direct somatic 
embryogenesis, organogenesis, and regeneration 
of plants from tissue cultures of peanut (Arachis 
hypogaea L.). Thin transverse sections of cotyle- 
dons and juvenile leaves were cultured on 
Murashige and Skoog (1962) medium supple- 
mented with N°-benzylaminopurine (BAP) or a 
substituted phenylurea, thidiazuron (TDZ). 
Somatic embryos or shoot buds differentiated 
from cut surfaces of the cotyledons and midrib 
region of the leaves. Application of BAP induced 
differentiation of shoot buds, whereas treatment 
with TDZ resulted in the production of somatic 
embryos. Somatic embryos developed into plants 
upon subculture on basal medium. Agar-solidified 
medium was found to be superior over liquid 
medium for the development of embryos and shoot 
buds. TDZ-induced somatic embryogenesis and 
plant regeneration were successfully applied to 
three genotypes. A distinct feature of their study 
was induction of morphogenic competence in 
cultures of seedling explant origin in peanut — 
until then known to be recalcitrant to somatic 
embryogenesis in vitro. 


15.5 Molecular Markers 
for Assessing Genetic 
Fidelity of Regenerants 


Gesteira et al. (2008) used random amplified 
polymorphic DNA (RAPD) markers for evaluating 
genetic stability of regenerants (plants) in soy- 
bean obtained through somatic embryogenesis 
using 180 pM 2,4-dichlorophenoxyacetic acid. 
Twenty primers were used for screening 44 
regenerants in cv “Spring” and 28 in cv “CAC-1.” 
Three primers were polymorphic for two of the 
““Spring’’-derived regenerants, showing somaclonal 
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frequency of 4.5 %. Four primers were polymor- 
phic for ‘“CAC”-l-derived regenerants, with 
somaclonal frequency of 3.57 %. Thus, usefulness 
of RAPD markers in detecting genetic instability 
in soybean primary regenerant plants derived 
from somatic embryogenesis was demonstrated; 
this could also be used as a certification tool for 
monitoring genetic stability during the regenera- 
tion process. 


15.6 In Vitro Conservation 

Several methods of in vitro conservation in 
grapes have been developed. Cryopreservation of 
embryogenic cell suspensions by encapsulation- 
vitrification; influence of pretreatment, freezing 
and thawing, and postculture conditions and 
freezing in liquid nitrogen of alginate-coated 
shoot tips are some of the aspects investigated. 
Genetic stability over long periods and over vari- 
able temperatures imposed during preservation is 
a critical component of in vitro conservation 
methodologies. This has been studied using 
molecular markers. 

Tissue culture-based conservation strategies 
for use in in vitro gene banks have been worked 
out for grape germplasm by Ganeshan. In his 
study, meristem culture and differentiating, ovary 
explant—derived callus showed amenability to 
conservation under reduced input culture 
conditions. 

Synthetic seed technology in grape, which 
would allow clonal germplasm of grape to be con- 
served in seed repositories, is relatively well 
advanced. Cotyledonary-stage somatic embryos 
originating from leaf explants have been encapsu- 
lated in alginate gel in Murashige and Skoog (1962) 
medium by Das et al. (2006). The percentage germi- 
nation in encapsulated somatic embryos was found 
to be higher than in nonencapsulated embryos. The 
same workers encapsulated somatic embryos as an 
efficient method for storing and propagating patho- 
gen-free planting material. Patented technologies 
are available for a number of cultivars for achieving 
successful somatic embryogenesis. 
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15.7 Conclusions 


Biotechnology promises to dramatically decrease 
the time needed to develop new grape varieties in 
view of the exciting strides made recently, 
especially in the sequencing of the genome and 
somatic embryogenesis promises to deliver 
nonchimeric, solid transgenics. These could be 
further multiplied, in turn, using somatic embryo- 
genesis protocols. Already patented technologies 
have been developed for generating disease- 
resistant grape cultivars. Patented technologies 
are available for regeneration systems in grape 
(US Patent No. WO/1999/059398) for exploita- 
tion by the biotechnologist for incorporation of 
genes of choice into cultivars of choice. 

Improved methods of germplasm conservation 
and vegetative propagation should arise through 
synthetic seed technology. In vitro selection for 
desirable traits in established cultivars is an 
attractive approach and has been successful 
recently. Haploids and dihaploids have not been 
developed, stifling the evaluation of their use in 
grape improvement. 

However, ongoing research in these areas is 
encouraging and the rapid progress displayed to 
date demonstrates that we are on the verge of uti- 
lizing an array of biotechnological procedures. 
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Abstract 

Micropropagation is a rapid multiplication of a selected plant using in vitro 
culture techniques. In this chapter various aspects of micropropagation 
have been discussed. The propagation of selected plant through micro- 
propagation would be useful for raising plantation using apical and nodal 
segment. They are best for micropropagation and mostly result in true to 
type plants. These segments upon the subsequent subcultures result in a 
number of multiple shoots. These multiple shoots on elongation allowed 
to root in vitro. After rooting, they are in vitro hardened and transferred to 
field. The potential of plant tissue culture is well recognized, as it increases 
agricultural production and generates rural employment. But the high cost 
of production on micropropagation is a major bottleneck. Low-cost proto- 
col development can popularize this method. High multiplication rate, use 
of low-cost chemicals, high rooting and survival percentage, and use of 
ex vitro rooting method can minimize the expense of any protocol. 
Presently, both the developing and the developed countries require low- 
cost technologies to progressively reduce the cost of production. The per- 
formance of tissue plants generally better than cutting and seedling. 
Initially, some morphological changes occur in micropropagated plant, but 
in the course of time, they minimize. Somaclonal variation caused during 
callus cultures can be used to generate variants for hybridization. Variation 
in phenotype of plants produced in tissue culture can be characterized as 
either temporary or permanent. Temporary variation includes increased 
branching, greater susceptibility to disease, and lack of uniform response. 
Micropropagated plants need to be monitored to get long-term data for its 
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growth, survival, yield, and disease attach under field condition as very 
limited information is available for many plants. In this chapter micro- 
propagation of commercially important plants like Jatropha curcas, 
Eucalyptus, banana, and bamboo has been discussed. 


Keywords 


Plant tissue culture * Regeneration ¢ Micropropagation * Shoot prolifera- 
tion ¢ Clonal plants * Tissue hardening ¢ Genetic fidelity * Somaclonal 
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16.1 Introduction 

In vitro clonal propagation provides true to type 
plants of a selected genotype using in vitro cul- 
ture techniques (Thorpe 2007). Endangered, 
threatened, and rare species have successfully 
propagated by micropropagation because of its 
high multiplication rate with minimal number of 
starting material/plants used for initiation of cul- 
tures. Most often micropropagation is coupled 
with mass production at a viable price. Gottlieb 
Haberlandt (1902) is the pioneer of plant tissue 
culture and was the first person to isolate plant 
cell. With course of time, micropropagation has 
become an essential tool for the propagation of 
plants that are otherwise difficult to propagate 
conventionally by seed and/or vegetative means. 
It has many distinctive advantages over conven- 
tional propagation methods such as rapid multi- 
plication of valuable genotypes, expeditious 
release of improved varieties, production of 
disease-free plants, production round the year, 
germplasm conservation, and facilitating their 
easy exchange. Micropropagation has raised high 
expectation for large-scale propagation, and lot 
of research has been focused on the development 
of protocols for large-scale propagation of many 
plants. This biotechnological dream was fulfilled 
by a large number of publications; although there 
are a number of publications, only a few of the 
published protocols could be converted into com- 
mercially viable propagation systems, and prob- 
lems were either technological or associated with 
marketing. Most of the published work lacked 


crucial issue of selecting seeds or adult plants as 
a starting material, and the choice of the propaga- 
tion method is crucial. The advantage of using 
seedlings is that the technology is easier but with 
the disadvantages of unknown genetic back- 
ground. Using adult plants as a starting material, 
it has the problem of endogenous contamination, 
poor response, instability of multiplication rates, 
and rooting problems. However, the use of adult 
plant as the starting material is desirable as its 
pedigree is known. It is more useful to propagate 
the elite plant material than unknown. Most of 
the conventional techniques for vegetative propa- 
gation are useful for the production on a smaller 
scale, but for large-scale propagation (lakhs and 
millions of the plants per year) requires highly 
efficient techniques. For large-scale propagation, 
conventional methods are insufficient and inef- 
fective, and tissue culture is the only viable 
method. Nobody would think of propagating a 
weedy fig tree via tissue culture as can be propa- 
gated easily via cuttings in the tropical countries. 
More than fifty lakh plants of Ficus plants are 
generated in Europe through tissue culture. In 
some plants, micropropagation has completely 
replaced propagation via cuttings. 

Plant can be micropropagated in three differ- 
ent ways: (1) axillary bud, where axillary bud 
gives rise to many new shoot buds; (2) direct 
regeneration of shoot buds, further gives rise to 
many buds or through somatic embryos and gives 
rise to plantlet as they are diploid with shoot and 
root connected by vascular system; and (3) indi- 
rect regeneration of shoot buds or somatic 
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embryos from callus. Numerous examples of its 
application have been reported such as culturing 
of gametes, meristematic culture for virus elimi- 
nation, zygotic embryo culture, embryo rescue 
(ER), and germplasm conservation; ER tech- 
nique nurtures the immature or weak embryo, 
and this way it helps to germinate. Recently, this 
technique has been proven useful for breeding 
seedless grape; thus, remarkable progress in 
micropropagation technologies is occurring as 
their applications are diverse and significant. 

Genetic transformation is the most recent 
aspect of micropropagation that provides the 
mean of transfer of genes with desirable trait into 
host plants. Plant can be genetically modified by 
direct gene transfer or indirect gene transfer 
method. In the indirect gene transfer method, 
Agrobacterium-mediated genetic transformation 
is most commonly used for the expression of for- 
eign genes in plant tissue. Successful introduc- 
tion of agronomic characters in plants was 
achieved by using root explants for the genetic 
transformation. Virus-based vectors offer an 
alternate for stable and quick ephemeral protein 
expression, thus providing an efficient mean of 
recombinant protein production. 


16.2 Micropropagation 


In vegetative propagation use of micropropaga- 
tion has become very popular due to its applica- 
tion in agriculture, forestry, and horticulture. It 
has been possible to propagate forest trees, crop 
plants, vegetables, ornamentals, medicinal plants, 
and other commercial useful plants using this 
technique. Micropropagation 1s performed in fol- 
lowing steps: (1) generations of aseptic cultures, 
(2) multiple shoot induction, (3) shoot elongation 
and proliferation, and (4) rooting, hardening, and 
field trials (Fig. 16.1). We will be discussing in 
this present chapter about micropropagation of 
commercially important plants like Jatropha cur- 
cas, Eucalyptus, banana, and bamboo. 
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Fig. 16.1 Flow chart summarizing micropropagation 
method 


16.2.1 Case Study 1 
Micropropagation of Jatropha 
curcas L. 
16.2.1.1 Generation of Aseptic 
Cultures 
Seedling, nodal and apical shoot buds, cotyle- 
donary leaf, petiole, stem, epicotyls, and hypo- 
cotyls were used for culture generation. Axillary, 
apical, stem, and leaf were used for the genera- 
tion of large-scale cultures using explants from 
Council of Scientific and Industrial Research — 
Central Salt and Marine Chemicals Research 
Institute (CSIR-CSMCRI) field _ station, 
Bhavnagar, Gujarat, India. The establishment 
success of shoot cultures was >90 % when 
explants were taken from actively growing stems, 
while explants from the dormant plant were dif- 
ficult to establish, as lot of infection occurred. 
Young shoots of new branches or priory pruned 
branch was cut, and petiole was removed by 
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keeping | cm petiole base attached to stem. 
Shoots were then cut keeping single bud in each 
segment. Explants were surface sterilized for 
8 min in 0.1 % mercuric chloride and washed 
thrice with sterile distilled water. Trim the 
remaining part of petiole along with the lower 
and upper end of segment and cultured on bud 
initiation medium (Singh et al. 2010; Reddy et al. 
2008, 2009). While most of the reports have used 
systemic fungicide, Bavistin, two to four liquid 
soap solution or two to five drops Teepol for 
10—20 min prior to surface sterilization and fol- 
lowed by 0.1 % HgCl, (w/v) for surface 
sterilization. 


16.2.1.2 Multiple Shoot Bud 
Induction 

Nodal segment, leaves, and stem were used for 
shoot bud induction and cultured on MS medium 
supplemented with different concentrations and 
combinations of auxin and cytokinins. Multiple 
shoot bud induction is the most important stage 
in micropropagation. Success of any protocol 
depends upon the rate and number of multiple 
shoots. It mainly depends upon the formulation 
of medium used and combinations of plant 
growth regulators (PGRs). Cytokinins play key 
role in multiple shoot induction in this plant. 
Large-scale cultures were generated by CSIR- 
CSMCRI (Fig. 16.2), during the study cytokinin 
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alone resulted 3-6 multiple shoots in primary 
cultures, and upon subculture on MS medium 
supplemented with cytokinin and auxin, 6—12 
multiple shoots were achieved (Reddy et al. 
2009; Fig. 16.3a, b). MS medium with 
6-benzylaminopurine (BAP) alone induced shoot 
buds with two multiple shoots per explants. 
Activated charcoal when added to the culture 
medium was found to have promoted high fre- 
quency of multiple shoots in shoot tip cultures of 
J. curcas. While, BAP and kinetin (KN) in com- 
bination with indole-3-acetic acid (IAA) induced 
45 multiple shoots. BAP in combination with 
indole-3-butyric acid (IBA) adenine sulfate + 
glutamine + L-arginine + citric acid induced 
shoots bud (Shrivastava and Banerjee 2008). 
Axillary shoot bud proliferation was best (six 
shoots per nodal explants) initiated on BAP with 
adenine sulfate. The multiplication rate (30 mul- 
tiple shoot buds per culture) was significantly 
enhanced upon transfer to MS medium with KN, 
IBA, and adenine sulfate (Datta et al. 2007) and 
more than 20 shoot buds from leaf explants (Reddy 
et al. 2008) and 10—15 buds from stem explants 
were achieved (Singh et al. 2010). 

In vitro culture studies on Jatropha have been 
conducted on somatic embryogenesis, zygotic 
embryogenesis, and direct and indirect organo- 
genesis. Demissie and Lele (2010) reported 
phorbol ester from solid and liquid culture 





Fig.16.2 Large-scale culture generation at CSMCRI-CSIR 
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Fig. 16.3 Large-scale micropropagation of J. curcas 
using axillary shoot bud (a, b) multiple shoot induction 
upon subculture of explant, (c) large-scale cultures gen- 


medium of Jatropha; phorbol ester content was 
high in liquid medium as compared to solid 
medium. Callus cultures were initiated from leaf 
and hypocotyl explants of J. curcas; hypocotyl 
explants were proved to be best in inducing large- 
scale callus in the short time (Soomro and Memon 
2007). Attempts were also made for in vitro mat- 
uration and germination of J. curcas microspores. 
The rate of germination was 71.6 % and pollen 


eration of J. curcas (d), in vitro rooting of shoots (e), 
ex vitro rooted shoots (f), well-rooted plants in nursery (g) 


matured in vitro exhibited a rate of germination 
of 13.7 %, when it was transferred into the opti- 
mal medium (Li et al. 2010). However, the use of 
cyanobacterial filtrate instead of synthetic PGRs 
was found to be better for in vitro germination of 
J. curcas embryo. In medium containing cyano- 
bacterial filtrate, shoot and root lengths were 
comparatively higher than the other treatments 
with synthetic plant growth regulators. The same 
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trend was found for chlorophylls a and b (Shrivastava 
and Banerjee 2009). Somatic embryos were gen- 
erated using different explants like cotyledonary 
leaf and leaf callus with different regeneration 
efficiencies. In recent years, plant regeneration in 
J. curcas has been accomplished through organo- 
genesis from various explants, including mature 
leaf, petiole, hypocotyls, and stem. 


16.2.1.3 Shoot Elongation 
and Proliferation 

Shoot elongation and proliferation is an impor- 
tant stage for the successful plantlet development 
in tissue culture. Generally, reduction in the 
PGRs in the subculture medium is favorable for 
elongation and proliferation, but sometimes sub- 
culturing to the same composition also found to 
be equally good. In an experiment MS medium 
supplemented with BAP and IAA induced 6-12 
multiple shoots upon subsequent subculture on 
the same medium (Fig. 16.3c, d). Subculturing 
from alone BAP to BAP in combination with 
IAA and other additives was found optimal for 
shoot proliferation. While subculturing on the 
Same medium used for primary culture, BAP/KN 
in combination with IAA was also found equally 
good for shoot elongation and _ proliferation. 
Similar results were reported of subsequent sub- 
culturing onto the same medium found to be best 
for shoot proliferation and elongation with 100 
shoot buds per explants (Shrivastava and Banerjee 
2008). MS medium with KN, IBA, and adenine 
sulfate was found best for shoot elongation and 
proliferation as both shoot number and shoot 
length were found to increase significantly. 


16.2.1.4 Rooting, Hardening, 
and Field Trials 

Rooting of shoot not only depends upon the 
medium composition and PGRs but also depends 
upon other factors like shoot length, light, and 
age of culture. The full-strength MS medium was 
found to best for rooting as compared to half MS 
medium. As only 33 % rooting could be achieved 
in half MS medium and 88 % rooting in PGR-free 
full MS medium. The shoots rooted efficiently on 
half-strength MS medium (Purkayastha et al. 
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2010). Healthy elongated shoots were rooted on 
half-strength MS medium supplemented with 
IBA. The addition of phloroglucinol (200 uM) to 
the full MS medium enhanced the frequency of 
rooting to 76.7 % (Kumar and Reddy 2010). 
While, IAA (1.0 mg/l) was found to be more 
suitable than IBA for root induction. More than 
50 % rooting could be achieved with the use of 
combinations of auxins IBA, IAA, and NAA 
(Reddy et al. 2009). Addition of activated charcoal 
(AC) to the medium does not have any significant 
role to play on rooting percentage (Fig. 16.3e) 
The role of AC on plant growth is still unclear, 
but it is believed that AC may slowly release 
certain adsorbed products, such as nutrients 
and growth regulators, and become available for 
growth and rooting of the plant. Long mature 
shoots could ex vitro root while tender shoots 
dried. Only few protocols are available to mini- 
mize the cost of the protocol via ex vitro rooting 
(Singh et al. 2010; Reddy et al. 2009; Fig. 16.3f). 
Hardening of the plant involves in vitro hardening 
by placing plants transferred into sterile polythene 
bags under in vitro conditions for a month, 
then shifted to the greenhouse and slowly 
humidity minimized. During ex vitro rooting, 
time minimizes in this whole process and so 
expense minimizes. Sapling shifted to nursery for 
further acclimatization and then shifted to field 
(Figs. 16.3g and 16.4). 

More than 85 % of saplings of particular 
accession well established in different wasteland 
sites of CSMCRI, Bhavnagar, Gujarat. It was 
observed that tissue culture—raised plants were 
superior to cuttings and seed-raised plants in the 
field in terms of consistent yield. 

Morphologically saplings were similar and- 
verified by molecular marker (Rathore et al. 
2014), and suggesting plants -were true to type. 
The survival rate of 60-70 % was achieved in 
field (Rajore and Batra 2005). Most of the work- 
ers limited their workup to pot experiments. Most 
of the work has been done using explants of 
unknown pedigree or used seedlings. Available 
protocols do not have information regarding their 
survival in the field, which is a very crucial issue 
in tissue culture. Though plant is susceptible to 
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Fig. 16.4 Micropropagated plant growing in amended wasteland 


abiotic and biotic stress, only few attempts have 
been made for genetic transformation in the 
plant. Although many methods have been 
published, implementation of this technology on 
a commercial scale is a constraint, due to the high 
cost involved. Some Indian units are currently 
producing J. curcas at the commercial level 
(Table 16.1). CSIR-CSMCRI in collaboration 
with MNRE (Ministry of New and Renewable 
Energy) worked on project large-scale micropropa- 
gation of Jatropha curcas and generated cultures 
on a large scale and developed plants were well 
established in the field. Micropropagated plants 
were found genetically stable (Rathore et al. 
2014). 


16.2.2 Case Study 2 
Micropropagation 
of Eucalyptus 

16.2.2.1 Generation of Aseptic 

Cultures 

Seedling, lignotubers, and nodal and apical cop- 

pice explants from mature plants were used for 

culture generation. Pretreatment of fungicide was 


Table 16.1 J. curcas being currently micropropagated 
by commercial tissue culture units in India 

1 Labland Biotech Private Limited, Mysore J. curcas 
2 Reliance Life Sciences Pvt. Ltd. Bombay J. curcas 
3. Nandan Biomatrix Limited, Hyderabad J. curcas 


4 Global Flora Biotech, Coimbatore J. curcas 


found essential for mature plants by many workers, 
but not found essential by Giryjashankar (2012). 
Surface sterilization of nodal segments carried out 
using 0.1 % mercuric chloride (HgCl,) for 
10-17 min and | % sodium hypochlorite (NaOCl) 
for 7-15 min; 70 % ethanol and few drops of 
Tween 20 were added to the treatment solution. 
Regeneration through callus is difficult; however. 
callus induction has been reported in more than 30 
species of Eucalyptus, and successful regeneration 
was achieved in a few species only. 

Regeneration efficiency depends on the type 
of explants (plant part used for culture genera- 
tion), time of subculture, nutrients, and PGR for- 
mulation. Malt extract was used in callus cultures 
of E. viminalis and E. coccifera (\shii 1982) and 
coconut milk for callus cultures of E. polybrac- 
tea. Callus generated from lignotuber, stem, and 
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cotyledons could regenerate shoot buds. These 
shoot buds could elongate on medium with auxin 
in combination with cytokinin. Shoots regener- 
ated from stamen callus of E. marginatu showed 
low rooting percentage. Explants source plays an 
important role in regeneration efficiency and 
rooting percentage. 


16.2.2.2 Multiple Shoot Bud 
Induction 

The best micropropagation is achieved by axil- 
lary shoot bud proliferation using nodal segments 
of mature plant. For multiple shoot bud induc- 
tion, explants were cultured onto the medium 
supplemented with cytokinin and auxin (Roux 
and Staden 1991; Hartney and Barker 1980). 
Multiplication rate depends upon factors like 
species, clone, explant source, and juvenility. 
Highest multiplication was achieved by Hartney 
(1982) with a potential production of millions 
shoots from a single shoot within a year. However, 
seedling performed even better in E. marginatu. 
Cultures showed problem like unwanted callus 
and vitrification of shoot buds. Vitrified shoot is 
difficult to multiply and impossible to root. This 
problem could be overcome by the use of acti- 
vated charcoal into the medium in this and other 
plants (Boulay 1983; Singh et al. 2008). Callusing 
retards shoot bud growth; it adversely affects on 
multiplication rate and rooting of the shoot. 
Eucalyptus cultures of E. marginatu showed cal- 
lusing. Storage of cultures has been reported in E. 
camaldulensis and E. grandis on a _ simple 
medium for over 8 months and E. gunnii and E. 
dalrympleana for 6 months. Embryonic cultures 
of E. citriodora could be stored for more than 
9 months without subculture and without loss of 
embryogenic potential. 


16.2.2.3. Shoot Proliferation 
and Elongation 

Multiple shoot buds need to be separated so that 
they could elongate sufficient enough to get 
rooted. By deceasing cytokinin shoot bud elonga- 
tion was achieved in E. torelliana_ shoots. 
Whereas a transfer from liquid to solid medium 
achieved a similar effect with E. tereticornis 
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shoots (Mascarenhas et al. 1982). Gibberellic 
acid in different concentrations ranging from 0.1 
to 15 mg/L has been added to media to obtain 
shoot elongation with and without activated char- 
coal (Furze and Cresswell 1985). The use of dif- 
ferent medium has been reported for culture 
generation of Eucalyptus. MS medium was found 
suitable for culture generation E. camaldulensis 
and Eucalyptus spp. 


16.2.2.4 Rooting, Hardening, 
and Field Trials 

Root initiation using different media, e.g., 
White’s (1963) medium, Knop’s (1865) medium, 
and half-strength Murashige and Skoog’s (1962) 
medium, has been reported. Rooting efficiency 
also gets affected by source explant. The range of 
rooting percentages for seedling shoots was 
5—80 %, whereas the range for an adult shoot was 
2-20 %. Burger (1987) achieved 4 % rooting 
with adult explants of E. sideroxylon and 100 % 
rooting from coppice explants shoots. Gupta 
et al. (1983) observed 50 and 70 % rooting of 
shoots from mature trees of E. torelliana and 
E. camaldulensis, respectively. The rooting 
percentage of shoots varies upon the number of 
subcultures. Mature shoots root better than tender 
shoot. As the number of subculture increases, 
rooting percentage increased from 20 to 100 % 
from the third to the sixth subculture (Badia 
1982a). Mc Comb and Bennett (1982) found sim- 
ilar results with mature explants of E. marginata. 
The use of half-strength MS supplemented with 
activated charcoal favored rooting. The use of 
auxins like NAA (naphthaleneacetic acid) and 
IAA (indole acetic acid) promoted rooting by 
minimizing callus formation. The addition of 
riboflavin into the rooting medium changed root- 
ing pattern. Vitamin E, enhanced shoot growth, 
was reported by Badia(1982b), while Boulay 
(1983) found it noneffective. Ex vitro rooting is a 
promising method for cost minimization of the 
protocol (Singh et al. 2010). Shoots of E. camal- 
dulensis were ex vitro rooted directly in the sand 
and in nonsterile conditions. 

Adaptation of plants to the new environment 
is acclimatization, and the process of adjusting 
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the in vitro grown plantlet to the outside or sur- 
rounding climate is hardening. Rooted shoots are 
subjected to hardening so that they can be trans- 
ferred to field conditions. During hardening plant 
is Slowly acclimatized to the outside environ- 
ment. Hardening material and humidity play an 
important role in plant establishment. Transferring 
plantlets to container containing a mixture of 
70 % pine bark, 25 % sphagnum peat, 5 % brown 
coal ash, and 4 g dmW3 Osmocote fertilizer 
resulted in a good establishment of the plant. As 
plantlets come from complete sterile conditions, 
they are sensitive to diseases; also frequent sprays 
with fungicides and insecticides are recom- 
mended. Establishment of plant varies species to 
species. Bell et al. (1993) studied the behavior of 
micropropagated plants and seedling. They found 
some variation in the initial phase of growth, but 
soon those variations were lost after 40 days. 
Goodger and Woodrow (2010) studied E. poly- 
bractea and found somaclonal variants with 
increased foliar oil concentrations. The study on 
micropropagated FE. grandis x nitens showed 
that, despite of root morphological differences, 
micropropagated plants did not significantly 
reduce the efficiency of roots to take up and 
transport water to the leaves in a field trial. 
Greater uniformity was found in micropropa- 
gated plants from the normal microcutting. 
Uniformity is desirable for the practice of clonal 
forestry. There is a need to monitor micropropa- 
gated plant and study its behavior in field condi- 
tion for years, as many time elite features of the 
plant disappear with time, for example, increased 
branching of ornamental plants, but the effect is 
lost in successive generations when cuttings are 
generated from the tissue cultured plants. 

List of commercial unit’s micropropagating 
Eucalyptus (Table 16.2). Center of Agriculture and 
Forestry Research in Italy together with Celulosas 


Table 16.2 Eucalyptus being currently micropropagated 
by commercial tissue culture units 


Green Valley of India, Aligarh, Rs. 37 
India 


Revolving Earth Agro Pvt. Ltd, Rs. 30 
Mumbai, India 


Merar Limited, London — 


Eucalyptus sp. 
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de Asturias SA worked on a collaborative project 
Biogenie and successfully applied molecular 
technique for clonal identity on a commercial 
basis. Mondi forest, a division Paper Company 
Ltd. in South Africa, uses tissue culture technique 
to establish microcuttings of cold-tolerant clones. 

The cost of production of micropropagated 
plants depends mainly on labor cost in the coun- 
try/place/region of origin, energy cost, number of 
cultures per culture vessel/unit area of produc- 
tion, rate of multiplication and number of weeks 
between subculture, percent of culture contami- 
nation, and saleable plants after the production 
process. 


16.2.3 Case Study 3 
Micropropagation of Banana 
(Musa sp.) 


In 1972 tissue culture of banana (Musa) was first 
time reported using shoot apex; later on meristem 
tissue combined with thermotherapy was used 
for generation of virus-free banana plants. Since 
then people are working on large-scale propaga- 
tion of good varieties of banana. The leading pro- 
ducer of banana is Maharashtra and then followed 
by Tamil Nadu, Gujarat, Karnataka, and Andhra 
Pradesh. They contribute more than 85 % of total 
banana production in the country. 

16.2.3.1 Generation of Aseptic 

Cultures 

Cultures are initiated using different explant 
sources of banana plant. The most commonly used 
explant sources are shoot apex, suckers, lateral 
buds, peepers, and terminal inflorescence. For cul- 
ture initiation, explant must be disinfected; various 
methods of surface sterilization have been pro- 
posed by researchers. Sodium hypochloride is 
most commonly used for surface sterilization of 
banana and also use of mercury chloride found to 
be effective by many researchers. Pretreatment of 
antibiotic, fungicide, and ethanol minimized infec- 
tion rate. However, cultures could be initiated 
without any disinfection reported. For in vitro cul- 
ture, banana seeds were treated with 1.4 % sodium 
hypochloride and washed with ethanol. 
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16.2.3.2 Multiple Shoot Bud 
Induction 

Shoot tip and shoot tip meristem are commonly 
used explants for multiple shoot bud induction; 
use of male flower for embryogenesis is also 
reported. Shoot tips are cultured on the medium 
intact, wounded or fragmented for shoot bud 
induction. During fragmentation, shoot apex was 
cut longitudinally into two or more halves and 
cultured onto the medium. Different media sup- 
plemented with PGRs (plant growth regulators) 
are used for culture initiation. Some researchers 
used the same medium for initiation and multipli- 
cation, while others have changed the concentra- 
tion of PGRs in initiation and multiplication 
medium. Assani et al. (2003) initiated cultures on 
MS medium supplemented with casein hydroly- 
sate and adenine sulfate (Nandwani et al. 2000). 
Diploid and triploid cultures could be generated 
using male flower explants. Anthers were also 
used as explant for generation of haploid plants. 
Lee et al. (1997) used sliced rhizome tissue for 
the induction of somatic embryos. 


16.2.3.3 Shoot Proliferation 
and Elongation 

PGRs are the essential components of the 
medium; their concentration and ratio generally 
determines the growth and development of the 
explant. The response varies explant to explant; 
Mendes et al. 1996 reported that shoot tips of dif- 
ferent rhizomes behave differently as some of 
them were highly productive and others were less 
productive. A high concentration of 5—20 mg/L 
BAP in combination with 10-40 % sucrose and 
5—20 mg/L ancymidol (AVC) is found best for 
shoot proliferation and elongation, medium sup- 
plemented with high concentration of BAP and 
coconut milk enhanced regeneration efficiency of 
shoot buds. BAP in combination with auxin 
enhanced shoot proliferation and elongation, 
BAP with IBA is found to be best for three culti- 
vars (Cavendish, Bluggoe, and Silk) of banana, 
and BAP with IAA enhanced shoot multiplica- 
tion of leaf regenerants. TDZ is frequently used 
in tissue culture; its combination with IAA is 
found to be best for shoot elongation as com- 
pared to BAP in combination with IAA. Also 
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more than one cytokinin, with or without auxin, 
were used together to enhance the shoot prolif- 
eration and elongation of banana cultures. Kinetin 
and BAP were used in combination, and their 
combination with 21P was used for shoot multi- 
plication of cv. Sabri. However, kinetin and BAP 
combination produced the highest number of 
shoot buds as compared with 21P. The addition of 
ANC (growth retardants) induced significant dif- 
ference in plant height during hardening. Also, 
the use of ANC and paclobutrazol in liquid 
medium decreased the excessive growth of leaf 
and stem. Browning of the medium during multi- 
plication is observed due to the oxidation of phe- 
nols. It interferes in the plant growth and leads to 
necrosis of the tissue. It could be minimized by 
the addition of 50-100 mg/L of calcium chloride 
in the medium. Washing with 0.125 % potassium 
citrate before culturing was beneficial to control 
necrosis of the explant. Agar medium is expen- 
sive aS compared to liquid medium, and also liq- 
uid medium is easy to handle. The highest 
multiplication rate was observed in the explant 
cultured in temporary immersion as compared to 
other methods using cellulose as substrate and 
partial immersion. Though there are many reports 
using liquid medium and bioreactors, but com- 
mercially propagation is carried by solid culture 
medium. 


16.2.3.4 Rooting, Hardening, 
and Field Trials 

Generally for successful rooting, concentration 
of auxins, MS, and sucrose is mainly responsi- 
ble. However, auxin with cytokinin is also 
reported for rooting in many plants. High auxin 
and low cytokinin concentration are favorable 
for root induction. The most frequently used 
auxins for rooting of banana shoots are NAA, 
IAA, and IBA; however, rooting without PGRs 
has been also reported in banana. Shoots regen- 
erated many roots when cultured on MS 
medium supplemented with 1.0 g/L activated 
charcoal (AC); use of 0.025 % AC was also 
good for rooting. Activated charcoal (AC) was 
initially added to tissue culture media in an 
attempt to simulate soil conditions: now a day it 
is usually added in many tissue culture media 


16 Micropropagation of Plants 


formulations. The promotary effects of AC on 
rooting may be due to its irreversible adsorp- 
tion of inhibitory compounds in the culture 
medium and substantially decreasing phenolic 
exudation and brown exudate accumulation. In 
addition to this, activated charcoal is also 
involved in a number of stimulatory and inhibi- 
tory activities which promote growth. IBA was 
effective for root induction of banana shoots 
and frequently used for this purpose in solid 
and semisolid medium. Half MS medium sup- 
plemented with IBA produced a good number 
of roots. Pulse treatment of IBA and NAA com- 
bination was used to produce in vitro rooted 
plants. Alone NAA could root in vitro shoots of 
Musa textilis and AAB bananas. The establish- 
ment success of in vitro plant in a field depends 
upon the method used for hardening. In vitro 
plants which have poor cuticle and open stoma- 
tas, therefore, lose water rapidly under field 
condition and also prone to different diseases. 
Shoots cultured on medium supplemented with 
triazoles established better as compared to 
control. In vitro triazole treatment acts as a 
conditioner and obviated the need for harden- 
ing. To minimize the cost, plantlets were hard- 
ened without a greenhouse. The humidity was 
maintained by spraying water after 2 h. On an 
average 92 % of the plantlets survived. 

Hardened plants are shifted to the field, and 
their establishment in the field is found to be 
good by many workers. The yield performance of 
plant depends upon the spacing between the 
plants (Nokoe and Ortiz 1998). Plant number var- 
ies according to variety as the growth of the plant 
varies plant to plant. The leaf and pseudostem 
were bigger in micropropagated plant as com- 
pared to original suckers. Higher yield in micro- 
propagated plant is due to large bunch and shorter 
cycle to harvest. However, in some cases pheno- 
typical variations were seen, and the plants did 
not show any higher yield. To evaluate, regenerants 
were compared with mother clone, and signifi- 
cant variation was observed in fruit maturity and 
leaf size. Among all the variants, three of four 
variants were inferior to mother clone from which 
they were derived. Only one regenerant resembled 
to mother clone with reference to yield. 
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Among the banana growers, popular banana 
varieties are Grand Naine (G9), Yelakki, Dwarf 
Cavendish, and Nendran in India. Grand Naine 
(G9) is high-yielding variety and resistant to 
Panama wilt. Yelakki is good in taste and is pre- 
ferred in religious ceremony but prone to Panama 
wilt, and sometimes complete devastation of the 
crop occurs by the disease. The selling price of 
them is different; Yelakki market value is higher 
than G9, but as G9 is resistant Panama wilt is 
more popular among growers. Desai Fruits and 
Vegetables (DFV) is an agribusiness and India’s 
leading exporter company of premium banana. 
Company works with the farmers in Gujarat, 
Maharashtra, Tamil Nadu, and Andhra Pradesh 
and also exports banana in Middle East (Dubai, 
Iran, Iraq, Kuwait), Saudi Arabia, Oman, Qatar, 
and Eastern Europe (Ukraine). In the year 2013- 
2014, the production of banana tissue culture was 
250,000 MT and is expected to reach 300,000 MT 
in 2014-2015 (www.desaifv.com). Cadila pro- 
duces eight millions of plants per annum of 
Grand Naine (G9), and SECON Agriventure pro- 
duces G9, Yelakki, and Nendran bananas. 


16.2.4 Case Study 4 
Micropropagation of Bamboo 
(Bambusa sp.) 


Bamboos are natural polyploids and variability 
occurs within the seedling of tropical and tem- 
perate bamboos. In the seedling, variability 
occurs at the genetic and the phenotypic level. 
Micropropagation via tissue culture has attracted 
lots of interest, and it was thought that this 
method could crack most or at least some prob- 
lems in propagation of bamboo. Woody bamboos 
have been used for different applications, but 
recently interest from paper and wood industries 
has increased, for both tropical and temperate 
woody bamboos. Bamboo can be cultivated by 
different ways like seeds, clump division, and 
rhizome cuttings. But these methods are insuffi- 
cient for small- or large-scale propagation, espe- 
cially when the target is in lakhs. Bamboo has 
huge market round the world; temperate bam- 
boos are used for agroforestry in China and as 
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ornamental plants in Europe and the USA. The 
magnitude of the demand for bamboo planting 
materials indicates that micropropagation will be 
necessary for large-scale propagation. The total 
world production of all tissue cultured plants in 
1995 was estimated in millions. Through micro- 
propagation within a very short time, lakhs and 
millions of true to type plants can be made from 
one single elite clone. For large-scale production, 
efficiency of the propagation methods is impor- 
tant, but perhaps even more crucial is the genetic 
stability. Gielis and Oprins (1998) practiced com- 
mercially feasible micropropagation for large- 
scale propagation for species and varieties of 
temperate and tropical bamboos, and the technol- 
ogy was capable to propagate any bamboo. The 
main emphasis was on the propagation of more 
than 60 different species, cultivars of bamboo, 
ornamental bamboos, and tropical bamboos. The 
approach for successful method involves three 
major components: (1) research on bamboo 
physiology and genetics, (2) the development of 
propagation method using axillary branching into 
a universal technique with a high clonal fidelity 
and very high efficiency, and (3) evaluation of 
in vitro to in vivo performance. The developed 
method could possibly micropropagate almost 
any selected bamboo at mass scale in a short time 
frame. Huge numbers of bamboo can be trans- 
ported from laboratory to site or anywhere in the 
world. The quality of young plants derived 
through micropropagation was excellent. The 
quality of plants from tissue culture was consid- 
erably better than those propagated via conven- 
tional methods of propagation. The technology 
developed ensures the best possible quality, 
which is also reflected in the bamboo tissue 
culture standards in India. Some of these 
practices are: 

1. Micropropagation cultures are only initiated 
from selected mother plants kept in green- 
houses with known phenotypes. Only adult 
plants were used as explant. 

2. Cultures for large-scale production need to be 
changed yearly; high-production genotypes 
are multiplied maximum of 16 subcultures 
using axillary branching. 
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3. During dispatch detailed labeling of the kind 
and variety of the plants. 

4. Propagation rates (multiplication rates) are 
kept below five. Overproducing cultures are 
not used in production. 

5. During the transfer of cultures, only the quali- 
tatively best cultures are maintained; the oth- 
ers are simply discarded. 


16.3 Somaclonal Variation 


In vitro-regenerated plants show variations; it can 
be genetic (permanent) or epigenetic (reversible 
and temporary). This phenomenon is known as 
“somaclonal variation,’ and the frequency of 
somaclonal variation (SV) is usually elevated far 
beyond that expected in nature. Bird (2007) 
defined epigenetic events as the structural adapta- 
tion of chromosomal regions so as to register, sig- 
nal, or responsible for altered activity. Epigenetic 
variation in vitro reflects the adaptation process 
of cells in a different environment which includes 
the response to signals that may trigger button in 
the developmental program. However, such adap- 
tive alterations may compromise the genetic 
makeup for which the plant cells or tissues were 
cultured. Therefore, a detailed study is necessary 
of the epigenetic occurrence and its consequences 
likely to occur in the longer term. Chromatin 
structure may change by epigenetic mechanisms 
such as DNA methylation, histone modifications 
(histone methylation, histone deacetylation, 
phosphorylation, and carbonylation), RNA inter- 
ference (RNAi), and ultimately lead to pheno- 
typic variation. 

DNA methylation affects genomic imprinting, 
chromosome inactivation, silencing of transpo- 
sons, and other repetitive DNA sequences. DNA 
methylation can be divided into three different 
types in the reference sequence of cytosine, high 
methylation at CG sites, medium methylation at 
CHG sites, and lowest methylation at CHH sites 
(H =A, C, or T) (Feng et al. 2010). In comparison 
to eukaryotic organisms, plants and _ well- 
developed multicellular organisms show high 
DNA methylation, may be due to a complex 
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development system requiring additional epigen- 
etic regulation to manage the development of 
many different types of cells. Histone deacety- 
lases (HDACs) are involved in eukaryotic gene 
regulation by catalyzing the acetyl groups and 
deletion from lysine residues on histone and this 
way transcriptionally repress gene expression. 
The changes in chromatin structure influence 
gene transcription in response to diverse exoge- 
nous and endogenous stimuli like pathogen 
attack, water, heat, cold, light, and hormones 
(Anzola et al. 2010). Hyperacetylation of his- 
tones 1s involved with active gene expression and 
hypoacetylation with gene repression. It was 
found that H3K27me3 regulates many genes that 
act independently of other epigenetic pathways 
like DNA methylation and smRNAs. SmRNAs 
not only influence at the posttranscriptional level 
but also affect RNA-directed DNA methylation. 
During grafting of Arabidopsis, it was known 
that small interfermg RNA (siRNA) and 
microRNA (miRNA) can be responsible for epi- 
genetic changes in the genome of recipient cells 
(Carlsbecker et al. 2010). Let us understand the 
process by the schematic diagram as variation 
occurs at different developmental stages of the 


Fig. 16.5 In vitro plant 
regeneration and epigen- 
etic variation at dedifferen- 
tiation and differentiation 


PGRS, light, temperature, nutrients 
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regeneration, at each stage different genes involve 
in the process (Fig. 16.5). 


16.3.1 Dedifferentiation 


The regeneration of plants through differentia- 
tion is a clear example of the plasticity of plant 
cells. Cells regenerate in response to certain 
environmental signals, and cells acquire capa- 
bility for undergoing dedifferentiation process 
followed by new developmental pathway. 
Histone methylation activity is essential for the 
generation of the dedifferentiated state into the 
cell cycle. During the cell cycle, change in chro- 
matin structure and histone modifications 
occurs, and some of them are plant specific. 
Berdasco et al. (2008) treated undifferentiated 
cultures of Arabidopsis cell with a demethylat- 
ing drug and found out many hypermethylated 
genes in callus and cell suspensions. Promoters 
of the MAPK12, GSTU10, and BXLI1 genes 
were closely hypermethylated in callus and cell 
suspensions, at the same time as the TTGI, 
GSTF5, and SUVH8 genes occasionally become 
hypermethylated only in cell suspensions 


Culture of different plant parts 





<< MAPK12, GSTU10, and 
BXL1 





DNA damage and repair 


<(_ J SPL 
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(Fig. 16.5). In long-term suspension cultures, it 
was seen that euchromatin was hypermethyl- 
ated, and heterochromatin was DNA hypometh- 
ylated resulting in activation of specific 
transposable elements, and its reprogramming 
of proliferating cells causes DNA methylation, 
histone modifications, and RNA. 


16.3.2 Differentiation 


During organogenesis the proliferating cells dif- 
ferentiate when growth regulators (specific 
changes) are added to the culture medium and the 
process of plant restructuring takes place. 
Polycomb-group (Pc-G) proteins are known as 
epigenetic regulators and are key regulators for 
differentiation pathways. The miRNAs facili- 
tated embryogenesis in Arabidopsis by prevent- 
ing advance expression of differentiation 
promoting transcription factors Squamosa 
Promoter Binding Protein-Like (SPL) (Fig. 16.5). 
During in vitro regeneration, reset of epigenetic 
reprogramming may be needed to erase the 
effects of epigenetic marks caused by external 
stimuli. An accumulation of epigenetic changes 
during successive cell divisions may increase the 
risk responsible for dangerous epigenetic alleles. 
Ex vitro clonally propagated plants are also prone 
to these changes. 


16.3.3 Methodologies for Detecting 
Epigenetic Variation 


The most common tools to analyze somaclonal 
variations are random amplified polymorphic 
DNA (RAPD), inter-simple sequence repeat 
(ISSR), simple sequence repeat (SSR), and DNA 
methylation markers; however, RAPD has a 
problem with reproducibility. Variation in the 
plants cultured in vitro has been analyzed 
mainly by AFLP (amplified fragment length 
polymorphism)-based techniques (Vos et al. 
1995). MSAP has found to be the best method for 
finding changes in cytosine; however, it has 
limitations like it can only detect a small propor- 
tion of the methylated cytosines and the changes 
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between unmethylated sites and internally meth- 
ylated sites. Another method of quantification of 
SmC can be carried out by HPCE (high perfor- 
mance capillary electrophoresis) and HPLC 
(high performance liquid chromatography) tech- 
niques (Fraga and Esteller 2002); here genomic 
DNA is digested into nucleotides, nucleosides, or 
bases. Through HPCE method, epigenetic varia- 
tion has been detected in some plant species. 
These methods can map genomic DNA methyla- 
tion and histone modifications at unmatched lev- 
els. The tools like chromatin immunoprecipitation 
(ChIP)-chip and ChIP-seq can be used to map 
DNA protein relations (Miguel and Marum 
2011). 

The developmental state of the explant is 
affecting the ability to regenerate and unable to 
regenerate shoots, roots, or embryos from 
somatic tissues. Needle maturation in Pinus 
radiata caused depletion in organogenic capabil- 
ity was due to DNA methylation and low acety- 
lated histone. Under in vitro conditions, DNA 
methylation, histone modification, and other 
regulatory proteins have important roles to play 
in plant development. Postculture behavior and 
genetic, epigenetic effects, and their related 
problems need to be studied. Phonotypical vari- 
ations like dwarfs, color changes or mosaic 
patterns (chlorosis), growth pattern changes with 
the inconsistent multiplication rate, change in 
leaf shape, growth vigor, and changes in produc- 
tivity or prolonged juvenility are observed in 
micropropagated plants. Large numbers of plants 
are produced in vitro; each year and phonotypi- 
cal variations like dwarfs, color changes or 
mosaic patterns (chlorosis), growth pattern 
changes with the inconsistent multiplication 
rate, change in leaf shape, growth vigor, and 
changes in productivity or prolonged juvenility 
are observed in micropropagated plants. Recent 
advances in molecular genetics, as well as more 
detailed physiological and cytological study of 
the effect of tissue culture on genotypic stability 
and plant performance, have allowed a clearer 
understanding of the observed variation. By 
understanding epigenetic regulation, it may be 
possible to avoid the negative consequences of 
variation as it’s the only way to avoid it and there 
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no single recipe to control it. However, epigene- 
tic changes are reversible and the plant regains 
its original makeup, but the duration and time is 
not fixed. In due course, it will be possible to 
deviate/change the regenerant outcomes by 
selecting conditions leading to varied epigenetic 
landscapes. This controlled change opens the 
way to epigenetic engineering in plants. 


16.3.4 Advantages of Somaclonal 
Variations 


e Useful for crop improvement by generating 
variability 

e¢ Creation of additional genetic variation like 
biotic stress resistance 

e Increased and improved production of second- 
ary metabolites 

e Suitable variants for breeding of tree species 
of monoculture 


16.4 Problems 
in Micropropagation 


16.4.1 Plant Status 


The use of mature tissue and older age explants 
for micropropagation is still a very difficult 
task. Generation of cultures using seedling is 
comparatively easy as compared to multiply elite 
germplasm growing in an open-field condition. 
In perennial plants, selection (required character, 
e.g., yield) is made after they gain maturity as 
fruiting starts after 3-5 years (optimum yield). 
Older age mother plants are difficult to multiply 
as aging causes poor response. Tree species/ 
perennial plants growing in an open-field condi- 
tion may be diseased (virus infection in latent 
form) and may cause poor or erratic culture 
behavior. Plants growing in high humid area are 
difficult to multiply under in vitro condition, as 
they are prone to different infections caused by 
bacteria or fungus. After a prolong number of 
culture passages, culture growth/multiplication 
rate declines. 
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16.4.2 Contamination 


Contamination 1s caused by poor inoculation tech- 
nique, unhygienic laboratory condition, or by 
faulty autoclaving. The use of jars, boxes, and test 
tubes with lids/caps is safer as compared to cotton 
plugs for long-term culture maintenance. The poor 
explant sterilization technique can cause culture 
contamination. Contamination after numerous 
passages may be due to hidden microbes in groves 
of explants. The use of antibiotics can inhibit the 
growth of microbes, and its use in the medium can 
save cultures. Contamination can damage cultures 
in large number and lead to poor production. 


16.5 Role of Government 
to Encourage Plant Tissue 
Culture Industry 


In the year 2002—2003, the consumption of TCP 
was 44 million with 41 % of banana share. 
Looking the growth rate, the demand for tissue 
culture plants is expected to increase at a high 
rate of 25-30 %. It may increase to approxi- 
mately 72 million for the year 2003-2004 and 
may reach 144 million in the next 5 years (survey 
report). To support the tissue culture industry, 
various central and state government departments 
have several schemes. Agriculture and Processed 
Food Products Export Development Authority 
(APEDA) under the Ministry of Commerce and 
Industry provides airfreight subsidy up to 25 % to 
tissue culture plants and 50 % subsidy for the 
development of infrastructure (Singh and Shetty 
2011). In addition, 50 % subsidy is given to the 
farmers for purchase of tissue culture fruit crops 
by the Andhra Pradesh State Agriculture 
Department under the Macro Management 
Scheme. The rate of subsidy for export to Europe 
other than CIS countries, North America, and Far 
East, at the rate of Rs. 25 per kg or 25 % of the 
airfreight rate approved by IATA or one third of 
the FOB value whichever is the least (survey 
report). The Government of India has set up a 
national facility for virus diagnosis and quality 
control of tissue culture plants at New Delhi. 
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Procedures and standard parameters by government of India for production of banana by tissue culture 


Superior certified virus free clones 
(DNA sample for reference) 


v 
Culture initiation and multiplication up to 8 passages 


v 
Rooting (in vitro) 


v 


Hardening inside greenhouse /shade 


y 


Nursery 


| 


Testing of plantlets for virus free certification for true to type certification 
(DNA sample) 


, 


Dispach labeled plant container with all the details of variety and institute 
name affixed on the container 


16.6 Conclusions and Future 
Prospective 


Plant tissue culture techniques have an immense 
potential to produce plants of superior quality, 
but this potential has to be fully exploited by 
developing cost-effective protocols for micro- 
propagation. In J. curcas compared to vegetative 
propagation through cuttings, high multiplication 
can be possible through micropropagation. It 
offers a much quicker way of propagation with a 
consistent supply of plantlets throughout the 
year. Limited number of shoot buds and slow 
growth of culture lead to high-cost propagation 
and can be one of the major hindrances to the 
direct use of micropropagation. There is a need to 
use low-cost options like bioreactors and liquid 
culture; they are meant for large-scale culture 
generation in liquid media. The advantages of 
bioreactors include high multiplication rate, 
reduction in medium cost and low power con- 
sumption, reduction in labor, and laboratory 
space. Due to the use of liquid medium in the bio- 
reactor, the medium cost reduces; the estimated 
cost of 1-1 solid MS medium is three times greater 
than for the liquid MS medium. There is the need 


to generate cultures using liquid medium as no 
attempt has been by anyone in J. curcas. There 
are, however, some disadvantages with the use of 
bioreactors. Enormous scope exists for genetic 
improvement of J. curcas through in vitro tech- 
niques, adopting biotechnological tools, includ- 
ing development of transgenic crops to synthesize 
high oil content in conventional oil seeds 
(Mazumdar et al. 2010). Somaclonal variation 
can be a useful tool to generate variability in the 
plant for hybridization. Micropropagated plants 
(are usually with fibrous root system) can be gen- 
erated with a taproot system using grafting. There 
is aneed to monitor micropropagated plant for its 
growth, survival, yield, and disease attach under 
field condition as very limited information is 
available. Long-term data will be useful for com- 
mercial growers/farmers and will define their 
future use in large-scale propagation/plantation 
generation. 
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Abstract 

Present-day cultivars of crop species have become susceptible to a range 
of biotic and abiotic stresses because of their narrow genetic base. The 
genes imparting resistance to these stresses are no more available within 
the cultivated species but are present in many wild and weedy relatives. To 
transfer desirable genes to the cultivars, the production of wide hybrids is 
an important pre-breeding requirement. One of the major limitations of 
wide hybridization is the presence of strong crossability barriers that oper- 
ate before as well as after fertilization. During the last several decades, 
many biotechnological methods have become available, and their integra- 
tion into the traditional methods of wide hybridization greatly increases 
the efficacy and reduces the time and efforts needed. Some of these include 
pollen storage, application of growth substances, stump pollination, pla- 
cental pollination, in vitro fertilization and embryo rescue in the form of 
ovary, ovule and embryo culture. This review highlights the efficacy of 
these techniques in realizing wide hybrids and emphasizes the use of com- 
bination of techniques to make these approaches more successful. The 
interest on production and use of sexual wide hybrids has greatly reduced 
after the development of the techniques of somatic hybridization and 
genetic transformation. Although they provide powerful technologies to 
achieve some specific objectives of the breeding program, they cannot 
replace sexual hybridization for gene transfer. It is important for the plant 
breeders to refine and exploit fully some of the underexploited techniques 
described in this review in raising wide sexual hybrids. 
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e Wide hybrids 


17.1. Introduction 

The transfer of desirable genes to crop species 

from other accessions or species through sexual 

hybridization has been one of the most effective 
crop improvement programs (Goodman et al. 

1987). Until recently, the major objectives of crop 

improvement programs have been on increasing 

the yield and improving the quality and genetic 
uniformity of the cultivars. Breeders have been 
able to achieve remarkable success on these lines 
in almost all crop species. Intensive breeding and 
selection for economic traits over the years have 
greatly reduced genetic variability in crop species. 

Because of their narrow genetic base, present-day 

cultivars have become susceptible to a range of 

biotic and abiotic stresses. Although most of the 
cultivars have the genetic potential for high yield 
under optimal agronomic conditions, — their 
increased susceptibility to the stresses brings down 
the yield substantially (Boyer 1982). Broadening 
the genetic base of cultivated species, therefore, 
has become an important pre-breeding require- 
ment to safeguard crop yield in the coming years. 

In recent years, the objectives of the breeding 
program, apart from improvement in yield and 
quality, have become more diverse. Some of 
these additional breeding objectives are: 

1. To breed varieties tolerant/resistant to biotic 
and abiotic stresses. 

2. To breed varieties that have lower require- 
ment for environmentally unfriendly chemi- 
cals particularly pesticides and herbicides. 
Excessive use of agrochemicals has been a 
major environmental problem of modern agri- 
culture. The trend is to grow crops ‘organi- 
cally’ without the use of environmentally 
unfriendly agrochemicals. 


3. To breed varieties that can be grown on mar- 
ginal lands. A large area of agricultural land is 
being degraded, particularly in developing 
countries, due to water logging, soil erosion, 
salinity, alkalinity and contamination with 
industrial chemicals. The recovery of such 
vast areas of land is not feasible because of the 
lack of suitable technologies and/or the high 
cost involved. Breeding crops that can be 
grown on marginal lands has become an alter- 
nate strategy. 

Because of the constant erosion of genetic 
diversity in our crop species, genes imparting the 
required traits are no longer available within the 
cultivated species. A large number of wild and 
weedy relatives of crop species, however, form a 
good repository of such desirable genes (Harlan 
1976). The plant breeders, therefore, often have 
to extend the breeding program across the species 
limits to tap genes of the wild and weedy species 
(wide hybridization). A good number of exam- 
ples of successful gene transfer through wide 
hybridization are already available (Hawkes 
1977; Hadley and Openshaw 1980; Stalker 1980; 
Goodman et al. 1987; Hermsen 1992; Kalloo 
1992; Chopra et al. 1996; Sareen et al. 1992; 
Wang et al. 2005; Prakash et al. 2011). The pri- 
mary requirement for transferring genes from 
wild species to the cultivars is the production of 
wide hybrids (hybrids from distantly related 
species). 

In addition to the transfer of desirable genes 
from wild species to the cultivars, wide hybrid- 
ization is also one of the standard approaches to 
develop new cytoplasmic male sterile lines 
(through cytoplasmic substitution), which are 
important for crop species (Labana et al. 1992; 
Shivanna and Sawhney 1997; Shivanna 2003; 
Prakash et al. 2011). Another application of wide 
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hybrids is to develop new alloploid crops such as 

triticale (Triticum x Secale) and Raphanobrassica 

(Raphanus x Brassica) (Sareen et al. 1992). Wide 

hybrids are also useful to study a range of prob- 

lems of traditional and molecular cytogenetics, 
particularly for elucidating homology (Prakash 

et al. 2011). 

The presence of strong crossability barriers 
between the species is the major constraint in any 
wide hybridization program. Early plant breeders 
used to resort to mass pollination, generally in 
thousands, to realize a few wide hybrids. Now, a 
number of biotechnological approaches have 
become available to overcome crossability barri- 
ers. Integration of these techniques with conven- 
tional breeding program greatly improves the 
efficiency of wide hybridization and reduces 
markedly the time and cost of breeding. This 
review briefly discusses some of the important 
biotechnological approaches that have been used 
to produce wide hybrids through sexual pathway 
and highlights their importance in the coming 
decades. 

The following are a few general guidelines to 
be followed to achieve success in any wide 
hybridization program: 

1. It is desirable to try different accessions of 
both the parents in crossing experiments since 
genetic variability within the species may 
affect the intensity of the barriers. Some of the 
accessions/genotypes may turn out to be more 
compatible when compared to others. 

2. It is also necessary to check the viability of 
the pollen sample used for crosses and the 
receptivity of the stigma at the time of pollina- 
tion (Shivanna and Rangaswamy 1992). If 
possible, it is better to use pollen grains col- 
lected from freshly dehisced anthers. 

3. Many of these crosses show unilateral incom- 
patibility (de Nettancourt 2001); it is, there- 
fore, better to make crosses in both the 
directions. 

4. Recent studies have shown that knowledge 
on the phylogenetic relationships of the culti- 
var with its wild relatives would help in 
selecting the wild parent which is likely to be 
more successful (Kubota et al. 2011). For 
example, in Hydrangea, interspecific crosses 
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were found to be successful up to an average 
genetic distance of 0.01065. The crosses are 
likely to fail when the average genetic dis- 
tance is 0.01385 and higher (Mendoza et al. 
2013a, b). Similarly, in wheat-rye hybrids, 
significant differences in the frequency of 
normal embryo development were found 
when different lines of rye were used in rye x 
wheat crosses (Taira et al. 2011). It would 
therefore be helpful to check the available 
information on phylogenetic relationship of 
the selected species and use closely related 
parents for making crosses. 


17.2  Crossability Barriers 


Crossability barriers operate at different levels 
before and after fertilization (Sastri 1985; 
Shivanna 1997; de Nettancourt 2001). One of the 
common barriers particularly in distantly related 
species is the physical barrier imposed by geo- 
graphical and/or temporal isolation of the parent 
species. The male and female parents do not 
grow at the same place and/or their flowering 
may not be synchronous, thus preventing effec- 
tive pollination. Pre-fertilization barriers inhibit 
pollen germination or pollen tube entry into the 
stigma or subsequent growth of the pollen tubes 
before reaching the ovule. More often these bar- 
riers operate in a cumulative way; the proportion 
of pollen grains that complete germination or 
post-germination growth of pollen tubes at differ- 
ent levels may get reduced at each step, resulting 
in the failure of fertilization. In multiovulate sys- 
tems, fertilization may occur in a few ovules 
which may not be sufficient to activate the devel- 
opment of the fruit and thus no seeds are 
realized. 

Post-fertilization barriers operate at various 
levels: abortion of hybrid embryos at preglobu- 
lar/globular/later stages, failure of hybrid seeds 
to germinate or the seedling to grow up to the 
flowering, hybrid sterility and lack of recombina- 
tion. Post-fertilization barriers also operate in a 
cumulative way with the result that no usable 
hybrid is realized. A number of biotechnological 
approaches are now available to overcome such 
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barriers. Identification of the barrier(s) would 
help in the selection of effective method(s) to 
circumvent the barrier(s). 


17.3 Methods to Overcome 
Physical Barriers 


17.3.1 Pollen Storage 


Pollen storage is one of the simple and effective 
methods to overcome physical barriers imposed 
by temporal and spatial isolation of the parent 
species. A number of techniques are now avail- 
able to store pollen grains for extended periods in 
viable condition. When once a suitable technique 
is standardized for a given species, pollen grains 
of the male parent can be stored and used rou- 
tinely for pollination when the female parent 
flowers. Apart from its application to overcome 
physical barriers, successful pollen storage elimi- 
nates the need to grow pollen parent continuously 
in the breeding program. Another important 
application of pollen storage relevant to the 
breeding program is that it provides a convenient 
and simple means of pollen exchange amongst 
breeders within and between countries. Many of 
the horticultural societies such as the American 
Rhododendron Society maintain their own pollen 
banks which are accessible to their members 
(Mayer 1983). Pollen grains are generally free 
from pathogens even when the parent plant is 
infected. Except for some viral diseases, there are 
no authentic reports of systemic transmission of 
fungal and bacterial diseases through pollen 
(Mink 1993). Therefore, the quarantine restric- 
tions for the exchange of germplasm through pol- 
len are much less when compared to seeds and 
vegetative parts of the plant. The utility of ‘pollen 
banks’, which would ensure the availability of 
pollen of the desired species/variety at any time 
of the year and at any place, has been emphasized 
since long. Pollen banks would greatly facilitate 
the breeding program, particularly of tree species 
which have to complete their juvenile phase, 
often lasting for several years, before flowering. 
Extensive literature, available on pollen storage, 
has been reviewed regularly (Johri and Vasil 
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1961; King 1965; Stanley and Linskens 1974; 
Shivanna and Johri 1985; Towill 1991; Hanna 
and Towill 1995; Barnabas and Kovacs 1997; 
Ganeshan and Rajasekaran 2000; Shivanna 
2003). Many of these reviews list the pollen spe- 
cies stored for various periods under different 
storage conditions. The following is a brief 
description of the important methods used to 
store pollen grains of various species: 

Storage of pollen under low temperature (+4 to 
—20 °C) and humidity (<10 % RH) conditions 
is one of the simple and commonly used meth- 
ods particularly for horticultural species. 
Pollen grains are kept in small unsealed vials 
and stored in a desiccator or a suitable airtight 
vial containing an appropriate dehydrating 
agent such as dry silica to maintain low RH 
(Shivanna and Johri 1985; Shivanna and 
Rangaswamy 1992). The sealed desiccators 
are then kept in a refrigerator or a deep freeze. 
This method is very convenient and effective 
for short-term storage (for a few weeks/ 
months) and has been used extensively by 
amateur horticulturists. 

Storage under subfreezing temperatures (ca 
—20 °C) is effective for storing pollen grains 
of several species for more than a year. Frozen 
pollen of Rhododendron in home freezer has 
been reported to be viable for 3 years (Mayer 
1983). Pollen grains of cereals in general can- 
not withstand desiccation and need to be 
stored under high RH in the refrigerator. Even 
under these conditions, viability of cereal pol- 
len lasts only for a few days (Shivanna and 
Heslop-Harrison 1981). 

Storage of freeze-dried/vacuum-dried pollen is 
an effective method for long-term storage. 
Freeze-drying involves rapid freezing of pol- 
len (—60 to —80 °C) and gradual removal of 
water under sublimation. In vacuum-drying, 
the pollen is subjected to simultaneous cool- 
ing and vacuum-drying. Freeze-dried and 
vacuum-dried pollen grains generally do not 
show any differences in their responses to 
storage. The freeze-dried pollen is usually 
stored at sub-zero temperatures. For effective 
use of this method, optimum pollen water 
content, duration of drying and subsequent 
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rehydration have to be optimized (Barnabas 
and Kovacs 1997). Freeze-drying method has 
been effective for long-term storage of pollen 
grains of a number of species (King 1965; 
Towill 1991). However, in recent years, this 
method has not been very popular probably 
because of the success achieved by using a 
simpler technique of cryopreservation. 
Cryopreservation of pollen is another effective 
method for long-term pollen storage and has 
become more popular than freeze-drying 
method in recent years. In this method, pollen 
grains are dried to bring their water content 
below a threshold level and stored in liquid 
nitrogen (Towill 1991; Barnabas and Kovacs 
1997). Initial attempts to cryopreserve pollen 
grains of cereals were not successful largely 
because of their susceptibility to desiccation 
(which is critical for cryopreservation). Unlike 
the earlier methods used for drying pollen 
grains over a desiccant, Barnabas and her 
associates used a ‘pollen drier’ in which the air 
of 20 °C and 20-40 % humidity is blown 
through pollen, facilitating a rapid but gentle 
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and uniform drying (Barnabas and Kovacs 
1997). Through this drying method, it was 
possible to successfully cryopreserve the 
pollen grains of many cereals for several 
years. 


17.4 Methods to Overcome 
Pre-fertilization Barriers 


A number of techniques have been developed over 
the years to overcome pre-fertilization barriers that 
operate after pollination but before fertilization. 
Some of these methods along with a few examples 
are presented in Table 17.1. Additional examples 
are found in Van Tuyl and De Jeu (2003) and 
Shivanna (2003). According to Bates and Deyoe 
(1973), the inhibition reaction of pollen in the pis- 
til is analogous to immunochemical reaction found 
in animals. On the basis of this hypothesis, the 
flowers were treated with some immunosuppres- 
sors such as E-amino caproic acid, salicylic acid or 
acriflavin (using similar methods used for treat- 
ment with growth substances), and a few wide 


Table 17.1 Details of in situ pollination methods used to produce hybrids with some examples 


Effective methods 
Application of growth substances 


Growth substances (auxins, cytokinins and gibberlins) 
applied around the pedicel of the flower or to the 
wound caused by removing one of the sepals/petals, at 
the time of pollination or soon after pollination. This is 
one of the oldest methods tried and has been successful 
in several crosses. Growth substances delay floral 
abscission and facilitate slow-growing pollen tubes to 
effect fertilization 


Mentor pollen 
Pollinated with a mixture of mentor pollen (compatible 


Some examples 


Interspecific crosses of Vigna (Chen et al. 1978), 
Agropyrum (Alonso and Kimber 1980), Triticum 
(Mujeeb-Kazi 1981), Arachis (Sastri 1985; Sastri and 
Moss 1982; Sastri et al. 1983), Corchorus (Park and 
Walton 1990) and Hordeum (Subramanyam 1999) 


Very effective in several crosses of Populus (Knox et al. 


pollen made ineffective to effect fertilization, generally 
prepared by exposing pollen to high doses of 
irradiation) and viable pollen of the male parent 


Stump pollination 


The stigma and a part or the whole of the style are 
excised, and pollen grains are deposited at the cut end 
of the style or the tip of the ovary. This reduces the 
distance through which pollen tubes have to grow to 
reach the ovules. Application of suitable medium on 
the stump before pollination may facilitate pollen 
germination 


1972; Stettler 1968; see Villar and Gaget-Faurobert 1997). 
In wide crosses of Sesamum, it was effective in 
overcoming the inhibition of pollen germination but not of 
pollen tube growth in the style (Sastri and Shivanna 1976). 
Not effective for Lilium (Van Tuyl and de Jeu 2003) 


Interspecific crosses of corn (Heslop-Harrison et al. 1985), 
Lilium (van Tuy] et al. 1991; Janson et al. 1993), Lathyrus 
(Herrick et al. 1993), Fritillaria (Wietsma et al. 1994) 


(continued) 
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Table 17.1 (continued) 


Effective methods 

Style grafts 

Pollination is carried out on compatible stigma, and 
after pollen grains have germinated and pollen tubes 
have grown through a part of the style, the style is 
excised in front of the growing pollen tubes and grafted 
onto incompatible pistil. This is a delicate operation 
and can be done only in species with robust pistils. In 
Lilium, grafting was carried out using a straw filled 
with the exudates of compatible stigma 


Bud pollination 


Incompatible pollination carried out in the bud stage 
when the factors that induce crossability barriers are 
not likely to be fully developed. This approach has not 
been used frequently; as it is one of the simplest 
techniques, it may be worthwhile trying 


Intraovarian pollination 


Injection of pollen grains suspended in a suitable 
medium directly into the ovary. This is another simple 
technique which has not been tested with additional 
systems. Lack of ovarian cavity in many species is a 
limitation 


hybrids were realized in some cereals and legumes 
(Van Tuyl and De Jeu 2003; Shivanna 2003). 
However, so far, there are no evidences to indicate 
the involvement of immunochemical reactions in 
crossability barriers, and there have been no addi- 
tional reports on the success of immunosuppres- 
sors in realizing wide hybrids. 


17.4.1 In Vitro Pollination of Cultured 
Ovules 


In this method, instead of carrying out pollination 
on the stigma, pollen grains are deposited directly 
on the ovules. In one of these methods, a group of 
ovules are excised from the ovary and cultured on 
a nutrient medium (Kanta et al. 1962; Kameya 
and Hinata 1970); the ovule mass is dusted with 
pollen grains. In another method termed placen- 
tal pollination, the entire mass of ovules intact on 
the placenta(e) together with a short length of 
pedicel is excised and cultured by inserting only 
the pedicel in the medium (Rangaswamy and 
Shivanna 1967). Pollen grains are dusted on cul- 
tured ovule mass (Shivanna and Rangaswamy 
1992). As this technique eliminates pollen-pistil 
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Some examples 


Interspecific crosses of Lilium (Van Tuy] et al. 1991; Van 
Tuyl and De Jeu 2003) 


Interspecific crosses of Nicotiana (Kuboyama et al. 1994) 


Interspecific crosses of Argemone (Maheshwari and Kanta 
1961) 


interaction altogether and brings pollen grains in 
direct contact with the ovules, it is likely to be 
more effective than other methods to overcome 
pre-fertilization barriers. In successful pollina- 
tions, pollen grains germinated on the ovule 
mass; pollen tubes entered the ovules and effected 
fertilization. The fertilized ovules developed into 
seeds. Ovule pollination has been used success- 
fully to produce interspecific as well as interge- 
neric hybrids particularly by Zenkteler and his 
associates (Zenkteler 1980, 1990; Zenkteler and 
Bagniewska-Zadworna 2001). Some of the suc- 
cessful crosses include Melandrium album x M. 
rubrum, M. album x Silene schafta (Zenkteler 
1980, 1990) and several interspecific crosses of 
Nicotiana (Reed and Collins 1978; DeVerna 
et al. 1987; Zenkteler 1990). Attempts were also 
made to obtain interspecific/intergeneric hybrids 
in Brassicaceae (Kameya and Hinata 1970; 
Zenkteler 1990; Zenkteler et al. 1987). Hybrids 
were realized in a few combinations, but in some 
others, although there was normal fertilization, 
hybrid embryos degenerated. Isolation and cul- 
ture of embryos on a suitable nutrient medium 
would probably enable the production of hybrids 
in such crosses also. In an interesting treatment, 
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placental cultures of some flowering plants were 
pollinated with pollen grains of a few gymno- 
sperms; pollen grains of Pinus and Ephedra 
could germinate on placental cultures of several 
flowering plants (Zenkteler and Bagniewska- 
Zadworna 2001). Although some pollen tubes 
were occasionally seen entering the ovules, no 
hybrid embryos were realized. 


17.4.2 In Vitro Fertilization 


Fertilization in flowering plants takes place deep 
inside the ovule, and this is one of the limitations 
to conduct experimental studies on fertilization. 
Following the dramatic progress of protoplast 
technology of somatic cells, a number of investi- 
gators started work on isolation of male and 
female gametes of flowering plants in the 1980s. 
Success was soon achieved in isolation of proto- 
plasts of embryo sacs and sperm cells of a number 
of species and to keep them in viable condition for 
considerable length of time (Cass 1997; Mathys- 
Rochon et al. 1997). In three-celled pollen, sperm 
cells were isolated by incubating pollen grains in 
hypotonic medium that results in bursting of pol- 
len grains or mechanically rupturing the pollen 
grains by gentle grinding in an isolation medium. 
The released sperm cells were purified and 
washed suitably. In two-celled pollen, a semi- 
vitro method (Shivanna and Rangaswamy 1992) 
was used to allow pollen tubes to grow partly 
inside the style and then emerge into a nutrient 
medium. This was followed by the treatment of 
pollen tubes with cell wall-degrading enzymes or 
subjecting pollen tubes to osmotic shock to the 
release of sperm cells from the tip of pollen tubes 
(Shivanna et al. 1987). Subsequently it was pos- 
sible to isolate the components of the embryo sac 
(egg, synergids, and central cell). By 1990s, all 
the basic requirements needed to try in vitro fertil- 
ization in flowering plants were available. 

Kranz and his associates were the first to 
achieve in vitro fertilization in maize by bringing 
isolated egg and sperm cells together under the 
microscope in microdroplets of the fusion 
medium (Kranz and Lorz 1993; Kranz 1997; 
Okamoto and Kranz 2005). In vitro formed 
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zygotes were grown successfully by using nurse 
culture method (by culturing zygotes on a semi- 
permeable membrane placed on fast-growing 
non-morphogenetic cell suspension cultures 
derived from maize embryos or microspores) into 
plantlets and eventually into adult fertile plants. 
Subsequently the sperm cell was also fused with 
the central cell; the fusion product did not give 
rise to the embryo but gave rise to an unorganized 
tissue comparable to the endosperm in vivo. 
Subsequently several modifications in the proto- 
cols to improve the efficacy of fusion and embryo 
development have been reported (Kranz 1997; 
Wang et al. 2006; Kranz et al. 2008). 

Apart from its application in tackling a range 
of problems fundamental to fertilization (see 
Okamoto and Kranz 2005; Wang et al. 2006), one 
of the most obvious practical applications of 
in vitro fertilization is in realizing wide hybrids. 
Although the success has so far been confined in 
achieving fertilization and embryo development 
between gametes of compatible species, studies 
of Kranz and his associates (Kranz 1997, 2008; 
Scholten and Kranz 2001) have shown that there 
is no technical difficulty in achieving in vitro fer- 
tilization between isolated egg and sperm cells in 
interspecific and intergeneric combinations. 
However, the development of zygotes is restricted 
to crosses between closely related species (Kranz 
and Dresselhaus 1996). Hybrid zygotes between 
maize (egg donor) and sperm cell of many other 
members of Poaceae gave rise to multicellular 
structures, while those resulting from the egg of 
maize and sperm of a distant species, Brassica, 
failed to divide. Further studies are needed to 
extend this technology to other species and to 
realize useful interspecific and intergeneric 
hybrids from in vitro fused zygotes. 


17.5 Methods to Overcome 
Post-fertilization Barriers 


Post-fertilization barriers operate after fertilization. 
Depending on the extent of reproductive isolation 
between the parent species, the embryo abortion 
may initiate at a very early stage of development or 
after the growth of embryo to different stages. 
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17.5.1 Embryo Rescue 


Embryo rescue has become the most effective 
and routinely used technique to overcome 
post-fertilization barriers. When the abortion 
occurs at a very early stage, it is difficult to 
excise the embryo and also to culture it suc- 
cessfully as its nutrient requirements are more 
complex and precise. Embryo excision is a 
problem even at later stages when the develop- 
ing seeds are very small. In such instances, it 1s 
more convenient to culture the whole ovule or 
even the ovary. Ovule and ovary culture facili- 
tates embryo growth in situ without exposure 
to in vitro disturbances. In crosses where the 
embryo aborts at later stages, embryo culture 1s 
the most ideal. There are a large number of 
successful hybrids realized through embryo 
rescue (see also Chap. 18 of this volume). For 
details of the technique and comprehensive 
examples, the reader may refer to Maheshwari 
and Rangaswamy (1965), Raghavan (1977, 
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1986, 1999), Rangan (1982), Williams et al. 
(1987), Sharma et al. (1996) and Van Tuyl and 
De Jeu (2003). A modified technique of embryo 
rescue termed ‘sequential culture’ has been 
reported to be more effective than either ovary 
or ovule culture alone in realizing many wide 
hybrids (Nanda Kumar et al. 1988; Agnihotri 
1993; Shivanna 2000). In sequential culture, 
ovaries are cultured 4—8 days after pollination; 
cultured ovaries are taken out 7—10 days after 
culture, dissected under aseptic conditions, and 
enlarged ovules (young seeds) are then re- 
cultured on a fresh medium. In successful 
crosses, cultured ovules grow further and ger- 
minate in vitro. In some crosses, it was neces- 
sary to dissect the embryo from cultured ovules 
and re-culture the embryo. Some examples of 
wide hybrids realized through ovary, ovule, 
embryo and sequential cultures are presented 
in Table 17.2. In a few crosses of legumes, 
hybrid embryos have been transplanted into the 
developing endosperm of compatible seeds 


Table 17.2 Some wide crosses produced through embryo rescue 


Effective methods 
Ovary culture 


Pollinated ovaries are excised before the onset of 
embryo abortion and cultured on a suitable nutrient 
medium 


Some examples 


Phaseolus (Sabja et al. 1990), Lilium (Van Tuy] et al. 1991), 
Eruca X Brassica hybrids (Agnihotri et al. 1990a), Brassica 
(Gundimeda et al. 1992), Brassica napus x Raphanobrassica 


(Agnihotri et al. 1990b) 


Ovary slice culture 


Pollinated ovary, a few days after pollination, is 
transversely cut into slices and the slices along with 
attached ovules are cultured 

Ovule culture 

Ovules are excised from pollinated ovaries before 
the initiation of embryo degeneration and cultured. 
In some crosses, cultured ovules are re-cultured on 
a different medium 


So far confined to Lilium (Kanoh et al. 1988; Van Tuy] et al. 
1991; Janson et al. 1993; Obata et al. 2000) 


Alstroemeria (Bridges et al. 1989; De Jeu et al. 1992), 
Cyclamen (Ishizaka and Uematsu 1992), Nicotiana (Reed and 
Collins 1978; Iwai et al. 1986), Lilium (Ikeda et al. 2003), 
Gentiana (Morgan 2004), Hylocereus (Cisneros and Tel-Zur 


2010), Asteraceae species (Wang et al. 2014) 


Embryo culture 


Embryo is dissected from fertilized ovules and 
cultured 


Allium (Nomura and Oosawa 1990), Lilium (Van Tuyl et al. 
1991), Solanum (Singsit and Hannaeman 1991), Gossypium 


(Mehetre and Aher 2004), Hylocereus (Sage et al. 2010) 


Sequential culture 


Culture of pollinated ovary, followed by culture of 
ovules from cultured ovaries and finally culture of 
embryos from cultured ovules. The time gap between 
each varies. In some species, culture of ovary and 
then ovule was enough to recover hybrids, while in 
some other species, culture of ovary, ovule and then 
embryo was successful in recovering hybrids 


Brassica (Nanda Kumar et al. 1988; Agnihotri et al. 1990a; 
Gundimeda et al. 1992; Nanda Kumar and Shivanna 1993; 
Vyas et al. 1995), Medicago (McCoy and Echt 1993), Lilium 
(Van Tuyl and De Jeu 2003) 
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(after removing compatible embryo), and such 
transplanted embryo-endosperm complex has 
been successfully cultured (Williams and 
Latour 1980). 


17.5.2 Wide Hybrids in Brassica 


Brassica is an important oilseed crop in several 
countries. All the three major cultivated species, 
B. rapa, B. juncea and B. napus, are susceptible 
to a number of diseases, pests and abiotic stresses 
and result in low productivity. The genes impart- 
ing resistance to these stresses are not available in 
the cultivated species, but many of their wild rel- 
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atives possess such genes (see Prakash et al. 
2011). A large number of wide hybrids have been 
produced in crop brassicas with many of their 
wild relatives through embryo rescue, especially 
from the University of Delhi in collaboration 
with the Indian Agricultural Research Institute, 
New Delhi (Shivanna 1995, 2003; Prakash et al. 
2011). The details of the crossability barriers 
between all the cultivated species and 12—20 wild 
relatives have been documented by Singh et al. 
(2007). Of the 100 cross combinations tested, 73 
crosses showed pre-fertilization barriers and 27 
crosses showed post-fertilization barriers. All the 
wide hybrids produced by this group through 
embryo rescue are listed in Table 17.3. 


Table 17.3 Wide (interspecific and intergeneric) hybrids of Brassica realized through embryo rescue 


Sr no. Hybrids 

1 Brassica rapa X Erucastrum gallicum 
wi B. rapa X E. abyssinicum 

3 B. juncea x Sinapis pubescens 
4 B. juncea x B. gravinae 

5 B. juncea x Enarthrocarpus lyratus 
6 B. napus x B. gravinae 

i Eruca sativa Xx B. rapa 

8 Brassica napus x Raphanobrassica 
9 B. cossoneana xX B. rapa 

10 B. cossoneana xX B. carinata 
1] B. cossoneana X B. juncea 

12 B. cossoneana x B. napus 

13 B. cossoneana xX B. oleracea 
14 B. fruticulosa x B. rapa 

15 B. maurorum xX B. rapa 

16 B. maurorum x B. nigra 

17 B. maurorum x B. juncea 

18 B. maurorum x B. oleracea 

19 B. maurorum x B. carinata 

20 Diplotaxis assurgens x B. rapa 
21 D. berthautii x B. rapa 

22 D. catholica x B. rapa 

23 D. catholica x B. juncea 

24 D. erucoides x B. rapa 

25 D. erucoides x B. juncea 

26 D. erucoides x B. napus 

2). D. erucoides x B. oleracea 

28 D. siifolia x B. juncea 

29 D. siettiana x B. rapa 

30 D. siifolia x B. napus 


Reference 

Batra (1991) 

Rao (1995) 

Gundimeda et al. (1992) 

Nanda Kumar et al. (1989) 

Gundimeda et al. (1992) 

Nanda Kumar et al. (1989) 

Agnihotri et al. (1990a) 

Agnihotri et al. (1990b) 

Verma (1993) 

Verma (1993) 

Verma (1993) 

Verma (1993) 

Verma (1993) 

Nanda Kumar et al. (1988) 

Chrungu et al. (1999) 

Chrungu et al. (1999) 

Chrungu et al. (1999) 

Chrungu et al. (1999) 

Chrungu et al. (1999) 

Vyas (1993) 

Vyas (1993) 

Mohanty (1996) 

Mohanty (1996) 

Vyas et al. (1995) 

Vyas et al. (1995) 

Vyas et al. (1995) 

Vyas et al. (1995) 

Batra et al. (1990) 

Nanda Kumar and Shivanna (1993) 

Batra et al. (1990) 
(continued) 
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Table 17.3. (continued) 


Sr no. Hybrids 

31 D. tenuisiliqua x B. rapa 

32 D. tenuisiliqua x B. juncea 

33 D. tenuisiliqua x B. napus 

34 D. tenuisiliqua x B. oleracea 
35 D. viminea x B. carinata 

36 D. viminea x B. napus 

37 Enarthrocarpus lyratus x B. campestris 
38 E. lyratus x B. carinata 

39 E. lyratus x B. napus 

AO E. lyratus x B. oleracea 

4] Erucastrum abyssinicum xX B. rapa 
42 E. abyssinicum x B. carinata 
43 E. abyssinicum x B. juncea 

44 E. abyssinicum x B. oleracea 
45 E. cardaminoides x B. oleracea 
46 E. gallicum x B. juncea 

47 E. gallicum x B. napus 

48 E. varium x B. rapa 

49 E. varium x B. juncea 

50 E. varium X B. oleracea 


17.6 Bridge Cross Hybrids 


As pointed out earlier, the strength of the cross- 
ability barriers of the cultivar may vary with dif- 
ferent wild relatives. The barriers may not be so 
strong with some of the wild relatives, but such 
species may not have desirable traits; those that 
have desirable traits may show strong barriers 
with the cultivar. Another simple method, referred 
to as bridge cross method, has been used by the 
breeders to circumvent such barriers. In this 
method, the hybrid is produced between the cul- 
tivar and one of the wild species (which does not 
show strong barriers with the cultivar), and this 
hybrid or its amphidiploid is used as the bridge 
species to raise hybrids with another wild species 
(which shows strong barriers with the cultivar but 
has desirable trait). This approach has been very 
effective in a number of crop plants — wheat, 
tobacco, potato, lettuce, sugarcane and Brassica 
(see Shivanna 2003). In some combinations, it 
was possible to realize bridge cross hybrids 
through field pollinations without resorting to 
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Reference 

Vyas (1993) 

Vyas (1993) 

Vyas (1993) 

Vyas (1993) 

Mohanty (1996) 
Mohanty (1996) 
Gundimeda et al. (1992) 
Gundimeda et al. (1992) 
Gundimeda et al. (1992) 
Gundimeda et al. (1992) 
Rao et al. (1996) 

Rao et al. (1996) 

Rao et al. (1996) 

Rao et al. (1996) 
Mohanty et al. (2009) 
Batra et al. (1989) 

Batra et al. (1989) 

Das (1993) 

Das (1993) 

Das (1993) 


application of any special technique. Bridge 
cross technique is simple and very useful particu- 
larly to transfer the cytoplasm from one species 
to the other (Shivanna 1995, 2000), although the 
transfer of nuclear genes through this method 
involves more elaborate breeding program 
(Hadley and Openshaw 1980). The bridge cross 
hybrids raised in Brassica by Delhi group are 
listed in Table 17.4. 


17.7. The Development of New 
Cytoplasmic Male Sterile 
(CMS) Lines in Brassica 


Male sterile lines are very important to exploit 
hybrid vigor in crop plants. By using male sterile 
line as the female parent, intra-line pollinations 
are prevented completely in hybrid seed produc- 
tion plots. Male sterile lines developed through 
different approaches (genic male sterile lines, 
cytoplasmic male sterile lines, male sterility 
induced through application of chemicals and 
also through recombinant DNA technology) are 
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Table 17.4 Wide hybrids in Brassica realized through 
bridge cross method 


Sr. 
no. Hybrid Reference 
1 (Diplotaxis siettiana x Nanda Kumar and 
Brassica rapa) Xx B. juncea Shivanna (1993) 
ys (D. siettiana x B. rapa) x B. Nanda Kumar and 
napus Shivanna (1993) 
3 (D. berthautii x B. rapa) x Vyas (1993) 
B. juncea 
4 (D. tenuisiliqua x B. rapa) x Vyas (1993) 
B. napus 
5 (D. erucoides x B. rapa) x B. Vyas et al. (1995) 
juncea 
6 (Erucastrum abyssinicum x —G.U. Rao 
B. juncea) x B. napus (unpublished) 
7 (E. abyssinicum x B. juncea) G.U. Rao 
x B. nigra (unpublished) 
8 (E. abyssinicum x B. juncea) G.U. Rao 
x B. rapa (unpublished) 
(E. cardaminoides x B. Mohanty et al. 
oleracea) x B. napus (2009) 
(E. cardaminoides x B. Mohanty et al. 
oleracea) x B. carinata (2009) 


available. Of these, cytoplasmic male sterile 
(CMS) lines have several advantages over others 
in producing hybrid seeds (see Shivanna and 
Sawhney 1997; Shivanna 2003). One of the stan- 
dard approaches to produce CMS lines is through 
the development of alloplasmic lines (containing 
the cytoplasm of one species and the nuclear 
genes of another species) as many of them result 
in cytoplasmic male sterility. This involves pro- 
duction of wide hybrids (wild species as the 
female and cultivar as the male parent). As the 
cytoplasm is inherited only through the egg in a 
majority of species, the hybrids possess the cyto- 
plasm of the wild species and the nuclear genome 
of both the parents. The hybrid or its amphiploid 
is repeatedly backcrossed with the cultivar (as 
male parent) to eliminate the nuclear genome of 
the wild species. Although many CMS lines were 
available in crop Brassica (Banga 1992; Shivanna 
2000; Prakash et al. 2011), they show many limi- 
tations such as chlorophyll deficiency, thermo- 
sensitivity of male sterility and presence of 
female sterility. Also, many of them do not have 
suitable restorers needed for hybrid seed produc- 
tion. There has been a need to produce more 
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Table 17.5 Cytoplasmic male sterile (CMS) lines of 
Brassica produced through sexual alloplasmics 


Cytoplasmic Recipient 

donor species Reference 

Diplotaxis Brassica Rao et al. (1994) 

slifolia juncea 

D. siifolia B. napus Rao and Shivanna 
(1996) 

D. erucoides B. juncea Malik et al. (1999) 

D. berthautii B. juncea Malik et al. (1999) 

D. catholica B. juncea Mohanty (1996) 

Erucastrum B. rapa Gundimeda et al. 

lyratus (1992), Deol et al. 


(2003) 


CMS and restorer lines. The Delhi University 
group has been able to produce several new CMS 
lines in the background of the cytoplasm of dif- 
ferent wild species. These CMS lines are listed in 
Table 17.5. 

Restorer lines have been developed subse- 
quently for some of these alloplasmic lines: B. 
juncea carrying Diplotaxis catholica cytoplasm 
(Pathania et al. 2003); B. rapa (Deol et al. 2003), 
B. juncea (Banga et al. 2003) and B. napus 
(Janeja et al. 2003) carrying Enarthrocarpus 
lyratus cytoplasm; and B. juncea carrying D. 
erucoides cytoplasm (Bhat et al. 2005). 


17.8 Multiplication of Hybrids 


The number of hybrids realized in wide crosses 
even after using biotechnological methods is 
rather limited. A large number of hybrids are 
needed for morphological and cytological stud- 
ies, induction of amphiploidy to restore fertility 
and to raise backcross progeny. Hybrids can eas- 
ily be multiplied through the use of in vitro cul- 
ture technique (Nanda Kumar and Shivanna 
1991; Agnihotri et al. 1990a, b; Shivanna 2000). 
Culture of shoot tips or single node segments on 
a medium containing one of the cytokinins is 
effective in inducing multiple shoots. The shoots 
are isolated and cultured on an auxin-containing 
medium to induce rooting. The plantlets are then 
hardened and transferred to the soil. Hybrids can 
also be multiplied through the induction of callus 
from hybrid embryos or hypocotyl segments and 
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subsequent regeneration of plantlets through 
shoot and root regeneration pathway or somatic 
embryogenesis pathway (Nanda Kumar and 
Shivanna 1991; Agnihotri et al. 1990a, b). 


17.9 Concluding Remarks 


The production of wide hybrids is an important 
pre-breeding step in transferring genes across 
species limits. Early breeders, in spite of low rate 
of success, were able to realize a number of wide 
hybrids in many crop species and to transfer 
important traits from the wild species to the culti- 
vars. Integration of some of the tools of biotech- 
nology to the breeding program has made the 
production of wide hybrids more effective, and a 
large number of wide hybrids have been pro- 
duced through this approach. Pollen storage is 
probably the simplest and effective technique to 
overcome nonsynchronous flowering and geo- 
graphical isolation of the parent species. Although 
several pollen banks of many horticultural spe- 
cies have come up to supply pollen for the breed- 
ing programs, pollen banks for crop species are 
yet to be established as a routine facility at the 
international level. Establishment of effective 
pollen banks, similar to seed banks, at global 
level would greatly facilitate the breeding pro- 
grams of crop species. 

There is a scope for further refinement of 
many of the available techniques to overcome 
pre-fertilization barriers. For example, by carry- 
ing out stump pollination or stylar grafting on 
cultured pistils (instead of flowers retained on 
plants), the operations can be performed and con- 
trolled more efficiently. Cultured pistils would 
provide a more convenient system to conduct 
additional treatments such as irradiation or high- 
temperature treatment to the pistil, which have 
not yet been tried seriously to raise wide hybrids. 
Cultured pistils are also ideal to perform intra- 
stylar pollinations (in hollow-styled systems) and 
mixed/mentor pollinations. These treatments can 
be combined with manipulation of the medium 
and other conditions to improve the growth of 
cultured pistils. Also, the full potentials of other 
in vitro techniques particularly in vitro pollina- 


K.R. Shivanna and B. Bahadur 


tion of ovules and in vitro fertilization are yet to 
be fully exploited. 

More importantly, a combination of tech- 
niques integrating the methods used to overcome 
pre- and post-fertilization barriers is likely to 
give better results. In the absence of such integra- 
tion, even when a particular method is effective 
in circumventing pre-fertilization barriers, it may 
not yield hybrids because of the operation of 
post-fertilization barriers. For example, in crosses 
between B. rapa and B. juncea, showing pre- 
fertilization barriers, bud pollination and stump 
pollination increased the frequency of ovules that 
were fertilized. However, hybrids could be recov- 
ered only when bud pollination and stump polli- 
nation were combined with ovule culture (Bhat 
and Sarla 2004). Combination of placental polli- 
nation with embryo rescue would be particularly 
effective in providing suitable conditions for 
hybrid embryos to continue their growth. Thus, 
plant breeders in the coming decades should try 
combination of techniques rather than a single 
technique to raise wide hybrids. 

Although a large number of wide hybrids have 
been realized in a number of crop species, intro- 
gression of desirable traits from wild species to 
the cultivars is limited due to lack of interge- 
nomic chromosome homoeology. This is one of 
the major problems to achieve alien gene transfer 
even after getting wide hybrids. Interspecific 
hybridization and polyploidization in ornamental 
plants are more simple and straightforward, as it 
aims to produce novel cultivars combining the 
traits of both the parents rather than to transfer 
specific traits from one parent to the other (Kato 
and Mii 2012). Even the fertility of the hybrid 
may not be a serious problem in ornamental 
plants as many of them can be propagated through 
vegetative multiplication or through 
micropropagation. 

The interest on wide hybridization through 
sexual pathway has greatly reduced after the 
development of the techniques of somatic hybrid- 
ization and genetic transformation. There are not 
many reports on the production of wide sexual 
hybrids in recent years. Although both somatic 
hybridization and genetic transformation have 
great potential in specific areas of crop 
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improvement program, they cannot replace sex- 
ual hybrids; they can only supplement them. 
Thus, there is a continuous need for the produc- 
tion of sexual wide hybrids in the coming years. 
It is necessary for the breeders to re-establish 
interest in sexual hybrids, particularly wide 
hybrids for transferring desirable traits from wild 
species/accessions to the cultivars. 
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Abstract 

Failure to produce a hybrid may be due to pre- or post-fertilization 
incompatibility barriers. The hybrid embryo, if produced, may abort 
before maturation. In such cases, the embryo resulting from interspecific 
or intergeneric crosses can be potentially rescued and cultured to produce 
a whole plant, thus facilitating transfer of desirable genes from wild rela- 
tives into cultivated species. This approach is especially warranted in hor- 
ticultural crops in situations where one or both parents may be seedless, 
initial fruitlet drop post-fertilization may be very high, the endosperm may 
be inimical, etc. Protocols have been developed for hybrid embryo rescue 
in several top-of-the-line fruit crops suffering from an inability to breed 
naturally in wide crosses. Techniques of hybrid embryo rescue figure 
among the very important commercially applicable breeding strategies 
under non-GM biotechnologies. It is a nonconventional, tissue culture- 
based technique used for creating new genotypes, often used to rescue 
plant embryos from aborting progeny ovules/seeds. Distant crosses are 
often sought to transfer genetic traits from secondary and tertiary gene 
pools to the cultivated, primary gene pool of crop plants. 
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be hampered by _ somatoplastic _ sterility, 
cytoplasmic-genic male sterility and structural 
differences in chromosomes. A very important 
and useful biotechnological tool for raising 
hybrid progeny in intractable crosses is embryo 
rescue. A given cross may fail for any of a num- 
ber of reasons: seedlessness in either or both par- 
ents, inimical influence of endosperm resulting in 
inhibited embryo development, poor fruit set/ 
heavy fruit drop at initial stages of the cross, dis- 
tant crossing (interspecific/intergeneric/intervari- 
etal or ‘intraspecific’ breeding resulting in 
embryo abortion), genetic incongruity of parental 
genomes, mismatch in ploidy levels, etc. Embryo 
culture is one of the earliest forms of in vitro 
culture applied to practical problems and is prob- 
ably the tissue culture technique that has proven 
of greatest value to breeders (Dunwell 1986). 
Technologies using excised hybrid embryo/ 
sequential embryo culture have been developed 
in anumber of horticultural crops (Leela Sahijram 
2004, 2011; Leela Sahijram et al. 2013). The 
underlying principle of embryo rescue technique 
is aseptic isolation of the embryo and its transfer 
to a suitable medium for facilitating its develop- 
ment under optimal conditions. The potential 
these techniques hold is a viable alternative to 
parasexual hybridization and somatic embryo- 
genesis (Stewart 1981). Post-zygotic barriers 
such as endosperm abortion and, at later stages, 
embryo degeneration are of common occurrence, 
leading to low fertility; but these have been over- 
come through the use of embryo rescue, and sev- 
eral hybrids have been developed. Genetic 
variation necessary for crop improvement is con- 
ventionally generated through hybridization, 
mutagenesis and, occasionally, polyploidy. More 
recently, biotechnological approaches have 
become available for creating genetic variability. 

In embryo rescue, however, the embryo is 
removed before seed abortion occurs and is 
grown outside the parent plant to produce a new 
plant to enable crosses to be made between spe- 
cies not normally sexually compatible. Plant 
breeders usually rescue inherently weak, imma- 
ture or hybrid embryos to prevent degeneration. 
Successful production of plants from cultured 
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embryos largely depends upon maturation stage 
and composition of the medium (Raghavan, 
2003). Abortion of embryos at one or the other 
stage of development is a characteristic feature of 
distant hybridization. Several refinements have 
been made in embryo culture/embryo rescue 
techniques which have been a popular approach 
for raising hybrids from a number of incompati- 
ble crosses. It has even allowed synthesis of triti- 
cale, a new hybrid species, resulting from a cross 
between rye and wheat. Related applications like 
ovule/ovary/placental cultures through sequential 
embryo culture have also been developed. 
Embryo culture applied to practical problems 1s a 
tissue culture technique that has proven to be of 
greatest value to breeders. 

There are several situations where it may not 
be possible to obtain hybrids from parent plants. 
Distantly related plant species in a breeding pro- 
gramme (‘wide hybridization’) may result in no/ 
aborted hybrid embryos. Seedless parents may be 
unable to produce seeds in a cross. Alternately, 
breeding may be hindered owing to heavy fruit 
drop in the early stages of fruit development. In 
all such cases, hybrid embryo rescue is a very 
powerful and useful tool and an indispensable 
technique. The procedure involves dissection of 
the developing embryo out from its environment 
and culturing it on a synthetic medium to produce 
hybrid plants. In certain instances, ovule culture 
may be needed preceding excised embryo cul- 
ture. Using this method, we have raised hybrids 
in seedless grape and lime, in difficult crosses of 
papaya, mango, banana, rose, etc. 

Currently, embryo rescue holds great promise 
not only for effecting wide crosses but also for 
obtaining plants from poorly developed embryos, 
for obtaining haploid plants as well as for short- 
ening the breeding cycle (Sharma et al. 1996). 


18.2 The Technique of Embryo 
Culture 


Following fertilization, the zygote undergoes an 
asymmetrical cell division that gives rise to a 
small apical cell, which becomes the embryo and 
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a large basal cell (called the suspensor), which 
functions to provide nutrients from the endosperm 
to the growing embryo. From the eight-cell stage 
(octant stage) onwards, the zygotic embryo 
shows clear embryo patterning, which forms the 
main axis of polarity, and the linear formation of 
future structures. These structures include the 
shoot meristem, cotyledons, hypocotyl and root 
and root meristem: they arise from specific 
groups of cells as the young embryo divides, and 
their formation has been shown to be position 
dependent. In the globular stage, the embryo 
develops radial patterning through a series of cell 
divisions, with the outer layer of cells differenti- 
ating into the “‘protoderm’. The globular embryo 
can be thought of as two layers of inner cells with 
distinct developmental fates; the apical layer will 
go on to produce cotyledons and shoot meristem, 
while the lower layer produces the hypocotyl and 
root meristem. Bilateral symmetry is apparent 
from the heart stage; provascular cells will also 
differentiate at this stage. In the subsequent tor- 
pedo and cotyledonary stages of embryogenesis, 
the embryo completes its growth by elongating 
and enlarging. 

Embryo culture is the sterile isolation and 
growth of an immature or mature embryo in vitro, 
with the goal of obtaining a viable plant. The first 
attempt to grow embryos of angiosperms was 
made by Hannig in 1904 who obtained viable 
plants from in vitro isolated embryos of two cru- 
cifers Cochlearia and Raphanus. In 1924, 
Dietrich grew embryos of different plant species 
and established that mature embryos grew nor- 
mally, but those excised from immature seeds 
failed to achieve the organization of a mature 
embryo. They grew into seedlings, skipping the 
stages of normal embryogenesis and without 
completion of the dormancy period. Laibach 
(1925, 1929) demonstrated the practical applica- 
tion of this technique by isolating and growing 
the interspecific cross Linum perenne xX L. aus- 
triacum that otherwise aborted in vivo. This led 
Laibach to suggest that in all those crosses where 
viable seeds are not formed, it may be appropri- 
ate to excise their embryos out and grow them in 
an artificial nutrient medium. Embryo culture is 
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now a well-established and useful branch of plant 
tissue culture. 


18.2.1 Types of Embryo Cultures 


There are broadly two types of embryo cultures: 

1. Mature embryo culture is the culture of mature 
embryos derived from ripe seeds. This type of 
culture is done when embryos do not survive 
in vivo or become dormant for long periods of 
time or is done to eliminate inhibition of seed 
germination. Seed dormancy of many species 
is due to chemical inhibitors or mechanical 
resistance present in the structures covering 
the embryo, rather than dormancy of the 
embryonic tissue. Excision of the embryos 
from the testa and culturing them in nutrient 
media may bypass such seed dormancy. Some 
species produce sterile seeds (which may be 
due to incomplete embryo development). 
Embryo culture procedures may yield viable 
seedlings. Embryos excised from the develop- 
ing seed at or near the mature stage are auto- 
trophic and are grown on a simple inorganic 
medium with a supplemental energy source. 

2. Immature embryo culture, also known as 
embryo rescue, is the culture of immature 
embryos to rescue embryos from wide crosses, 
crosses involving seedless parent(s), or where 
fruit fall is heavy in early stages of embryo 
development. This is mainly used to avoid 
embryo abortion, with the purpose of produc- 
ing a viable (hybrid) plant. Wide hybridiza- 
tion, where individuals from two different 
species of the same genus or different genera 
are crossed, often leads to failure of the cross. 
There are several barriers which operate at 
pre- and post-fertilization levels to prevent 
successful gene transfer from wild into culti- 
vated species. Pre-fertilization barriers include 
all factors that hinder effective fertilization, 
which is usually due to inhibition of pollen- 
tube growth by the stigma or upper style. 
Post-fertilization barriers hinder or retard 
development of the zygote after fertilization 
and inhibit normal development of the seed. 
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This frequently results from failure of the 
hybrid endosperm to develop properly, lead- 
ing to starvation of the hybrid embryo, or 
results from embryo-endosperm incompati- 
bility where the endosperm produces toxins 
that kill the embryo. 

Evidence suggests that embryos of inviable 
hybrids possess the potential for initiating devel- 
opment, but are inhibited from reaching adult 
size with normal differentiation. Endosperm 
development precedes and supports embryo 
development nutritionally, and endosperm failure 
has been implicated in several cases of embryo 
abortion. Endosperm failure generally results in 
abnormal embryo development and eventual star- 
vation. Thus, isolation and culture of hybrid 
embryos prior to abortion may circumvent these 
strong post-zygotic barriers. Production of inter- 
specific and intergeneric hybrids is the most 
impressive and conspicuous application of 
embryo rescue and culture technique, particu- 
larly for subsequent valuable gene transfer from 
wild species. The underlying principle of embryo 
rescue technique is the aseptic isolation of the 
embryo and its transfer to a suitable medium for 
development under optimal conditions. There are 
normally no problems of disinfection of embryos 
in such cultures. Florets are removed at the proper 
time, and either the florets or the ovaries are ster- 
ilized. Ovules can then be removed from the ova- 
ries. The tissue within the ovule in which the 
ovule is embedded is already sterile. For mature 
embryo culture, either single, mature seeds are 
disinfected, or if the seeds are still unripe, then 
the still-closed fruit is disinfected. The embryos 
can then be aseptically removed from the ovules. 
Using embryo culture to overcome seed dor- 
mancy requires a different procedure. Seeds hav- 
ing hard seed coats are sterilized and soaked in 
water for a few hours to a few days. Sterile seeds 
are then split open and the embryos excised out. 

The most important aspect of embryo cul- 
ture work is the selection of the medium neces- 
sary to sustain continued growth of the embryo. 
In most cases, a standard basal plant growth 
medium with major salts and trace elements 
may suffice. Mature embryos can be grown in a 
basal salt medium with a carbon energy source, 
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such as sucrose. Young embryos, in addition, 
require various vitamins, amino acids and 
growth regulators and, in some cases, natural 
endosperm extracts. Young embryos should be 
transferred to a medium with high sucrose con- 
centration (8-12 %) which approximated the 
high osmotic potential of the intracellular envi- 
ronment of the young embryo sac and a combi- 
nation of hormones that supports growth of the 
heart stage embryo (i.e. a moderate level of 
auxin and a low level of cytokinin). Reduced 
organic nitrogen in the form of asparagine, glu- 
tamine or casein hydrolysate is always benefi- 
cial in embryo culture. Malic acid is often 
added to the embryo culture medium. After 
1—2 weeks (when the embryo ceases to grow), it 
must be transferred to a second medium with 
normal sucrose concentration, low levels of 
auxin and a moderate level of cytokinin, which 
allows for renewed embryo growth, with direct 
shoot growth in many cases. Where the embryo 
does not show direct shoot formation, it can be 
transferred to a medium for callus induction, 
followed by shoot induction. When the embryos 
have grown into plantlets in vitro, these are 
generally transferred to sterile soil and grown 
to maturity. Hybrids have been successfully 
developed in mango, banana, papaya, seedless 
grape and citrus, onion, capsicum and hot pep- 
pers using intergeneric/interspecific/intervari- 
etal crosses. 


18.2.2 General Applications 
of Embryo Culture 


Obtaining rare hybrids: In plant breeding pro- 
grammes, not all desirable crosses are successful. 
Where a hybrid embryo capable of normal devel- 
opment aborts due to inadequate supply of nutri- 
tion inside the seed, or due to embryo-endosperm 
incompatibility, it is possible to raise complete 
hybrid plants through embryo rescue. Success 
has been achieved in prevention of abortion in 
wide crosses, and interspecific crosses in cotton, 
barley, tomato, rice, legumes, flax, etc., have 
been generated. 


18 Hybrid Embryo Rescue in Crop Improvement 


18.2.2.1 Development of Intergeneric 
Hybrids 

Prevention of abortion in wide crosses has been 
shown to be possible. Success has been achieved 
in interspecific crosses in cotton, barley, tomato, 
rice, legumes, flax, etc. In several crop plants, 
even intergeneric hybrids have been successfully 
developed through hybrid embryo rescue. 
Examples include wheat x barley, wheat x rye, 
barley x rye, maize x Tripsacum, Raphanus sati- 
vus X Brassica napus, Actinidia deliciosa x A. 
eriantha and Actinidia deliciosa x A. arguta. In 
Carica and Citrus also distant hybrids have been 
produced from their relatives using embryo res- 
cue. In peach, a cross between an early Japanese 
cultivar Sunago Wase and a Chinese cultivar 
Yuhualu resulted in “Zaoxialu’ — an extra early 
maturing hybrid (55 days to maturity, in 
Hangzhou, China). Some hybrids obtained 
through embryo rescue have recombined desir- 
able genes like earliness and disease and pest 
resistance (see Table 18.1). Embryo rescue is also 
used to recover crosses between diploids and 
tetraploids. 


18.2.2.2 Production of Haploids 

Embryo culture can be used for the production of 
haploids or monoploids. In 1970, a technique was 
developed to produce barley monoploids. 
Interspecific crosses are made with Hordeum bul- 
bosum as the pollen parent, and the resulting 
hybrid embryos are cultured. But, these exhibit 
H. bulbosum chromosome elimination, resulting 
in monoploids of the female parent H. vulgare. 
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18.2.2.3_ Overcoming Seed Dormancy 
Seed dormancy can be caused by numerous 
factors, including endogenous inhibitors, specific 
light requirements, low temperature, dry storage 
requirements and embryo immaturity. Seed dor- 
mancy in /ris is due to the presence of a stable 
chemical inhibitor in the endosperm. In American 
basswood (Tilia americana), the seed is born 
within a tough, indehiscent pericarp where the 
resistance is mechanical. Naturally vegetatively 
propagated plants like bananas and Colocasia 
produce seeds that do not germinate in nature, 
probably due to recalcitrant dormancy. These 
stand to benefit from embryo culture. 


18.2.2.4 Shortening the Breeding 
Cycle 

Many species exhibit seed dormancy which is 
localized in the seed coat and/or in the pericarp. 
Removal of these inhibitions promotes seed ger- 
mination, e.g. Brussels sprout, rose, apple, oil 
palm and /ris. Hollies Ulex) embryos remain in 
the immature heart stage even though the fruits 
may have reached maturity. It takes 3 years for 
Ilex opaca seeds from mature berries to complete 
their embryonic development and to begin germi- 
nation. Through embryo rescue, hybrid plants 
from this can be obtained in 2—3 weeks. Rosa 
normally takes a year to come to flowering; 
through embryo culture, it was shown that it was 
possible to produce two generations in a year. 

Prevention of embryo abortion with early rip- 
ening stone fruits: Early ripening varieties of 
peach, cherry, apricot and plum produce sterile 
seeds which do not germinate under natural 


Table 18.1 Resistance traits transferred to crop species using hybrid embryo rescue 


S.no. Species used for crossing 


bi 


Lycopersicon esculentum x L. peruvianum 
Solanum melongena x S. khasianum 
Solanum tuberosum x S. etuberosum 
Brassica oleracea Xx B. napus 

Brassica napus x B. oleracea (kale) 
Triticum aestivum x Thinopyrum scirpeum 
Hordeum sativum x H. vulgare 

Hordeum vulgare x H. bulbosum 


OO OI NW!) NM! BH) GW!) bd 


Oryza sativa xX O. minuta 


Resistance trait(s) 

Virus, fungi, nematodes 

Brinjal fruit and shoot borer (Leucinodes orbonalis) 

Potato leaf roll virus 

Triazine resistance 

Cabbage aphid 

Salt tolerance 

Powdery mildew and spot blotch 

Powdery mildew 

Blast (Pyricularia grisea), bacterial blight (Xanthomonas oryzae) 
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conditions and, eventually, decay in the soil. Seed 
sterility here may be due to incomplete embryo 
development. In early ripening stone fruits, trans- 
port of water and nutrients to the immature 
embryo is cut off too soon sometimes, resulting 
in embryo abortion. 


18.2.2.5 Propagation of Rare Plants 
‘Macapuno’ coconuts are prized for their charac- 
teristic soft endosperm which fills the whole nut. 
These nuts fail completely to germinate because 
the endosperm invariably rots before the germi- 
nating embryo can emerge out of the shell. 

Embryo culture has been practised as a gen- 
eral method in horticultural crops including avo- 
cado, peach, nectarine and plum. Two cultivars, 
‘Goldcrest peach’ and ‘Mayfire nectarine’, have 
resulted from embryo culture and are now com- 
mercially grown. 


18.2.2.6 Rapid Seed Viability Test 

Breaking of seed dormancy is possible through 
embryo culture. Therefore, it also allows the use 
of this technique as a rapid test for seed viability. 
Germination of excised embryos ts reliable and is 
capable of more exact interpretation of seed via- 
bility than the commonly used stain tests. 


18.2.2.7_ Haploid Formation 

Haploid production with the aid of embryo cul- 
ture following distant hybridization: In such 
crosses, double fertilization may occur normally, 
but selective elimination of the entire comple- 
ment of chromosomes of one parent may occur, 
leading to haploid embryo formation. Such 
embryos do not survive unless rescued by cultur- 
ing them. 


18.2.2.8 Embryos for Propagation/ 
Conservation/Germplasm 
Exchange 
Embryos are excellent material for in vitro propa- 
gation, especially in conifers and in Gramineae 
family. As per IPGRI, Rome, it is compulsory to 
exchange germplasm of coconut internationally 
using embryo cultures (otherwise, shipping of 
whole nuts is cumbersome, besides requiring 
huge cargo space on flights or ships). 
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18.2.2.9 Germination of Seeds 
of Obligate Parasites 

Parasitic plants produce seeds that can be 
germinated to produce plantlets only on their 
chosen host plant. Germination of seeds of obli- 
gate parasites in vivo without the host plant is 
impossible, but is achievable with embryo cul- 
ture. Insectivorous plants have also been shown 
to propagate successfully on a synthetic medium 
using these techniques. 


18.3. Application of Embryo 
Rescue in Fruit Crops 


Embryo rescue techniques in fruit crops play an 
important role in breeding new early maturing 
seedless triploid types as well as obtaining dis- 
tant hybrids preventing embryo degeneration at 
early development stage and shortening the 
breeding cycle (Y1 Hualin et al. 2001). Application 
and progress of embryo rescue in fruit crop 
breeding was reviewed, and several problems, 
e.g. suitable culture period nutrient and environ- 
ment conditions, were briefly introduced as well 
in this paper. With interspecific crosses, interge- 
neric crosses and crosses between diploids and 
tetraploids, the endosperm often develops poorly 
or not at all. A synthetic medium can compensate 
for lack of support from the endosperm. 


18.3.1 Seedless Grapes 


The second food crop to be fully sequenced 
genomically after rice is the grape. Improvement 
in grape is slow and time-consuming since the 
crop is heterozygous and exhibits pronounced 
inbreeding depression (Leela Sahiram 2011). 
Thompson Seedless is the ruling variety of grape 
in India and other parts of the world, but is sus- 
ceptible to downy mildew disease caused by a 
deuteromycete fungal pathogen, Plasmopara viti- 
cola. Obtaining a downy mildew-resistant 
Thompson Seedless would be a boon to the global 
viticulture industry, resulting in massive savings 
in pesticide use while providing a cleaner, safer 
environment. Seedless grapes are 
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stenospermocarpic, in that these form embryos 
successfully, but the embryo does not develop and 
aborts due to the inability of the ovular wall to 
expand. These embryos can, however, be rescued 
before they abort by initially culturing the whole 
ovule; after about 8 weeks, the developing embryo 
can be dissected out from the ovule and grown on 
a different medium. This is termed ‘sequential 
embryo culture’. Ramming (1990) pioneered 
hybrid embryo rescue in seedless grape breeding. 
Sequential hybrid embryo rescue was carried out 
using “Thompson Seedless’ as the maternal parent 
and 15 different pollen donor parents (Tables 18.2 
and 18.3) variously resistant to downy mildew 
disease (Leela Sahijram and Raghavendra 2005). 
The authors used a 2-step, sequential embryo res- 
cue: in ovulo embryo culture followed by ex ovulo 
embryo culture (Kanamadi et al. 1999a, b, c; 
Leela Sahijram and Kanamadi 2004; Leela 
Sahijram et al. 2005a, b). 

Seedless grape cultivars (Vitis vinifera L.) are 
widely grown in Europe, America and Asia. 
Fungal diseases are a great threat of these culti- 
vars. Several wild Chinese Vitis species show 
high resistance to several fungal diseases. 
Therefore, an investigation was conducted by Lili 
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Tian et al. (2008) to assess the potential of incor- 
porating these species into a breeding project for 
the development of disease-resistant seedless 
cultivars. Hybridization was accomplished using 
V. vinifera as the female parent and wild, Chinese 
Vitis spp. as male parents. Jn ovulo embryo res- 
cue was used for developing hybrid plants from 
seedless females. An efficient protocol was 
reported from them for in vitro embryo rescue 
and plant development using the cross Emerald 
Seedless x Beichun. Ovules were excised from 
immature fruits 7 weeks after pollination (WAP) 
and cultured in the double-phase ER medium 
supplemented with 6.0 % sucrose and 0.3 % acti- 
vated charcoal (AC). Following 8—12 weeks of 
culture, embryos were removed from the ovules 
and transferred onto WPM supplemented with 
1.0 mM 6-benzyladenine (6-BA), 2.0 % sucrose 
and 0.2 % AC and solidified with 0.6 % agar. 
After 8 weeks of culture, the embryos germinated 
and subsequently grew into whole plantlets. With 
optimized parameters developed in this study, 
about 34.0, 91.2 and 77.4 % of embryo forma- 
tion, embryo germination and plant development, 
respectively, were obtained. When this protocol 
was applied to 11 other cross combinations, 


Table 18.2 Data on crossing “Thompson Seedless’ (female parent) with 14 downy mildew-resistant pollen donor 


parents and open-pollinated “Thompson Seedless’ 


Pollen donor Total no. of ovules 


No. of fully developed % hybrid 


Sl. no parent used cultured from the cross hybrid plantlets recovered plants recovered 
1 Vitis tiliifolia 165 28 16.97 
2 *Madresfield Court’ 134 57 42.54 
3 Fruehort veltliner 110 48 43.64 
4 Vitis candicans 39 3 7.69 
5 ‘SV 23501’ 5 0.00 
6 ‘SV 18315’ 172 14 8.14 
7 ‘SV 12309’ 58 33 56.90 
8 ‘Catawba’ 60 23 38.33 
9 ‘James’ 117 19 16.24 
10 Riparia x Rupestris 134 25 18.66 
11 ‘SV 12364’ 189 64 33.86 
12 Vitis assamica 111 26 23.42 
13 ‘SV 18402’ 51 1] 21.57 
14 Vitis lanata 8 0 0.00 
15 Open-pollinated 37 13 35.14 

Thompson Seedless 

Total 1,390 364 26.19 
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genotype was found to significantly influence 
embryo formation, embryo germination and plant 
development, with different frequencies of hybrid 
plants obtained successfully in all the crosses. 

A highly efficient embryo rescue technique 
using in ovulo culture is critical for maximizing 
success when using stenospermocarpic female 
parents to breed new seedless grape varieties. 
The effects of medium composition, pollen par- 
ent and year of cross on embryo development and 
recovery in cultured ovules were investigated by 
Liu et al. (2008) to improve in ovulo embryo res- 
cue from stenospermocarpic parents commonly 
used in breeding seedless grapes for the 
Australian table and dried grape industries. 
Increasing CaCl2 concentrations in the culture 
medium improved embryo recovery, but embryo 
emergence from ovules and germination rates 
were unaffected by varying Fe-EDTA concentra- 
tions. Casein hydrolysate supplementation in the 
culture medium improved embryo recovery, 
emergence and germination. Embryo recovery 
varied with parental genotype and ranged from 5 
to 14 % in four self-pollinated genotypes. Mean 
embryo recoveries from ovules cultured 50 days 
after controlled cross-pollination from berries of 
the pollen-sterile, stenospermocarpic variety 
‘Carina’ were 36 and 26 % where the pollen par- 
ents were seeded (11 crosses) and seedless (6 
crosses), respectively. Embryo recovery and ger- 
mination were affected by the year in which the 
cross was made; there was also a significant cross 
x year interaction. Results indicated that in ovulo 
embryo rescue can be improved by increasing 
CaCl2 concentration and incorporating casein 
hydrolysate supplement in the basal medium, by 
exploiting genotypic differences using seedless 
parents yielding higher proportions of rescued 
hybrids and, possibly, by understanding environ- 
mental effects on female parents to maximize the 
number of hybrids produced. 

Pommer et al. (1995) used 18 seedless grape 
genotypes differing in ripening season (early, 
mid and late) and in seed trace size (small, 
medium and large) and harvested at 6, 10, 14, 18 
and 22 weeks post-bloom (wpb). Using embryo 
rescue techniques, it was studied if embryos 
aborted as the fruit matured and what percentage 
of embryos remained viable at later stages. Size 
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of the seed trace was also investigated to 
determine its influence on embryo viability dur- 
ing maturation. Genotype was found to have a 
great influence on embryo culture. Late-maturing 
genotypes showed fewer (1) embryos that could 
be rescued, (2) germinated embryos and (3) 
transplantable plants, than early and midseason 
ones. The best time for grape embryo rescue/cul- 
ture was 6 and 10 weeks post-bloom when the 
largest number of germinated embryos and trans- 
plantable plants was obtained. Genotypes with 
the largest ratio for seed trace weight/seed trace 
length (i.e. largest density) showed the greatest 
tendency to have the largest number of ovules 
with embryos, more germinated embryos and 
more transplantable plants. The study also 
showed that it was possible to recover plants 
from mature fruits harvested late, although the 
rate of success was much reduced. 

Somatic embryo formation was shown from 
immature zygotic embryos within ovules of ste- 
nospermocarpic seedless grapes (Vitis vinifera 
L.), when cultured for 2 months on liquid medium 
(Emershad and Ramming 1994). Somatic 
embryos continued to proliferate after excision 
and transfer to Emershad/Ramming medium sup- 
plemented with | #M_ benzylaminopurine and 
0.65 % TC agar. Plant development from somatic 
embryos was influenced by genotype, medium, 
phase (liquid, agar), stage (torpedo, mature) and 
their interactions. Optimal plant development 
occurred on Woody Plant Medium supplemented 
with 1.5 % sucrose + 1/~M benzylaminopurine + 
0.3 % activated charcoal and 0.65 % TC agar. 

In grape, pollen donor parent overwhelmingly 
determined the outcome of crossing and success 
rate thereof. From the directed crosses, of the 
1,500 hybrids generated and grown in the glass- 
house, about 700 were transferred to the field 
(Fig. 18.1), established to maturity and evaluated 
for downy mildew resistance while retaining 
desirable pomological qualities of the seed parent. 
In some crosses involving seedless x seedless par- 
ents, the authors have demonstrated through his- 
tological studies that the zygote does not develop 
beyond the first cell division (i.e. 2-cell stage). 

Bharathy et al. (2003, 2005) in their study on 
‘Thompson Seedless’, ‘Flame Seedless’ and 
eight other varieties of grape found embryo 
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Fig. 18.1 Sequential hybrid embryo rescue/culture in controlled crosses of grape with ‘Thompson Seedless’ as female 
parent and downy mildew-resistant pollen donors (Source: Leela Sahijram et al. 2013) 
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recovery to increase significantly with application 
of benzyladenine — a cytokinin — at pre-flowering 
and flowering stages. Maximum embryo recov- 
ery (47.57 %) was obtained with “Thompson 
Seedless’ (TS) x ‘Concord’, followed by TS x 
SV18402 (29.75 %). Murthy et al. (2006) 
achieved field establishment and _ screened 
embryo-rescued hybrid seedlings of grape with 
special reference to downy mildew. 


18.3.2 Mango 


Mango (Mangifera indica L.) is the leading fruit 
crop of India. It has long been known to be a dif- 
ficult system in traditional breeding programmes 
because of inherent characteristics such as: 
1. Long juvenile phase 
2. High level of heterozygosity resulting in 
unpredictable outcome in hybridization 
3. Heavy fruit drop leading to low retention of 
crossed fruits 
4. One seed per fruit 
5. Polyembryony in some cultivars 
6. Large acreage required for meaningful assess- 
ment of hybrids 
Classically, barring a few hybrid varieties 
resulting from planned hybridization — pro- 
grammes, almost all known cultivars have 
resulted from selection of chance seedlings 
resulting from natural cross-pollination. In 
Florida and South Africa, for example, none of 
the cultivars developed have come from a breed- 
ing programme. Modern horticultural and indus- 
trial requirements in mango breeding (Iyer and 
Degani 1997) emphasize: 
. Precocious bearing 
. Dwarf tree habit 
Heavy and regular bearing 
. Freedom from physiological disorders 
. Resistance to major pests and diseases 
. Good shipping qualities and shelf life 
. Ideal tree architecture 
In controlled crosses in this crop, fruit set in 
relation to the original number of flowers 
pollinated is very low (Table 18.4), sometimes as 
low as 0.01 % (Iyer 1991). Therefore, embryo 
rescue-based technology was applied for improv- 
ing breeding success. 


IDNR WNE 
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Table 18.4 Controlled intraspecific cross in mango (cv. 
Alphonso x Kerala Dwarf) 
487 


Total no. of flowers used as female parent 2,646 
for pollination 


Total no. of panicles used for crossing 


208 
7.86 % 
193 (92.8 %) 


Fruit set (no.) 
% Fruit set 


No. of fruit/fruitlets lost to leaf-hopper 
infestation/sooty mould 


No. of excisable hybrid embryos 15 
Ratio of panicle to fruit set 1:0.42 
Ratio of flowers crossed to fruit set 176.4:1 


Source: Leela Sahijram et al. (2013) 


18.3.2.1 Floral Biology 

and Pollination in Mango 

The total number of flowers in a panicle may vary 
from 1,000 to 6,000 (Mukherjee 1953). Initial 
fruit set in mango is directly related to the pro- 
portion of perfect (hermaphrodite) flowers to sta- 
minate flowers, although the final fruit set does 
not necessarily depend on this ratio. This propor- 
tion becomes critical for optimum fruit set in a 
cultivar when the proportion drops to 1 %. 


18.3.2.2 Breeding Objectives 
in Mango 

Developing cultivars with: 

1. Regular bearing 

2. Dwarf tree habit with precocity in bearing 

3. Attractive, good-sized (300-500 g), good 
shape, good quality fruits (taste, flavour and 
firm flesh without fibre), high pulp-stone ratio 
With regard to improvement of rootstocks by 

breeding, the main desirable features are: 

1. Polyembryony (a recessive trait) 

2. Dwarfing influence on the scion 

3. Tolerance to adverse soil conditions (pH, cal- 
careous soil, etc.) 

4. Good scion compatibility 


18.3.2.3 Breeding for Special 
Objectives 
18.3.2.3.1 Dwarfness: For Orchard 
Management and Fruit Quality 
Owing to the obvious benefits of comparatively 
dwarf trees for orchard management and fruit 
quality, attempts were focused on obtaining 
hybrids with a dwarf tree framework. Breeding 
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for dwarfness is important in mango, since a con- 
sistent dwarfing effect of any rootstock has not 
been established to date. Indian cultivars found to 
be useful as a source of imparting dwarfness 
include ‘Kerala Dwarf’, ‘Amrapali’, ‘Janardhan 
Pasand’ and ‘Nileshwar Dwarf’ (Singh 1990). 

Controlled crosses were made using the fol- 
lowing mango varieties as parents, based on 
breeding priority (Leela Sahijram et al. 2013) 
‘Kerala Dwarf’ as the male parent to introgress 
dwarfness. Factors detrimental to initial embryo 
growth and development were found to be fun- 
gal/bacterial contamination, excessive exudation 
of phenolics into the medium and poor response 
of the embryo. 


S.no. Cross combination 

1. ‘Amrapali’ x “Kerala Dwarf’ — (AKD) 

2. ‘Alphonso’ x ‘Kerala Dwarf’ — (ALK) 

3. ‘Arka Anmol’ x ‘Kerala Dwarf’ — (AnKD) 
4 Open-pollinated Totapuri (OpTp) 


Young fruitlets containing immature embryos 
from these monoembryonic cultivars were har- 
vested at 4-6 weeks post-pollination and excised 
embryos cultured in vitro. In AKD, 69.23 % and in 
OpTp 82.68 % of the embryos cultured on defined 
media developed into normal plantlets in vitro. In 
AKD, cultures responded, with shoot and root 
induction. Upon normal plantlet development, 
hybrid vitroplants were transplanted extra vitrum 
in a phased manner into mini pots. Eight weeks 
later, these hybrid plants were transferred to larger 
pots and, finally, into the field. In OpTp plantlet 
survival at final field establishment was just 3.81 %. 
Field-established plants showed overall survival 
percentage of 34.4 % in AKD. In the controlled 
cross ALK and in OpTp, rooting was achieved in 
70.9 and 72.4 % cultures, respectively. Performance 
of ALK and OpTp was comparable (77.8 and 
77.5 %, respectively) on a medium containing 
1,500 mg/I casein hydrolysate (CH) + 6 % sucrose 
(w/v). OpTp responded best (87.5 %) on a medium 
containing 1,000 mg/l CH + 3 % sucrose (w/v). 

In ALK, 88 hybrid embryos out of a total of 
109 from the controlled cross formed plantlets in 
vitro (80.73 %), of which only 2 hybrids survived 
field establishment (2.27 % survival rate). In 
AnKD, 244 hybrid embryos were excised and 
cultured in vitro of which 177 formed normal 
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plantlets (72.5 %). In the subsequent year, in 
OpTp, out of 232 embryos, 166 formed plantlets 
in vitro (79.55 %). 

Cultures showing both shoot and root inductions 
were transplanted extra vitrum into large, polyvinyl 
pots. In all, 80.7 % cultures of ALK (Fig. 18.2) and 
82.7 % of OpTp were found suitable for transfer to 
soil. Finally, overall survival percentage of 4.54 and 
3.81 % was obtained in ALK and OpTp hybrids, 
respectively. 84.77 % of AnKD embryos developed 
into plantlets in vitro. Overall percentage field sur- 
vival in AnKD was 4.11. 


18.3.2.4 Ex Vitro Shoot-Tip Grafting 

(Ex Vitro STG) 
A protocol earlier also worked out for the cross 
‘Mulgoa’ x ‘Neelum’. Using immature zygotic 
embryos as starter explants, Leela Sahijram et al. 
(1996) could also induce somatic embryogenesis 
in mango embryos/nucellus and elucidate factors 
influencing the same. These plantlets did not sur- 
vive beyond 4 months of ex vitro transfer, resulting 
in very high mortality despite their healthy status at 
transplanting. Therefore, an alternative strategy 
was devised to rescue the hybrid vitroplants. 
Hence, a novel technique of ex vitro shoot-tip 
grafting (ex vitro STG) was devised by the authors 
(Leela Sahijram and Bhaskara Reddy 2009) using 
“Totapuri’ as the rootstock. The hybrids were grown 
to maturity and evaluated. These were further mul- 
tiplied on various rootstocks. 

‘Totapuri’ (rootstock) stones were germinated 
in the traditional manner in seed pans on a sand 
medium. Shoot apices of the hybrid plantlets were 
excised and grafted ex vitro onto the rootstock 
using the epicotyl grafting technique. The diame- 
ter of the scion shoot apex at the cut surface was 
2—4 mm. Graft union established successfully in 
3 weeks, and at the end of 8 weeks, the scions 
showed vigorous growth and development. Overall 
graft success rate was 79.54 %. Ex vitro STG was 
carried out using vitroplantlets of Alphonso x 
Kerala Dwarf, open-pollinated (1) Alphonso, (2) 
Ratnagiri Alphonso and (3) Totapuri, with a suc- 
cess rate of 66.7, 100, 72.2 and 92.31 %, respec- 
tively. These grafts were transferred to polybags of 
size 9” ht x 7” dia containing sand, soil and vermi- 
compost at 1:1:1, at 10-12 weeks. The grafts were 
further transferred to larger pots containing sand, 
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Fig. 18.2 Hybrid embryo rescue in the mango cross 
‘Alphonso’ x ‘Kerala Dwarf’: vitroplants derived through 
embryo rescue (a) on semisolid medium in culture vessels 


soil and farmyard manure at 1:1:1, at 6 months, 
and, subsequently, field-planted and hybrids evalu- 
ated for desirable traits. 


18.3.3 Papaya 


Leela Sahijram and Doreswamy (1993) tested the 
efficacy of using placental cultures for rescuing 
hybrid embryos from controlled crosses involv- 
ing Carica papaya and Vasconcellea cauliflora. 
They were able to induce a transient expression 
of anthocyanin gene (inky blue in colour) in pla- 
cental cultures of hybrid papaya. Intergeneric and 
interspecific crosses were used for hybrid embryo 
rescue, and hybrid plants were recovered. 
Vitroplantlets were transplanted into soil ex vitro. 


and (b) on ex vitro transfer to non-sterile tap water and (c) 
transplanted into soil mix in polyvinyl mini pots (Source: 
Leela Sahijram et al. 2013) 


18.3.4 Seedless Lime 


Prasad et al. (1996) emphasized the importance 
of embryo rescue in improving seedless lime 
quality. Controlled crossing between seedless 
lime and acid lime resulted in immature zygotic 
embryos that were rescued in vitro and grown to 
maturity in the field. The objective here was to 
develop a hybrid of acid lime with resistance to 
citrus canker. 

The role of embryo culture techniques in the 
improvement of fruit quality in seedless lime was 
elucidated by Prasad et al. (1996). Rao et al. 
(2011) highlighted the importance of genetic 
resources in citrus fruits and their exploitation in 
citrus improvement programmes using various 
in vitro techniques such as embryo rescue, 
nucellus culture, etc. 
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18.3.5 Banana 


Commercially popular varieties are seedless 
owing to their triploid status. Protocols have been 
developed for rescuing excised hybrid embryos 
from crosses involving Musa acuminata and M. 
balbisiana. Hybrid plants were raised to maturity 
in the field and evaluated (Doreswamy and Leela 
Sahijram 1991, 1993; Chadha and Leela Sahijram 
2000a, b). The hybrids were added to the germ- 
plasm pool in the Institute for further use in 
breeding. 


18.3.6 Olive 


Acebedo et al. (1997) found the growth of plant- 
lets derived from in vitro germinated embryos in 
the greenhouse to be normal. Thus, embryo cul- 
ture can increase the efficiency and shorten the 
time for starting initial progeny evaluation, 
thereby speeding up seedling development in 
olive breeding programmes. 


18.4 Application of Embryo 
Rescue in Floricultural Crops 


Several crops in floriculture have benefitted from 
in vitro interventions in breeding programmes. 
Van Tuyl and Lim (2003) reviewed interspecific 
hybridization and polyploidization as tools in 
ornamental plant breeding. Some of the suc- 
cesses are reported hereunder. 


18.4.1 Rose 


Marchant et al. (1994) used embryo culture for 
the production of Fl hybrids in English rose. 
Rout et al. (1999) reviewed advances made in the 
biotechnology of rose and Holeman (2009) out- 
lined a simplified method of embryo culture in 
rose. Hybrid/zygotic embryos were rescued from 
controlled crosses and open pollination, respec- 
tively, in rose cultivars. Mohapatra and Rout 
(2005) in their study on embryo rescue in flori- 
bunda roses rescued immature embryos from 
‘Arunima’ and ‘Shocking Blue’ roses. The time 
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required for breeding roses through conventional 
breeding, for example, is very long. It 1s occa- 
sionally hampered by premature abortion of the 
developing embryo, resulting in few or no viable 
seeds (Rout et al. 1999). Rose is highly heterozy- 
gous. Lack of germination is due to mechanical 
restriction of embryo expansion by the presence 
of a thick, hard pericarp or due to dormancy regu- 
lated by growth inhibitors present within the 
achene. There are also a few reports of culture of 
mature embryos in vitro. Embryo rescue in Rosa 
hybrida and English roses is also reported from 
elsewhere. Gudin (1994) also demonstrated the 
usefulness of embryo rescue in Rosa hybrida L. 


18.4.2 Lilies/Liliums 


In order to introduce new traits such as disease 
resistances, flower shape and colour from wild 
species into the cultivar assortment of lily, it is 
necessary to overcome interspecific crossing bar- 
riers. Several techniques like the cut-style 
method, the grafted-style method and the in vitro 
isolated ovule pollination technique have been 
developed to overcome pre-fertilization barriers. 
Post-fertilization barriers can be circumvented by 
in vitro pollination and/or rescue methods as 
embryo, ovary-slice and ovule culture. Using 
these techniques, wide interspecific lily crosses 
with species and cultivars from the different sec- 
tions of the genus Lilium (L. longiflorum, L. hen- 
ryi, L. canadense, L. pardalinum, L. concolor, L. 
dauricum, L. candidum, L. rubellum, L. marta- 
gon, Asiatic and Oriental hybrids) could be made 
(Van Tuyl et al. 2000, 2002) and breakthroughs 
achieved. 

Interspecific hybrids have been developed in 
liltums using early stage ovule culture in vitro 
(Wang et al. 2009). Wang et al. (2009) success- 
fully generated interspecific hybrids between 
Lilium longiflorum and L. lophophorum var. lin- 
earifolium culturing whole ovules at early stages 
of their development. Low seed set following 
self-pollination is known to be caused by reduced 
ovule availability resulting from embryo sac 
degeneration. Tammy et al. (1999) hypothesized 
this to be due to the absence of a required stimu- 
lus for normal ovule development. If this is 
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correct, current concepts of SI may need to be 
broadened to include a wider range of pollen- 
pistil interactions. 

Crosses between the diploids Aylocereus 
polyrhizus and H. undatus in both directions 
were performed by Aroldo and Noemi (2010). 
The aim of their study was to develop an efficient 
methodology to rescue embryos following inter- 
specific crosses in the genus Hylocereus. 

Fertilized ovules carrying embryos at very 
early pro-embryonic stages were excised from 
ovaries 5 days after pollination (DAP) and placed 
on half-strength basal MS medium containing 
680 uM glutamine, 0.55 uM a-naphthaleneacetic 
acid (NAA), 0.45 uM thidiazuron (TDZ) and vari- 
ous concentrations of sucrose. After 30 days in 
culture, ovules were isolated from the surrounding 
tissue and transferred to the same fresh medium. 
Significant differences were found between the 
main effects (cross and sucrose concentration) in 
ovule response, 1.e. increased ovule size and callus 
formation. The best responses were obtained in the 
cross, H. polyrhizus x H. undatus, and sucrose 
concentration of 0.09 M. In terms of embryo con- 
version, polyembryony and the number of regen- 
erated plants, the highest responses were observed 
on a culture medium supplemented with 0.17 M 
sucrose in both the interspecific crosses. All the 
plants tested by flow cytometric ff were found to 
be diploid. Fluorescent amplified fragment length 
polymorphism (fAFLP) confirmed hybrid origin 
of the regenerated plants. This study reported the 
success of a three-step embryo rescue procedure 
for Hylocereus species. The procedure developed 
provides means for producing plants from very 
early embryo stage, thus expanding the prospects 
for vine cactus breeding programmes. Yuan et al. 
(2003) demonstrated shortening of breeding cycle 
in spider lily (Lycoris spp.) through embryo 
culture. 

Differential ovule development following 
self- and cross-pollination and the basis of self- 
sterility in Narcissus triandrus (Amaryllidaceae) 
were shown by Sage et al. 1999. Earlier, Hayashi 
et al. (1986) demonstrated ovary-slice culture in 
Lilium formosanum. Wallace and Ikeda et al. in 
2003 produced seedlings from ovules excised at a 
time when they contained zygote stage of the 
hybrid product and cultured them in Lilium spp. 
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18.4.3 Pelargonium 


As a means of integration of in vitro techniques 
in ornamental plant breeding, interspecific 
crosses were made in Pelargonium by applying 
embryo rescue methods. Some reports of artifi- 
cial hybrids are available, but no evidence of 
natural hybridization is seen. 


18.4.4 Alstroemeria 


Interspecific hybridization in the — genus 
Alstroemeria is hindered by post-fertilization bar- 
riers (Buitendiyk et al. 1995). Histological analysis 
revealed poor endosperm development from 
18 days after pollination onwards, followed by 
malformation and abortion of embryos. To create 
interspecific hybrids between Alstroemeria aurea, 
A. pelegrina, A. magnifica, A. inodora and A. psit- 
tacina in diallelic combinations, an ovule culture 
technique was developed. Influence of age of 
ovules, sucrose concentration of medium and tem- 
perature and light during culture were tested. 
Harvesting ovules before the onset of endosperm 
degeneration, 1.e. at 14 days after pollination, cut- 
ting them into halves and culturing the micropylar 
halves in a rotating liquid culture medium contain- 
ing 6 % sucrose at 21 °C in the dark, led to suc- 
cessful embryo rescue. Germinated embryos were 
subcultured in vitro until rhizomes were formed, a 
prerequisite for successful transfer to the green- 
house. Full-grown plants all showed interspecific 
morphological traits, and an analysis of chromo- 
some complement confirmed their hybrid nature. 
Diploid hybrid plants were obtained in all the 20 
interspecific 2x-2x combinations. A total of 260 
interspecific hybrid plants were produced. Half- 
ovule culture of 2x-4x and 4x-2x crosses resulted 
in 43 triploid hybrid plants. Because interspecific 
hybrids were obtained in 100 % of the interspecific 
combinations, it 1s expected that the described 
technique can be applied to overcome post-fertil- 
ization barriers in most crosses within the genus 
Alstroemeria. Chunsheng and Bridgen (1996) 
studied the effect of genotype, culture medium and 
developmental stage of the embryo on in vitro 
responses of ovule cultures in the interspecific 
hybrids of Alstroemeria. 
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18.4.5 Cacti 


Embryo rescue and plant regeneration have been 
successfully demonstrated in cacti following 
interspecific crosses in the genus Hylocereus 
(Aroldo and Noemi 2010). Gynogenesis has also 
been shown to be possible in the vine cacti 
Hylocereus and Selenicereus. 


18.4.6 Other Ornamental Crops 


In primulas, Kato et al. (2001) obtained different 
genomic combinations in intersection hybrids 
from crosses in primulas through embryo rescue. 
Two types of triploids with different genome 
combinations were generated. Interspecific 
hybridization in rhododendron has been shown to 
be possible using embryo rescue in the genus 
Rhododendron ( Eeckhaut et al. 2007). 

Other attempts at embryo rescue include that 
in Narcissus and Zinnia (Miyajima 2006) and 
Ornithogalum (Neiderweiser 1990). Kasten and 
Kunert (1991) devised a method for culture of 
isolated immature embryos of various lupins 
(Lupinus species); Custers (1995) overcame 
interspecific crossing barriers in tulips using 
embryo rescue by successful direct transfer of 
Tulipa kaufmanniana Regel germplasm into that 
of 7: gesneriana L. 


18.5 Application of Embryo 
Rescue in Vegetable Crops 


Wide hybridization to a vegetable breeder and 
cytogeneticist is the first step in transferring 
genes from wild species into cultivated ones. 
Embryo rescue technique has been successfully 
demonstrated in tomato, capsicum, hot peppers, 
okra and radish. In India, embryo culture work 
has been initiated in okra for rescuing interspe- 
cific hybrids. The following is by no means an 
exhaustive account of work reported in literature 
thus far. 
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18.5.1 Tomato 


Smith (1944) and Demirel and Seniz (1997) 
made observations on the possibility of utilizing 
embryo culture for improvement of tomato. For 
in vitro breeding in tomato, Aragao et al. (2002) 
used three culture media in combination with 
distinct accessions, crossing generations and 
periods of time after artificial pollination and 
evaluated them to identify more efficient proto- 
cols to recover interspecific hybrids between 
Lycopersicon esculentum and L. esculentum. 
Bhattarai et al. (2009) found germination of 
immature seeds to be a better alternative to culture 
of excised immature embryos in a bid to hasten 
tomato breeding programmes. 


18.5.2 Capsicum/Hot Peppers 


In the state of Meghalaya in India, chilli (hot pep- 
pers) is the third most important spice crop after 
ginger and turmeric. However, diseases, namely, 
tobacco mosaic virus, root rot, tomato spotted 
wilt virus, etc., lead to considerable decline in 
yield. Capsicum chinense, C. annuum and C. fru- 
tescens were crossed with each other and embryos 
rescued between 27 and 33 days after pollination. 
The highest percentage of embryo growth was 
observed on MS medium with 0.5 mg/l GA; and 
0.05 mg/l NAA. Hybrid plants were obtained and 
their hybridity confirmed using morphological as 
well as RAPD markers (Debbarama et al. 2013). 


18.5.3 Radish 


Intergeneric herbicide resistance transfer to rad- 
ish has been recently accomplished using embryo 
rescue (Mithila and Hall 2012). For introgressing 
auxinic herbicide (Dicamba) resistance from 
wild mustard (Sinapis arvensis) into radish 
(Raphanus sativus; 2n = 18), embryo regenera- 
tion and hybrid plant production was achieved 
involving several hundred reciprocal crosses per- 
formed between these two species. Upon altering 
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cultural conditions and media composition, a 
high frequency of embryo regeneration and 
hybrid plant establishment was achieved. 


18.5.4 Okra (Lady’s Finger) 


As resistance is not available in cultivated species 
of okra (Abelmoschus esculentus Moench), inter- 
specific crosses were made between A. esculen- 
tus and A. moschatus (resistant wild species) to 
develop resistant varieties. Post-zygotic incom- 
patibility was found to operate between the species. 
Crossed seeds were shrivelled and non-viable. /n 
vitro embryo rescue to overcome the incompati- 
bility revealed that culturing 12- and 15-day-old 
embryos of A. esculentus var. Kiran (a high- 
yielding line) x A. moschatus and A. esculentus 
var. Anakomban (a landrace) x A. moschatus on 
MS medium supplemented with BA 0.5 mg I! 
and CW 150 mlI' yielded transplantable hybrids. 
Embryos of crosses A. moschatus x A. esculentus 
(i.e. reciprocal cross) turned brown by the 11th 
day of pollination and could not be cultured in 
vitro (Rajamony et al. 2006). 

In another study (Fatokun 1987), crosses were 
made between members of two West African 
okra types, “Soudanien’ and ‘Guineen’. All 
crosses succeeded in both directions, but F, 
plants that showed hybrid vigour for plant stature 
were partially sterile. Cytological observations of 
these F, plants revealed abnormal meiosis which 
resulted in the production of microspores of 
variable size. The frequency of viable pollen 
(indicated by acetocarmine staining) was low in 
the hybrids: 35.80 % (U.1.92xU.I.313) and 
39.41 % (1bk-1 x U.I1.215). The number of seeds 
produced per fruit was also low in the hybrids and 
only a few of these seeds were viable. The authors’ 
laboratory has initiated work on in ovulo and ex 
ovulo culture of okra embryos with a view to 
generating interspecific hybrids in the long run. 
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18.5.5 Artichoke 


Globe artichoke breeders have two important 
problems: lengthy seed-to-seed cycle and seed- 
borne diseases caused by fungal pathogens. Both 
problems can be solved by embryo rescue. 
Embryos were collected from five cultivars in 
different post-pollination stages (Cravero and 
Crointy 2007). Root induction was poor, but 
shoot development per explant was better. Twenty 
days were required as the optimal time for 
embryo rescue. 


18.5.6 Tuber Crops 


Cassava is one of the most cultivated tuber crops 
in the tropical world. The importance of cassava 
iS growing as a food security crop in sub-Saharan 
Africa, where malnutrition is a menace (Akinbo 
et al. 2010). However, a major hindrance in rapid 
improvement is the long generational cycle of 
cassava, poor germination of seeds and a low 
multiplication rate of stem cuttings. Jn vitro ger- 
mination of 495 seeds from backcross population 
was done. Each genotype was multiplied for suf- 
ficient planting material, hardened in the green- 
house and transplanted to the field. Percentage 
germination of the seeds in embryo culture was 
high (66 %). Raising plantlets in the greenhouse 
was found to be useful to select healthy plants 
and thus obtain a uniform stand in the field. The 
genotypes were planted in a single row trial and 
harvested 8 months after planting. Transplanted 
plantlets gave 98.89 % establishment. Yield- 
related traits were significantly high compared to 
results from past experiments. The high percent- 
age of plant recovery from seed to the field is a 
means of overcoming some problems associated 
with the traditional method of cassava breeding 
through direct seed planting to generate planting 
materials. 
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18.6 Molecular Studies 
on Putative Hybrids 
Obtained Through Embryo 
Culture 


Morphological parameters of the two parents in a 
cross and their hybrid progeny can be compared 
to look for clear-cut differences. Morphological 
traits, however, are subject to environmental 
influences and can be misleading. Besides, these 
lack power of resolution to identify hybrids at 
juvenile stages, for which it becomes necessary 
to wait until plant maturity (Debbarama et al. 
2013). Distant crosses rescued through embryo 
rescue can be confirmed for hybridity using 
molecular markers such as RAPD, AFLP, etc. 
Therefore, testing of hybridity using molecular 
markers is advantageous, as it obviates these 
hindrances. 


18.6.1 Use of Molecular Markers 
in Mango Embryo Rescue 


Mini- and microsatellite DNA markers are highly 
polymorphic in many species due to a variable 
number of tandem repeats. DNA fingerprints 
obtained by hybridization of mini- and microsat- 
ellite probes with genomic DNA were shown to 
be useful for cultivar identification. DNA finger- 
print information for identification of nucellar 
and zygotic seedlings was attempted. For this 
purpose, DNA was isolated using CTAB method 
from fully mature leaves of 6-month-old seed- 
lings (stones with single plants, as well as poly- 
embryonic seedlings) and from _ field-grown 
mother trees. DNA samples were quantified 
using spectrophotometer and confirmed for the 
presence of DNA using’ agarose _— gel 
electrophoresis. 

To multiply Jeffrey’s minisatellite probe (33.6 
plasmid DNA was used as the probe), the plas- 
mid was incorporated into DH5a E. coli compe- 
tent cells and cultured on LB Agar medium with 
ampicillin at 50 pg/ml. Single colonies from the 
transformed ones were isolated and inoculated 
onto the same medium. After adequate growth of 
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the culture, the plasmid was isolated and checked 
for the presence of minisatellite 33.6 (700 bp 
DNA) on 0.8 % agarose gel, using restriction 
enzymes Bam HI and EcoR1. Further develop- 
ment of the nonradioactive labelled probe (33.6 
minisatellite), restriction digestion of the plant 
genomic DNA and Southern transfer/hybridiza- 
tion were worked out. 

18.6.1.1 PCR Amplification Using 

ISSR Primers 

ISSR primers UBC 814, 835, 841, 844, 848, 868, 
873, 881, 898 and 901 were used. The reaction 
mixture (20 pl) contained 10 mM Tris-HCl (pH 
8.3), 50 mM KCl, 2.0 mM MgCl, 0.2 mM mixed 
dNTP, 1 mM primer, 0.5 unit Taq polymerase and 
25 ng/ul of genomic DNA. DNA amplification 
was obtained through 40 cycles (92 °C for 1 min, 
42 °C for 1 min 30 s and 72 °C for | min 30 s), 
followed by extension at 72 °C for 8 min. Of the 
ten ISSR primers used, only six (UBC 835, 841, 
848, 868, 873, 898) showed amplification. Bands 
amplified in the hybrids were similar to that in 
the parents suggesting that these were true 
hybrids. Forward and reverse primers, viz. 1F1R, 
S5F5R, 6F6R and 9F9OR, were also used, of which 
1FIR, 6FOR and 9FOR showed amplification. 


18.6.2 Use of Molecular Markers 
in Grape Embryo Rescue 


Leela Sahiram and Raghavendra (2005), 
Raghavendra et al. (2006) and Leela Sahijram 
and Reddy (2009) carried out molecular studies 
using ISSR markers in seedless grape hybrids 
and confirmed the hybridity of rescued progeny. 


18.7. Basic Studies Carried Out 
Using Embryo Cultures 


In addition to applied uses of embryo culture, the 
procedure is useful in basic studies. Growing 
embryos outside the ovule (ex ovulo) is an 
excellent way to study nutrition and metabolism 
of the embryo at various stages of 1ts development. 
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The technique can also be used to examine 
growth requirements of embryos, effects of phy- 
tohormones and environmental conditions on 
zygotic embryogenesis and the regeneration 
potential of whole embryos or their segments. 
Embryo culture can be used to localize sites of 
germination promoters and inhibitors, for studies 
on embryogenesis and for cryopreservation. 
Embryo culture is useful in understanding preco- 
cious germination. Studies have shown that seed 
tissues play an important role in regulating the 
development of embryos in situ. Cotyledon 
growth stops almost immediately upon excision 
of immature embryo, indicating probable cessa- 
tion of cell division, as seen in cotton. An embryo 
is programmed to germinate even when it is very 
small. 

In situ, the embryo may not germinate because 
it lacks the germination-specific mRNA. But, 
removal of the seed from its environment acti- 
vates the required machinery of embryo cells to 
synthesize the information necessary for germi- 
nation. It has been shown in cotton that 
germination-specific mRNA is normally tran- 
scribed when cotyledons are about three-fifths 
their final size. However, to check precocious 
germination that may lead to vivipary (a lethal 
development), translation of mRNA is prevented 
by simultaneous appearance of abscisic acid 
(ABA) in the seed. Embryo excised at this stage 
undergoes precocious germination which sug- 
gests that ABA is contained in tissues surround- 
ing the embryo rather than in the embryo itself. 
This conclusion is also supported by the fact that 
exogenous application of ABA to excised 
embryos also prevents their precocious germina- 
tion. Accumulation of ABA during late stages of 
seed development has been shown to occur in a 
number of plants. 


18.8 Conclusions 


A very important and valuable biotechnological 
tool for raising hybrid progeny in intractable 
crosses 1s embryo rescue. It is most often used to 
rescue embryos from interspecific and interge- 
neric crosses and from embryos that do not fully 
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develop naturally (as in early ripening and 
seedless fruit crops where the embryo aborts). 
Distantly related plant species in a breeding pro- 
gramme (‘wide hybridization’) may result in no/ 
aborted hybrid embryos. Seedless parents may be 
unable to produce seeds in a cross. Alternately, 
breeding may be hindered owing to heavy fruit 
drop in the early stages of fruit development. In 
all such cases, hybrid embryo rescue is a very 
powerful and useful tool and an indispensable 
technique. The method also can be used to rescue 
seedless triploid embryos, produce haploids, 
overcome seed dormancy or determine seed via- 
bility. It is useful in understanding embryo mor- 
phogenesis and precocious germination. As 
research continues with this technique, new and 
valuable uses will be developed to assist biotech- 
nological breeding of plants. 

Embryo rescue/culture as a technique has 
found wide acceptance and utilization. Many 
interspecific and intergeneric hybrids have been 
successfully produced by culturing immature 
embryos that normally do not survive to maturity 
in situ. Embryo culture techniques are also used 
to rescue embryos from early maturing fruit vari- 
eties, to hasten maturation in some species and to 
overcome dormancy requirements in others. Jn 
ovulo embryo culture facilitates embryo develop- 
ment from the zygote stage to maturity when the 
ovule is cultured. A few interspecific and interge- 
neric hybrids have been made by first accom- 
plishing fertilization in vitro and then culturing 
the ovules to maturity. Self-incompatibility in 
some species can be circumvented with these 
techniques. The potential of these may be a via- 
ble alternative to parasexual hybridization and 
somatic embryogenesis. 
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Ashwani Kumar and Nidhi Gupta 


Abstract 

Triploid hybrids have one of the most important traits, seedlessness, which 
is the characteristic for the fresh-fruit market. Triploid embryos are found 
in small seeds that do not germinate. Hybridization-based extensive breed- 
ing programmes require very efficient methodologies for embryo rescue 
and evaluation of ploidy. Biotechnology provides powerful tools for plant 
breeding. Triploid plants raised from endosperm are generally sterile. 
Endosperm-ploidy levels and its applications in plant breeding have been 
discussed here. Endosperm-raised triploid plants are of commercial value, 
e.g. timber-yielding plants, edible fruit plants or ornamentals propagated 
vegetatively and multiplied mainly through micropropagation. Illustration 


cases of many successful endosperm cultures are described here. 
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19.1 Introduction 


In a fertilization process, the egg fuses with one 
of the male gametes to form a zygote, which 


A. Kumar 
Department of Botany, University of Rajasthan, 
Jaipur, Rajasthan 302004, India 


N. Gupta (D<)) 

Department of Biotechnology, C.C.S. University, 
Meerut, Uttar Pradesh 250004, India 

e-mail: nidhi.05.gupta@ gmail.com 


Bir Bahadur et al. (eds.), Plant Biology and Biotechnology: Volume II; Plant Genomics 


afterward forms the embryo. The other male 
gamete fuses with the central cell containing two 
haploid nuclei. This second fusion is actually a 
double fertilization and triple fusion which often 
results in a triploid structure, the endosperm, and 
found to be present in all angiosperm families 
except Orchidaceae, Trapaceae and 
Podostemaceae. Such endosperm-raised triploid 
plants are generally sterile, but this seedlessness 
does not affect commercial utility of such plants, 
e.g. edible fruit plants, timber-yielding plants or 
ornamentals which are multiplied mainly through 
micropropagation or propagated vegetatively. 
The growth of triploids is generally higher than 
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respective diploids (Thomas and Chaturvedi 

2008). Also, triploids are more vigorous than dip- 

loids (Morinaga and Fukushima 1935). 

Rather than the typical pair of chromosomes, 

a cell having three complete sets of chromosomes 
is called triploid. To produce viable offspring, 
chromosomes need to occur in pairs. But due to 
chromosomal number three, the triploid plants 
are sterile as the odd numbers of chromosomes 
are unable to pair up properly. Such plants do 
flowering and bear fruits, but flowers cannot be 
fertilized and fruit is sterile. Some of the exam- 
ples of triploid crops are: 

e Seedless watermelons (Citrullus vulgaris) 
produced due to cross between tetraploid 
females and diploid males. These are com- 
mercially cultivated in Japan. 

e Triploid sugar beets (Beta vulgaris) produce 
larger roots with more sugar content. 

e TV29 of tea produced by Tea Research 
Association of India is cultivated in North 
India. It produces larger shoots and leaves and 
is tolerant to drought. 

¢ Cultivated banana (Musa paradisiacal) pro- 
duces larger and seedless fruits. 


19.2 Endosperm and Origin 
of Triploids 


Endosperm is a natural and unique triploid tissue 
in its origin, ploidy level and nature of growth. It 
is the triploid stage of the flowering plant which 
is produced by fusion of three haploid nuclei; two 
from the female gametophyte and one from the 
male gametophyte (Thomas et al. 2000). It lacks 
histological differentiation. Lampe and Mills’ 
(1933) first report on endosperm culture was on 
maize, whereas La Rue (1949) first reported the 
establishment of tissue cultures in maize from 
immature endosperm. Since then, mature and 
immature endosperm of various species has been 
shown to form continuously growing calli 
(Bhojwani and Razdan 1996). Johri and Bhojwani 
(1965) demonstrated totipotency of endosperm 
for the first time. They also demonstrated direct 
shoot formation from cultured mature endosperm 
of cherry ballart (Exocarpos cupressiformis). By 
the time, embryo/shoot/plantlet regeneration 
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from endosperm has been reached to dozen of 
species (Bhojwani and Razdan 1996). 

In tissue culture, endosperm tissues provide 
natural material for regenerating plants with trip- 
loid chromosome number, and thus, regeneration 
of plants from this tissue offers a direct method to 
produce triploids. A number of successful regen- 
eration reports of organogenesis and somatic 
embryogenesis are available. Endosperm culture 
(Johri and Bhojawani 1977), reviews on endo- 
sperm (Cheema and Mehra 1982; Bhatnagar and 
Sawhney 1981), micropropagation (Driver and 
Kuniyuki 1984), walnut tissue culture (Mc 
Granahan et al. 1987), embryo rescue (Mc 
Granahan et al. 1986), somatic embryogenesis 
(Tulecke and McGranahan 1985), triploids in 
woody perennials (Lakshmi Sita 1987), Hordeum 
vulgare (Sehgal 1974; Sun and Chu 1981), 
Triticum aestivum (Sehgal 1974) and Oryza 
sativa (Bajaj et al. 1980; Nakano et al. 1975) are 
already in records (Fig. 19.1). 


19.3. Production of Triploids 


Triploids can be produced by crossing an induced 
tetraploid plant with normal diploid plant. 
Tetraploids can be produced by treating the ter- 
minal buds of diploid plants with chemicals such 
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Fig.19.2 (a) Asexual triploid reproduction via parthenogenesis. (b) Triploid-diploid sexual reproduction 


as colchicine, oryzalin, pronamide, amiprophos 
methyl and trifluralin (Wan et al. 1991). However, 
such crosses are not always fortunate as it results 
in reduced seed setting compared to cross 
between two diploids (Sikdar and Jolly 1995). 
Moreover, seedling survival and seed germina- 
tion are also very low. Still, triploids play an 
important role in biomass and soil conservation 
and thus represent a significant importance in 
shrubs and trees. They help in preserving vast 
amounts of photosynthetic energy and thus pro- 
mote vegetative growth. Similarly, seedlessness 
is used to increase the quality of several fruits, 
like banana, papaya, grapes, apple, etc. In some 
plants, like Miscanthus sinensis, seed-sterile trip- 
loids have been grown to prevent seed dispersal 
in the environment (Petersen et al. 2002) 
(Fig. 19.2). 


19.4 Examples of Triploid Plants 


Triploid seedless trait has been described in many 
crops, especially in fruits. Artificially, triploid 
fruits are produced by first developing tetraploids 
using above-mentioned chemicals, which are 
then crossed with respective diploids. Such fruits 
are then commercially used. 


19.4.1 Watermelon (Citrullus 
vulgaris) 


When tetraploid females are crossed with diploid 
males, seedless watermelons (Citrullus vulgaris) 


are produced. Native African vine Citrullus 
lanatus (syn. C. vulgaris) derived modern variet- 
ies of the watermelon that are unable to produce 
viable gametes during meiosis, and thus, their 
ripened melons are seedless. Wild populations of 
C. lanatus var. citroides are common in Central 
Africa and give rise to domesticated watermelons 
var. lanatus (Robinson and Decker-Walters 
1997). Wild, ancestral watermelons (var. citroi- 
des) have a spherical, striped fruit and white, 
slightly bitter or bland flesh and are commonly 
known as the citron or citron melon (Fig. 19.3). 

Japan commercially grows seedless watermel- 
ons which are produced by crossing tetraploid 
female with diploid male lines. Reciprocal cross 
was also tried but was not successful. Seeds pro- 
duced by triploid plants are not true seeds; they 
are small in size having white rudimentary struc- 
tures like that of cucumber (Cucumis sativus) 
seeds. However, a few normal sized seeds may 
occur, but they are generally empty. It is also to 
be noted that all cultivated triploid watermelons 
do not have red pulpy flesh. They may have seed- 
less yellow, sweet flesh (Fig. 19.4). 


19.4.2 Little Gourd (Coccinia grandis) 


Babu and Rajan (2001) developed a triploid vari- 
ety of Coccinia grandis, fruit of which is used as 
a vegetable. It was also produced by crossing a 
normal diploid parent with colchicine-induced 
tetraploid. 2.4 % of seeds per fruit were observed. 
Morphologically, the triploid plants were some- 
what resembled to the diploid, but the substantial 
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Fig. 19.3 Citron melon 


features were its vigorous growth, increased fruit 
size, lower astringency and higher yield. 
However, these triploid fruits were tastier with 
good amount of vitamin A, vitamin C and iron 
and had less polyphenols; hence, they could be 
used as a salad crop. This plant also has many 
medicinal properties against diabetics, skin infec- 
tions and bronchitis (Table 19.1 and Fig. 19.5). 


19.4.3 Citrus 


Citrus fruits are the most extensively and primar- 
ily produced fruit tree crop in the world (FAO 
2009) for the fresh-fruit market, especially in the 
Mediterranean area. Area-wise, Spain is the main 
producer which covers a surface of 330,000 ha 
and produces about 6.3 million tons of citrus. 
Diploids are the available genetic resources 
for citrus fruit, and their naturally produce seeds 
include polyploid individuals. These natural 
polyploid plants can give rise to interesting char- 
acteristics in citrus fruit; thus, they are very use- 
ful for genetic breeding projects. CIRAD (French 
Agricultural Research Centre for International 
Development) has developed genetic breeding 
programmes for citrus fruit in the Mediterranean 
Basin to create triploid varieties of sterile and 
seedless fruit and tetraploid rootstocks resistant 





Fig.19.4 Triploid watermelon having red flesh 


to abiotic constraints, such as water deficiency or 
salinity, both from predominantly diploid genetic 
resources that would meet agronomic constraints, 
market expectations and consumer demand 
(Fig. 19.6). 


19.4.4 Mandarin 


AS per increasing consumer demand, seedless 
citrus fruits are the basic requirement for the 
fresh market. Mandarin triploid hybrids have this 
seedlessness trait as its one of the most important 
characteristics. The availability of a number of 
high-quality seedless varieties in mandarins is 
very low; thus the production and recovery of 
new Seedless triploid hybrids of mandarin variet- 
ies have a high priority for many citrus industries 
worldwide (Fig. 19.7). 

Citrus triploid hybrids can be recovered by 
2xx4x (Esen and Soost 1971b; Oiyama et al. 
1981; Starrantino and Recupero 1981), 2xx2x 
(Cameron and Frost 1968; Esen and Soost 197 1a; 
Geraci et al. 1975) and 4xx2x (Cameron and 
Burnett 1978; Esen et al. 1978; Aleza et al. 2009) 
sexual hybridizations as a consequence of the 
formation of unreduced gametes at low frequency 
(Aleza et al. 2010). 

For the first time, Esen and Soost (1971a) 
indicated that triploid embryos were mainly 
found in between one third and one sixth smaller 
seeds than normal seeds that do not germinate in 
conventional greenhouse conditions. However, 
still at relatively low germination percentages, 
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Table 19.1 Comparative evaluation of diploid, tetraploid and triploid of Coccinia grandis (Source: Babu and Rajan 


2001) 


Characters 

Days to flowering 

Flower size 

Fruit size 

Fruit length (cm) 

Fruit girth (cm) 

Fruit weight (g) 
Polyphenol per gram of fruit (fig) 
Fruit colour 

Leaf size 

Leaf thickness 

Fruit yield/plant/year (kg) 


Diploid 
40 
Medium 
Medium 
6.59 
7.40 
15.20 
0.300 
Green white strips 
Medium 
Medium 
10.32 





Fig.19.5 Coccinia grandis 





Tetraploid 

42 

Large 

Small 

4.49 

8.20 

14.50 

0.311 

Green with white strips 


Large 
Very thick 
9.34 


Triploid 
38 
Medium 
Large 
7.50 
11.60 
44.20 
0.090 
Green with white 
Medium 
Medium 
15.25 





Fig.19.6 Seedless lemon (Citrus limon) 


Fig.19.7 (a) Mandarin plant having flowers and fruits. (b) Seedless mandarin 
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the in vitro culture of whole seeds with their 
integuments can improve germination rates 
(Ollitrault et al. 1996). In rare cases, triploid 
hybrids can be found in conventional greenhouse 
seedlings, as in ‘A-12’ mandarin (Bono et al. 
2004) and ‘“Winola’ mandarin (Vardi et al. 1991). 


19.4.5 Neem (Azadirachta indica) 


Because of the arising use of neem and its prod- 
ucts in medicine, agriculture, cosmetics and ani- 
mal health care, it is an important and economic 
tree of India. Triploid plants of neem were 
obtained from immature endosperm culture 
(Chaturvedi et al. 2003). Over 66 % of the plants 
were triploid with chromosome number 36. A 
characteristic feature of the shoots of endosperm 
origin is the presence of a large number of multi- 
cellular glands. The selected triploids, expected 
to be sexually sterile, can be bulked up by micro- 
propagation (Fig. 19.8). 


19.4.6 Acacia (Acacia nilotica) 


Garg et al. (1996) describe somatic embryogen- 
esis and triploid plant regeneration from imma- 
ture endosperm cultures of Acacia nilotica, an 
important leguminous tree species suitable for 
afforestation of arid and marginal lands 
(Fig. 19.9). 
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19.4.7 Shanin (Petunia violacea) 


Gupta (1983) reported the formation of haploid, 
diploid and triploid plants by direct pollen 
embryogenesis in Petunia violacea. In certain 
species, especially in Petunia, an almost exclu- 
sive production of androgenic triploids has been 
reported which is useful in ornamental plants for 
the introduction of vigorous foliage and flowers 
(Fig. 19.10). 


19.4.8 Triticale 


Triticale, first bred in laboratories during the late 
nineteenth century, is one of the most successful 
synthetic allopolyploids produced by crossing 
tetraploid wheat or hexaploid wheat with rye. 
The grain was originally bred in Sweden and 
Scotland; however, now it is being grown com- 
mercially in many parts of the world, e.g. 
Germany, Canada, France and Poland (the largest 
area), covering an area of around 2.6 million 
hectares with an annual production of 8 million 
tons. Triticale high-yielding ability and grain 
qualities of wheat combined with tolerance abil- 
ity for adverse environment of rye provide its 
important and desirable features. In more than 
15 years, the yielding ability of triticales has been 
increased to about 90 %. However, in Sweden, 
the raw triticales yielded about 50 % of the stan- 
dard varieties of wheat. 





Fig. 19.8 Neem 


Fig.19.9 Acacia plant having flower and fruit 
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Fig.19.10 Shanin flower 


India have released three varieties of triticales: 
TL419, DT46 (amber colour grains) and TL1210. 
Although TL1210 grain yield is comparable to 
that of the best wheat varieties, its deep grain 
colour represents its chief drawback, thus mainly 
grown as a fodder crop in Punjab. To overcome the 
problem, Indian breeders have developed amber- 
coloured triticales by using white-seeded rye as 
one of the parents of the triticales (Fig. 19.11). 

Some other examples of allopolyploids are 
Raphanobrassica, the triploid (AAC) produced 
by crossing B. campestris (AA) with B. napus 
(AACC), Festuca-Lolium hybrids, allopolyploid 
clovers and some species hybrids in Rubus and 
jute (Corchorus sp.). 


19.4.9 Sugar Beet (Beta vulgaris L.) 


The triploid varieties of sugar beet are mixtures 
of diploid, triploid and other ploidy level plants. 
As compared to diploids, triploid sugar beets pro- 
duce more sugar and larger roots and 10-15 % 
higher yields per unit area, while tetraploids pro- 
duce smaller roots and_ lower _ yields. 
Commercially, Japan and Europe produce trip- 
loid varieties of sugar beet, but their popularity is 
declining rapidly. As the beet flower is small in 
size, triploid sugar beet seed production is quite 
difficult (Fig. 19.12). 

Triploid sugar beet seed may be produced by 
using any of the following two ways: (1) using 4x 





Fig. 19.11 Triticale 


plant as male and 2x as female or (2) using 4x 
plants as female and 2x as male. The first cross 
provides higher seed yield but a lower proportion 
of triploids, while the second gives lower seed 
yield but a higher proportion of triploids. 
Commercially, interplanting 4x and 2x lines in 
the ratio 3:1 is used for producing triploid sugar 
beet seed, and finally, seeds from both 4x and 2x 
plants are harvested. These harvested seeds con- 
sist of about 75 % triploid (3x) seeds. 


19.4.10 Cassava (Manihot esculenta) 


Cassava, commonly known as poor man’s crop, 
is an important root crop to be cultivated in tropi- 
cal countries and propagated by stem cuttings. It 
has become a subsidiary food in many countries. 
It is also used as cattle feed and its raw material 
for starch-based industries. Cultivated cassava 1s 
highly heterozygous and cross-pollinated, hav- 
ing a diploid number of chromosomes (2n = 36). 
Among artificially produced polyploids, cassava 
triploids have higher starch potential and a 
higher yield (Jos et al. 1987; Sreekumari and Jos 
1996). 

The first triploid variety of cassava named 
Sree Harsha was released in 1996 (Sreekumari 
et al. 1999) and was produced by crossing 
induced tetraploid plants with natural diploid. 
The use of a 2x female plant yielded better results 
than reciprocal crosses. Many features of triploid 
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Fig.19.12 Sugar beet: (a) seed, (b) flower, (c) root 


cassava make it superior than its diploid. These 
include higher harvest index, rapid bulking, 
higher yield, early harvestability, increased dry 
matter and starch content in the roots, shade 
tolerance and tolerance to cassava mosaic virus 
(Fig. 19.13). 


19.4.11 Tea (Camellia sinensis) 

Tea Research Association, India, has recently 
released a triploid clone of tea (Camellia sinensis 
var. assamica) for its commercial cultivation in 
northern parts of the country. This triploid culti- 
var, TV29, produces larger shoots and, thereby, 
biomass yields more cured leaf per unit area and 
is more tolerant to drought than the available dip- 
loid cultivars. The quality of the triploid clone is 
comparable to that of diploid cultivars used for 
making CTC (curl-tear-cut) tea (Fig. 19.14). 


19.4.12 Mulberry (Morus alba L.) 


Being an exclusive source of feed for silkworms, 
mulberry is an indispensable crop for the sericul- 
ture industry. Both natural and in-vivo-induced 
mulberry triploids have been reported (Das et al. 
1970; Katagiri et al. 1982; Dwivedi et al. 1989). 
Many of the triploid lines are superior to its dip- 
loids (Thomas et al. 2000), in cold and disease 
resistance (Hamada 1963) and in yield and nutri- 
tive qualities of leaves (Seki and Oshikane 1959). 
The endosperm callus differentiated shoots, 
which could be rooted, and full triploid plants 
have already been established in soil (Fig. 19.15). 


19.4.13 Ornamental 


Excised cultures of endosperm from immature 
fruits having zygotic embryo have been used at 
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Fig.19.14 Tea leaves 


early dicotyledonous stage to produce triploid 
annual phlox or Drummond’s phlox (Phlox 
drummondii Hook.) ornamental plants (Razdan 
et al. 2013). It was reported that over 70 % of 
annual phlox plants were triploid with a chro- 
mosome number of 2n=3x=21. The growth of 
triploids is generally higher than respective 
diploids (Thomas and Chaturvedi 2008). These 
triploid plants have greater size of leaves, stem, 
flowers and/or foliage with higher number of 
pollen and larger stomata as compared to natu- 
rally occurring diploid plants (Miyashita et al. 
2009). Moreover, triploid plant flowers showed 
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Fig.19.15 Mulberry plant with fruit 


enlarged central eye and bright colour, adding 
to their ornamental value (Razdan et al. 2008) 
(Fig. 19.16). 


19.4.14 Pomegranate (Punica 
granatum L.) 


Pomegranate is one of the oldest known 
fruit trees of the tropics and subtropics, cultivated 
for its delicious edible fruits. In addition, 
the tree is also valued for its pharmaceutical 
properties. 
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Fig.19.16 Phlox drummondii flowers 


19.5 Discussion 

Endosperm is a unique tissue in its origin, ploidy 
level and nature of growth. It is mostly formed by 
the fusion product of three haploid nuclei, one 
from the male gametophyte and two from the 
female gametophyte, and is, therefore, triploid. 
Traditionally, triploids are produced by crossing 
induced superior tetraploids and diploids. This 
approach is not only tedious and lengthy (espe- 
cially for tree species), but in many cases, it may 
not be possible due to high sterility of autotetra- 
ploids. In contrast, regeneration of plants from 
endosperm, a naturally occurring triploid tissue, 
offers a direct, single-step approach to triploid pro- 
duction (Bhojywani and Razdan 1996; Kumar 
2010; Kumar and Roy 2006, 2011; Kumar and 
Shekhawat 2009; Neumann et al. 2009). 


19.6 Conclusion 


In conclusion, gametic embryogenesis hold 
great promise for making a significant, low-cost 
and sustainable contribution to plant breeding, 
aimed at increasing farm productivity and food 
quality, particularly in developing countries and 
in an environmentally friendly way, helping to 
reduce the proportion of people suffering from 
chronic hunger and from diseases due to 
malnutrition. 
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Abstract 

Plant cell and tissue culture has been suggested as an alternative means 
for year-round production of secondary metabolites with an added 
potential of increasing yields by culture selection and manipulation, 
genetic transformation, hairy root cultures, and use of bioreactors for 
mass production. Secondary metabolite pathways and genes involved in 
those pathways have been identified, and regulation of transcription and 
transcription factors has been determined by studying functional genom- 
ics in conjunction with data-mining tools of bioinformatics. Besides this, 
advances in metabolic engineering enable researchers to confer new sec- 
ondary metabolic pathways to crops by transferring three to five, or 
more, heterologous genes taken from various other species. As an alter- 
native, the metabolic pathways of useful secondary metabolites have 
been modified to improve their productivity via genetic transformation. 
However, there is a need to understand metabolic pathways of secondary 
metabolism at the molecular level. Plant hairy roots offer a novel and 
sustainable tissue-based system that preserves multiple specialized cell 
types believed to be important in maintaining a better consistency in 
synthesis of bioactive secondary molecules. This paper will review state- 
of-the-art reports on improving production of secondary metabolites in 
tissue cultures in various plant species. 


Keywords 
Secondary metabolites ¢ Alkaloids * Saponins * Terpenoids ¢ Nicotine 


20.1 Introduction 


Plant cell and tissue culture technology has been 
known as a possible tool for studying both the 
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It has been used for production of large number 
of secondary metabolites. The degree of cellular 
differentiation and organization of the tissue, 
which is implied in a culture of this nature, 
favors the accumulation of these secondary 
compounds (Flores 1992; Wu et al. 2003, 2005; 
Kim et al. 2004; Kong et al. 2004; Thwe et al. 
2012). An increased secondary metabolite pro- 
duction is correlated with a slow cell division 
rate in cell suspension cultures (Lindsey and 
Yeoman 1983; Sharma et al. 2011). Similarly, 
secondary metabolite production at the station- 
ary phase of growth has been related with tissue 
organization (Tabata et al. 1972), morphological 
differentiation (Ramawat et al. 1985; Sharma 
et al. 2009), and low growth rates (Lindsey and 
Yeoman 1983). 

Controlled transcription of biosynthetic genes 
achieved by specific transcription factors is one 
major mechanism regulating secondary metabo- 
lite production in plant cells (Bopana and Saxena 
2010). Transcription factors are sequence- 
specific DNA-binding proteins that interact with 
the promoter regions of target genes and modu- 
late the initiation of mRNA synthesis by RNA 
polymerase I. These proteins regulate gene tran- 
scription depending on tissue type and/or in 
response to internal signals. The two well-studied 
secondary pathways are the phenylpropanoid 
pathways and its flavonoid branch and the terpenoid 
indole alkaloid biosynthetic pathway (Neumann 
et al. 2009). 

Hairy root cultures provide novel opportunities 
for production of valuable phytochemicals that are 
synthesized in roots. Hairy roots are developed 
by infecting plant leaf or stem tissue with 
Agrobacterium rhizogenes that transfers genes that 
encode hormone biosynthesis enzymes into the 
plants. Hairy root cultures have several advantages 
over undifferentiated plant suspension cell cul- 
tures. Hairy roots are genetically stable and grow in 
hormone-free culture media. Hairy roots show 
rapid growth and promote the synthesis of phyto- 
chemicals whose biosynthesis requires differenti- 
ated cell types. Hairy root lines producing valuable 
phytochemicals have been developed from various 
plant species (Dehghan et al. 2012; Cardillo et al. 
2013). Recently, have been used to hairy root cul- 
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tures to improve secondary metabolism com- 
pounds in Hyoscyamus niger (Zhang et al. 2004) 
and p-hydroxybenzoic acid (pHBA) glucose ester 
production in hairy roots of Beta vulgaris (Rahman 
et al. 2009), express foreign proteins or vaccine in 
tobacco (Shadwick and Doran 2007). Several 
TIAs’ biosynthesis genes have also been overex- 
pressed in C. roseus hairy root cultures (Zhao et al. 
2012a, b). Kochkin et al. (2013) demonstrated for 
the first time the presence of large amounts of gin- 
senosides malonyl-Rbl, malonyl-Rc, malonyl- 
Rb2, and malonyl-Rd in a suspension culture of 
Panax japonicus var. repens cell. 

Due to an increased appeal of natural products 
for medicinal purposes, metabolic engineering can 
have a significant impact on the production of phar- 
maceuticals and help in the design of new therapies 
(Bender and Kumar 2001; Kumar and Roy 2006, 
2011; Kumar and Sopory 2008, 2010; Neumann 
et al. 2009; Kumar and Shekhawat 2009; Kumar 
2010; Fernandez et al. 2010; Sharma et al. 2011; 
Kumar 2014). According to Bailey (1991), meta- 
bolic engineering is “the improvement of cellular 
activities by manipulation of enzymatic, transport, 
and regulatory functions of the cell with the use of 
recombinant DNA technology.’ Application of 
recombinant DNA methods can improve produc- 
tion of metabolite and protein products by altering 
pathways and regulate release process in down- 
stream processing. In many cases, this approach 
relies on the identification of limiting enzyme 
activities after successful pathway elucidation and 
metabolite mapping (metabolomics) (Neumann 
et al. 2009). Some of the important metabolites 
being produced in tissue culture and some tech- 
nologies to improve their production are presented 
in this review paper. 


20.2 Hairy Root Cultures 


20.2.1 Azadirachtin from Azadirachta 
indica A. Juss. (Neem) Cultures 


Azadirachtin (C35H44016), obtained from 
Azadirachta indica (neem), is a_ high-value 
secondary metabolite commercially used as a 
broad-spectrum biopesticide. Neem (Azadirachta 
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indica A. Juss.) plant tissue and cell culture have 
been used to obtain year-round production of 
azadirachtin and other neem metabolites with the 
added potential of increasing yields by culture 
selection and manipulation. Allan et al. (2002) 
established hairy root cultures from stem and leaf 
explants of Azadirachta indica A. Juss (neem) 
following infection with Agrobacterium rhizo- 
genes. ‘Transformation was confirmed using 
polymerase chain reaction analysis. Srivastava 
and Srivastava (2013) reported batch cultivation 
of Azadirachta indica hairy roots in different 
liquid-phase bioreactor configurations (stirred 
tank, bubble column, bubble column with poly- 
propylene basket, and polyurethane foam disk 
as root supports) to investigate possible scale-up 
of the A. indica hairy root culture for in vitro 
production of the biopesticide, azadirachtin. The 
hairy roots failed to grow in the conventional bio- 
reactor designs (stirred tank and bubble column). 
They reported batch cultivation of A. indica hairy 
roots in modified bubble column reactor (with 
polypropylene mesh support). The incorporation 
of a PUF disk as a support for the hairy roots 
inoculated inside the bubble column reactor 
facilitated increased biomass production and 
azadirachtin accumulation in_ hairy roots 
(Srivastava and Srivastava 2013). 


20.2.2 Tropane Alkaloids 
from Nicotiana tabacum 


Hairy root cultures of Nicotiana tabacum are a 
better alternative for tropane alkaloid production 
than cell suspension cultures, mainly because 
they are stable, both genetically and in alkaloid 
production during long subculture periods 
(Maldonado-Mendoza et al. 1993). The utility of 
hairy root cultures to produce valuable phyto- 
chemicals could be improved by repartitioning 
more of the desired phytochemical into the spent 
culture media, thereby simplifying the bioprocess 
engineering associated with the purification of the 
desired phytochemical. The majority of nicotine 
produced by tobacco hairy root cultures is retained 
within roots, with lesser amounts exuded into the 
spent culture media. Reduced expression of the 
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tobacco nicotine uptake permease (NUP1), a 
plasma membrane bound transporter, results in 
significantly higher nicotine accumulation in the 
media. Thus, NUP1-reduced expression lines pro- 
vide a genetic means to repartition more nicotine 
into the culture media (Dewey and Xie 2013; 
Murthy et al. 2014). 


20.2.3 Diosgenin from Trigonella 
foenum-graecum L. 
(Fenugreek) 


Plant tissue cultures (in vitro techniques) offer an 
opportunity to improve the plant properties via 
genetic engineering, and recently it has been used 
as a tool for genetic transformations. Trigonella 
foenum-graecum L. (in Arabic, Hulabah) is also 
employed as a herbal medicine in many parts of 
the world. Diosgenin provides about 50 % of the 
raw material for the manufacture of cortisone, 
progesterone, and many other steroid hormones 
and is a multibillion-dollar industry. However, the 
supply of diosgenin cannot currently satisfy the 
demands of the ever-growing steroid industry, 
and therefore new plant species and new production 
methods, including biotechnological approaches, 
are being researched (Verpoorte 2000; Neumann 
et al. 2009). 


20.2.4 Terpenoid Indole Alkaloids 
(TIAs) 


Terpenoid indole alkaloids (TIAs) are very 
important pharmaceutical compounds. The 
monomeric alkaloids serpentine and ajymalicine 
are used as a tranquilizer and to reduce hyper- 
tension, respectively. The dimeric alkaloids vin- 
cristine and vinblastine are potent antitumor 
drugs. The biosynthesis of TIAs is highly regu- 
lated and depends on tissue-specific factors as 
well as environmental signals. Low productivity 
is the main hindrance toward commercial appli- 
cation of the production of secondary metabo- 
lites by plant cells in suspension culture. 
Different strategies have been developed to 
overcome this problem (Kumar and Roy 2006; 
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Kumar and Sopory 2008, 2010; Kumar and 
Shekhawat 2009; Kumar 2010; Bopana and 
Saxena 2010; Sharma et al. 2011). 


20.2.5 Ginsenoside Saponin 


Han et al. (2013) established dammarenediol-II 
production via a cell suspension culture of 
transgenic tobacco overexpressing PgDDS. 
Dammarenediol-II is a biologically active tetra- 
cyclic triterpenoid, which is a basic compound of 
ginsenoside saponin and is a useful candidate 
with potentially biologically active triterpenes. 
Transgenic tobacco plants overexpressing 
PgDDS (AB122080) under the control of the 
CaMV35 promoter were constructed. 


20.2.6 Benzylisoquinoline Alkaloids 
(BIAs) 


This group of alkaloids is derived from aromatic 
amino acid tyrosine. BIAs include the narcotic 
analgesic morphine, the cough _ suppressant 
codeine, the muscle relaxants papaverine and 
(+)-tubocurarine, the antimicrobial compound 
sanguinarine, and the cholesterol-lowering drug 
berberine (Kong et al. 2004; Kong and von 
Aderkas 2007). Recently, efforts have been made 
to assemble BIA biosynthetic pathways in micro- 
organisms through the heterologous expression 
of multiple alkaloid biosynthetic genes (see 
Kumar and Sopory 2010; Nakagawa et al. 2011). 
Farrow et al. (2012) suggested that species-spe- 
cific metabolite accumulation is influenced by 
the presence or absence of key enzymes and 
perhaps by the substrate range of these 
enzymes. They provided a valuable functional 
genomics platform to test these hypotheses 
through the continued discovery of BIA biosyn- 
thetic enzymes. 

BIA noscapine is cough suppressant and 
promising anticancer agent (Dumontet and 
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Jordan 2010), BIA biosynthetic enzymes from a 
number of related plant species have been char- 
acterized using EST (Farrow et al. 2012). The 
integration of transcript and metabolite profiles 
predicts the occurrence of both functionally redun- 
dant and novel enzymes. 


20.2.7 Hypericin from Hypericum 
perforatum L. 


Hypericin is a traditional medicinal plant for the 
treatment of depression and wound healing, and 
hypericin is one of the main effective active 
substances. To optimize the culture system for 
producing hypericin in adventitious root, Wu 
et al. (2014) reported the use of balloon-type air- 
lift bioreactors. They investigated the effect of 
air volume, inoculation density, indole-3-butyric 
acid (IBA) concentration and methyl jasmonate 
(MeJA) concentration on hypericin content, 
and productivity during adventitious root culture. 
MeJA efficiently elicited the hypericin synthesis 
of H. perforatum adventitious roots (Wu et al. 
2014). 


20.2.8 Anthraquinones 


Baque et al. (2013) reported improve root growth 
and production of bioactive compounds such as 
anthraquinones (AQ), phenolics, and flavonoids 
by adventitious root cultures of Morinda citrifo- 
lia. They studied effects of aeration rate, inoculum 
density, and Murashige and Skoog (MS) medium 
salt strengths using a_ balloon-type bubble 
bioreactor. 


20.2.9 Isoflavone Production 
Isoflavones have an affinity to estrogen-f receptors 


in humans and are reported to exhibit numerous 
health-promoting effects, including the allevia- 
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tion of menopausal symptoms, the prevention of 
osteoporosis and cardiovascular diseases, and 
the lowering of risk of breast cancer (Patisaul 
and Jefferson 2010). The growing demand for 
isoflavonoid derivatives resulted in numerous 
research projects focused on the in vitro cultures 
of selected plant species of Fabaceae which is 
rich in isoflavones. Some of the best known 
phytoestrogens are genistein, genistin, daidzein, 
and puerarin (Patisaul and Jefferson 2010). 
Biotechno-logical production of isoflavones, 
mainly from the family Fabaceae, is based on sus- 
pension cultures of Pueraria sp. (Goyal and 
Ramawat 2008a, b; Sharma et al. 2009; He et al. 
2011), Psoralea sp. (Shinde et al. 2009a, b), and 
Glycine max (Federici et al. 2003; Terrier et al. 
2007). Calycosin, formononetin, and pseudobap- 
tigenin are also present in the more widespread 
legume Trifolium pratense (Kokotkiewicz et al. 
2013). 


20.3 Adventitious Root Culture 
Versus Hairy Root Culture 


Culture of adventitious roots in bioreactors offers 
several advantages such as faster growth rates, 
tremendous quantities of metabolite accumula- 
tion, and stable production year-round. This can 
also reduce production costs and time and final 
product quality can be more easily controlled 
(Lee et al. 2011). Baque et al. (2013), using large- 
scale bioreactors, raised adventitious root culture 
as an efficient and attractive alternative to cell, 
hairy root, or whole-plant cultivation for biomass 
and metabolite production. 


20.4 Bioreactors for Secondary 
Metabolite and Plant 
Propagules 


Recent advances with large-scale production 
have successfully produced ginseng roots in a 
10,000 | bioreactor establishing the feasibility of 
the root system to accommodate industrial 
processes (Sivakumar et al. 2006). A suitable air 
supply inside a bioreactor is an important factor. 
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Air volume promoted the hypericin production 
of adventitious roots (Wu et al. 2011). Higher 
efficiency of genetic transformation resulted not 
only from greater target tissue yield, but there 
was also evidence of improved transgenic event 
production with the tissue produced with airlift 
bioreactors than tissue produced in shaken flasks 
(Kong et al. 2013). 

Numerous studies have applied bioreactors in 
plant cell (Huang and McDonald 2012) and organ 
(Srivastava and Srivastava 2012) culture to obtain 
specific metabolites. Modern bioreactor culture 
systems provide a more advanced technology to 
produce higher secondary metabolites from plant 
cell, tissue, or organ using artificial nutrients with 
MeJA. Yu et al. (2002) found that the ginsenoside 
content was obviously enhanced by the addition 
of 100 uM MeJA during adventitious root culture 
of Panax ginseng; Donnez et al. (2011) examined 
that 0.2 mM MeJA was optimal for the efficient 
production and high accumulation of resveratrol 
in grape cell. 


20.5 Propagules in Bioreactors 


Besides this, a considerable number of research- 
ers have cultured plant propagules in bioreactors 
to produce high-quality seedling (Zhao et al. 
2012a, b). It is clear from these studies that 
temporary immersion bioreactor culture systems 
are appropriate for shoot multiplication and 
regeneration and the continuous immersion 
system is suitable for the proliferation of propa- 
gules (without leaves) such as bulblets (Kim et al. 
2004), PLBs (Yang et al. 2010), and rhizomes 
(see Gao et al. 2014). Kong et al. (2013) con- 
structed and tested airlift bioreactors (ALBs) for 
their potential to enhance chestnut embryogenic 
tissue proliferation for genetic transformation 
and mass propagation. 


20.5.1 Ginsenoside Production 
Panax ginseng roots have been widely used in 


Chinese traditional medicine since ancient times 
owing to their stimulating and tonic properties. 
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The pharmacological activities of ginseng or its 
crude extracts are based on the presence of a 
mixture of triterpenic saponins referred to as 
ginsenosides. The two major groups of ginsen- 
osides are the Rb and Rg groups, which have 
proto-panaxadiol and protopanaxatriol, respec- 
tively, as the sapogenins (Mallol et al. 2001). Rb 
group includes the ginsenosides Rb1, Rb2, Rc, 
and Rd, while Rg group includes the Re, Rf, and 
Rgl ones as the main compounds. Among all 
these ginsenosides, Rb1 and Rgl are the most 
effective compounds (Tanaka and Kasai 1984; 
Mallol et al. 2001). Ginsenosides accumulate in 
the root of the plant, but the agricultural produc- 
tion of roots is expensive. Therefore, the produc- 
tion of ginsenosides, by means of different 
biotechnological alternatives, has been extensively 
studied by a number of researchers, using callus 
tissues (Mallol et al. 2001), cell suspensions 
(Mathur et al. 1994; Kochkin et al. 2013), and 
root cultures (Yoshikawa and Furuya 1987); nev- 
ertheless, the productivity obtained so far has 
been low because of the low growth rates of cul- 
tures. The induction and establishment of hairy 
roots after the infection of Panax ginseng rhi- 
zomes with Agrobacterium rhizogenes has been 
successfully performed (Washida et al. 1998). 
These roots grow more rapidly and produce 
higher levels of saponins than the ordinary cul- 
tured roots obtained by hormonal control. In the 
case of agropine-type strains (such as A. rhizo- 
genes A4), two T-DNA fragments (TL-DNA and 
TR-DNA) are separately transferred into the 
plant material (Jouanin 1984). The integration of 
the TL-DNA into the plant genome is essential 
for developing transformed roots; three genes of 
this fragment, known as rolA, rolB, and rolC, are 
responsible for the full hairy root syndrome 
(Palazon et al. 1998). Although the TR-DNA is 
not essential for hairy root formation, it has been 
shown that the aux! genes harbored in this T-DNA 
segment provide to the transformed cells with an 
additional source of auxin. Aux genes play a sig- 
nificant role in the morphology and alkaloid pro- 
duction of transformed roots of Datura metel and 
Duboisia hybrid (Mallol et al. 2001). 
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20.5.2 Resveratrol 1 


These plant polyphenols have received consider- 
able interest based upon a number of associated 
health benefits (Baur and Sinclair 2006; Delmas 
et al. 2006). Most notably, the significant levels 
of resveratrol | in red wine have been credited to 
the phenomenon known as “the French Paradox,” 
Wherein low incidence of heart disease is 
observed among a population with a relatively 
high-saturated-fat diet and moderate wine con- 
sumption (Frankel et al. 1993). Over the past two 
decades, numerous health benefits impacting 
cardiovascular disease, various cancers, athero- 
sclerosis, and aging have been linked with resve- 
ratrol | (reviewed; Baur and Sinclair 2006; 
Roupe et al. 2006). The majority of resveratrol- 
containing dietary supplements are composed of 
unknown/unidentified botanical components 
wherein resveratrol | and resveratrol derivatives 
only make up a small fraction of the product. 
While chemically synthesized resveratrol 1 may 
address this issue, natural sources often contain 
derivatives, cofactors, and other phytonutrients 
that provide added or synergistic benefits to the 
nutraceutical product and are often preferred by 
the consumer (Wallace 1998). Recent studies 
showing antiaging benefits of resveratrol 1 (Baur 
et al. 2006) further accelerate interest in a natu- 
ral, food-grade source of enriched resveratrol/ 
resveratrol derivatives that delivers a more 
defined and consistent product composition and 
ensures a stable supply chain, and several biotic 
production strategies targeting recombinant 
plants, yeast, and bacteria have been advanced 
(Watts et al. 2006). 


20.5.3 Camptothecin (CPT) 


Camptothecin (CPT), a monoterpene indole alkaloid, 
has been found in several plant species including 
Camptotheca acuminata, Nothapodytes foetida, 
and Ophiorrhiza pumila (Saito et al. 2001; 
Lorence and Nessler 2004). Since it possesses 
topoisomerase I poisoning properties, its semi- 


20 Improving Secondary Metabolite Production in Tissue Cultures 


synthetic derivatives, topotecan and irinotecan, 
have been developed to be clinically used as 
anticancer drugs. Previously, we have estab- 
lished a hairy root culture of O. pumila which 
has already been shown to be a desirable exper- 
imental system to study the biosynthesis of 
camptothecin, since the culture produces a high 
level of CPT and excretes it into the culture 
medium (Sirikantaramas et al. 2007). 


20.5.4 Catharanthus roseus 
(Madagascar Periwinkle) 


As an important medicinal plant, Catharanthus 
roseus (Madagascar periwinkle) produces a 
large amount of terpenoid indole alkaloids 
(TIAs). Among them, vinblastine and vincris- 
tine are important antitumor bisindole alkaloids. 
However, these two anticancer compounds are 
produced at a very low level in C. roseus leaves, 
about 5.8 pg/g for vinblastine and 0.9 ug/g 
(fresh weight) for vincristine, leading to their 
high price in the market (Favretto et al. 2001). 
The lack of vinblastine and vincristine in C. 
roseus hairy roots has been ascribed to an 
absence of vindoline (Bhadra et al. 1998). This 
may be due to the undetectable expression of 
the D4H and DAT genes in the transgenic hairy 
roots. Very recently the DAT gene, which is 
responsible for the terminal step of vindoline 
biosynthesis in C. roseus, was overexpressed in 
C. roseus hairy roots. Interestingly, overexpres- 
sion of DAT did not increase vindoline produc- 
tion but improved the accumulation of another 
monoterpenoid indole alkaloid, horhammeri- 
cine (Magnotta et al. 2007). Biotechnological 
methods may provide an efficient alternative 
for producing natural products since a number 
of genes involved in the TIAs’ biosynthetic 
pathway have been cloned (Pasquali et al. 
2006; Wang et al. 2010; Zhou et al. 2011). 
Deacetylvindoline-4-O-acetyltransferase 
(DAT) is a key enzyme for the terminal step of 
vindoline biosynthesis. In this research, the 
DAT gene promoter was cloned, sequenced, 
and analyzed. 
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Introduction 


Leela Sahijram 


Abstract 

Plants generally exhibit cytogenetic and genetic variations that are helpful 
to plant breeders for crop improvement. When such variants arise through 
the cell and tissue culture process, using any plant portion as an explant 
material, these are termed ‘somaclonal variations’ (SV). Variants obtained 
using callus cultures are referred to as ‘calliclones’, while variants obtained 
using protoplast cultures are known as “protoclones’. On the other hand, 
‘gametoclonal variation’ refers to variations arising in cell cultures of 
gametic origin, as in pollen and microspore cultures, to distinguish them 
from somatic cell-derived regenerants. Somaclonal variation is a double- 
edged sword whereby its presence in micropropagation programmes is 
inimical, while it can be gainfully exploited to create stable variations, e.g. 
disease resistance, where other methods fail or are cumbersome. 


Keywords 
Somaclonal variation ¢ Genetic stability * Gametoclonal * Detection of 
variants ¢ Useful variants 


‘somaclonal variation’ was proposed to describe 
genetic variation in plants regenerated from any 


Somaclonal variation (SV) is a phenotypic varia- 
tion either genetic or epigenetic in origin dis- 
played among somaclones (soma=vegetative; 
clone=identical copy) and occurs among plants 
regenerated from tissue culture. A general term 
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form of cell cultures. Accordingly, plants derived 
from cell and tissue cultures are termed ‘soma- 
clones’. Somaclonal variation has come to repre- 
sent genetic variability present among all kinds 
of cells/plants obtained from cultures in vitro. SV 
can be problematic during micropropagation and 
in vitro conservation and in genetic transforma- 
tion of crop plants, although it may be put to 
good use as a tool in plant breeding. Plants regen- 
erated from tissue and cell cultures show herita- 
ble variation for both qualitative and quantitative 
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traits. Several useful somaclonal variants too 
have been obtained in a large number of plant 
species such as potato, sugarcane, banana, 
tomato, etc. Somaclonal variation 1s well docu- 
mented in the widely commercialized tissue 
culture-raised fruit crop, banana (Musa spp.). 

Variants obtained using callus cultures are 
referred as calliclones, while variants obtained 
using protoplast cultures are known as protoclones. 
Larkin and Scowcroft (1981) proposed a general 
term ‘somaclonal variation’ to describe genetic 
variation in plants regenerated from any form of 
cell cultures. Accordingly, the plants derived from 
cell and tissue cultures are termed as somaclones, 
and the plants displaying variation as “‘somaclonal 
variants’. However, generally the term somaclonal 
variation is used for genetic variability present 
among all kinds of cell/plants obtained from cell 
cultures in vitro. Plants regenerated from tissue and 
cell cultures show heritable variation for both qual- 
itative and quantitative traits. Several useful soma- 
clonal variants have been obtained in large number 
of plant species such as potato, sugarcane, banana, 
tomato etc. Chaleff (1981) labelled plants regener- 
ated from tissue cultures as Ry generation and their 
successive sexual generations as R;, Ro, etc. 

SV can be problematic during micropropaga- 
tion and in vitro conservation and in genetic 
transformation of crop plants, although it may be 
put to good use as a tool in plant breeding. These 
changes are heritable. Early detection of SV is, 
therefore, very useful. Shoot-tip culture preserves 
genetic stability much better than callus or cell 
suspension cultures, yet somaclonal variation 
appears to be widespread among plants regener- 
ated from banana shoot-tip cultures. Off-type fre- 
quencies are reported to vary from | to 74 %. 

In banana, a globally important fruit crop that 
is extensively micropropagated, it is even more 
pertinent to study SV as the crop is especially 
prone to this phenomenon. To date, somaclonal 
variation affecting in vitro propagated banana is 
not well understood, suggesting a complex genetic 
cause of this phenomenon. A molecular biology- 
based approach of analysis would help throw light 
on causes and detection of variants to cure this 
scourge of the banana micropropagation industry. 
In vitro conditions can induce mitotic instability. 
Labile portions are known to exist in the genome 
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rendering it susceptible. These portions get modu- 
lated when cells undergo ‘stress’ in tissue culture, 
resulting in higher rearrangement and mutation 
rates than other portions of the genome. 
Occurrence of hotspots of mutation and recurring 
menus of alternative alleles is consistent with this 
response being limited to a sub-fraction of the 
genome. In banana, where production of soma- 
clonal variants is substantial, only those plants 
that show side shoots as well with the same type 
of variation are considered as ‘variants’. 


21.2 Gametoclonal Variation 


Another term for variations arising due to the tis- 
sue culture process is gametoclonal variation for 
variations arising in cell cultures of gametic ori- 
gin, like in pollen and microspore cultures, to dis- 
tinguish them from somatic cell-derived 
regenerants. 

When gametic cells are cultured under in vitro 
conditions and variations observed in such cul- 
tures, these are called gametoclonal variations. 
Products obtained from gametoclonal variations 
are termed gametoclones. In gametoclonal varia- 
tion, gametes (being products of meiotic division) 
possess only half the number of parent chromo- 
somes. Gametoclones can be developed by cultur- 
ing male or female gametic cells. Anthers or 
isolated microspores are widely used for develop- 
ing gametoclones. A large number of plants have 
been regenerated from gametoclonal variations 
like Oryza sativa, Nicotiana tabacum, Brassica 
napus and Hordeum vulgare. Improvements have 
been made in several plant species through game- 
toclonal variation, e.g. rice, wheat and tobacco. 
There are three major reasons that can cause 
genetic variations in gametoclones: 

e The technique used in cell culture may induce 
genetic variation(s). 

¢ Doubling of haploid chromosomes may gen- 
erate variation(s). 

¢ Heterozygosity in diploids may induce genetic 
variation(s). 

e Variations may result from segregation and 
independent assortment. 

Gametoclones differ from somaclones in three 
distinct ways: (1) Gametoclones regenerate into 
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haploid plants in comparison to somaclones which 
develop into diploid plants. (2) The recombination 
process occurs by meiotic crossing over in game- 
toclonal variation. (3) Gametoclones can be stabi- 
lized by doubling their chromosome number. 


21.3 Factors Contributing 
to Occurrence of Somaclonal 
Variation 


¢ Genotype 

Obvious chromosome breakages or aberrant 

number of chromosomes are found even in 

conventional sucker-grown plants. However, 
these defects get magnified in plants grown in 
tissue cultures. 

e Ploidy level 

In general, more incidence of SV is observed 

with increase in ploidy. 
¢ Number of subculture cycles 

Restricting the number of subculture cycles to 

five to eight is considered safe. SV is known to 

increase to 2.9 and 3.8 % at 9th and 11th sub- 
cultures, respectively, i.e. mutation rate is 
higher in prolonged culture. Almost all soma- 
clonal variants produce poor quality bunches. 
¢ Overdosing with hormones in vitro 
e Starter material: sword suckers vs. water 
suckers 

The basic cause of these variations may be 
attributed to changes in karyotype (chromosome 
number and structure), chromosome rearrange- 
ments, somatic crossing over, sister chromatid 
exchange, DNA amplification and deletion, 
transposable elements and DNA methylation. 
Somaclonal variation can be characterized based 
on morphological, biochemical (isozymes) and 
DNA markers such as random amplified poly- 
morphic DNA (RAPDs), restriction fragment 
length polymorphism (RFLPs) and inter-simple 
sequence repeats (ISSR). 

The variations could also arise in tissue cul- 
ture due to physiological changes induced by the 
culture conditions. Such variations are temporary 
and are caused by epigenetic changes. These are 
non-heritable variations and disappear when the 
culture conditions are removed. Changes in DNA 
methylation pattern have been implicated. 
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There are different approaches (steps) to cre- 

ate somaclonal variations, which include: 

1. Growth of callus or cell suspension cultures 
for several cycles 

2. Regeneration of a large number of plants from 
such long-term cultures 

3. Screening for desirable traits in the regener- 
ated plants and their progenies. For example, 
in vitro selection to select agronomically desir- 
able somaclones for tolerance to various biotic 
and abiotic stresses, herbicides, high salt con- 
centration and extremes of temperature 

4. Testing of selected variants in subsequent 
generations for desirable traits 

5. Multiplication of stable variants to develop 
new breeding lines 
To be of commercial use, a somaclonal variant 

must fulfil certain basic requirements: 

1. It must involve useful characters. 

2. It should be superior to the parents in the 
character(s) in which improvement is sought. 

3. The improved character(s) must be com- 
bined with all other desirable characters of 
the parent. 

4. The variations must be inherited stably 
through successive generations by chosen 
means of propagation. 


21.3.1 Identification of Variants 


There are identifiable and predictable DNA 
markers for early diagnosis of SV. DNA methyla- 
tion has been’ recognized to cause 
SV. Representational difference analysis (RDA) 
has been employed to isolate unique fragments 
(‘difference products’) between visible off-types 
and ‘normal’ tissue culture (TC)-derived plants. 
Various other molecular techniques are available 
to detect sequence variation between closely 
related genomes such as those of source plants 
and somaclones, viz. RAPD, AFLP (including 
MSAP — methylation-sensitive amplification 
polymorphism), microsatellites, etc. 

Somaclonal variants have also been shown to 
have gibberellic acid profiles different from those 
of normal TC plants. Overdosing with cytokinins 
and culture frequency or number (or both) have 
all been shown to cause SV. However, genome is 
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the predominant predisposing factor for occur- 
rence of somaclonal variants. 


21.3.2 Exploiting Somaclonal 
Variations to Advantage 


Plants generally exhibit cytogenetic and genetic 
variations that are helpful to plant breeders in 
crop improvement. When such variants arise 
through the cell and tissue culture process using 
any plant portion as an explant material, varia- 
tions arising are termed as somaclonal variations. 
Hwang and Ko (2004) in Taiwan successfully 
developed ‘Cavendish’ banana cultivars resistant 
to Fusarium wilt acquired through somaclonal 
variation. There is no known source of natural 
resistance within the banana group to this deadly 
deuteromycetan pathogen of the bananas. 


21.3.3 Disadvantages of Tissue 
Culture-Propagated (TCP) 
Plants 


The primary disadvantage is somaclonal varia- 
tion (SV). Technically, somaclonal variation 1s 
a phenotypic variation, either genetic or epi- 
genetic in origin, displayed among somaclones. 
In the context of TC industry, it can be loosely 
defined as a genetic/epigenetic change in the 
plant system inimical (harmful) to commercial 
production, because the primary objective of 
micropropagation is to produce true-to-type 
planting material of a desired clone. 
Micropropagated bananas have shown a high 
propensity for variation. Considerable research 
on multiplication schedules, phytohormone 
concentrations and relative proportions in vitro 
has been performed to minimize/eliminate the 
problem. Additional issues affecting produc- 
tion success and marketing strength are (1) 
selection and quality of explant donor (source) 
plants and (2) recognition and control of 
endogenous (nonpathogenic) bacteria/contam- 
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inants, with adherence to acceptable commer- 
cial principles. 

Micropropagated banana plants can also be 
more susceptible to various pests and diseases. 
Infection can be overcome by inoculation of the 
rhizosphere or the root system of these micro- 
propagated plants with fungi or mycorrhizae and 
improved growth with beneficial bacteria. 

In short, major disadvantages are: 

1. Inability to lower high frequency of off-types 
in certain cultivars 

2. Inability to deal with endogenous bacteria or 
bacter1um-like contaminants contributing to 
significant losses in, and immediately after, 
culture 

Thus, micropropagation of bananas has not 
been without obstacles. SV has been wide- 
spread in both bananas and plantains. The result 
is an increase in the number of off-types and 
this is genotype dependent (Sahijram et al. 
2003a, b, c). Several distinct off-types have 
been described, including dwarfism, in the 
Cavendish group. Inflorescence changes have 
been reported in plantain. Low-vigour and low- 
production off-types are also very common. 
Apart from obvious visual characteristics such 
as distorted/mosaic/variegated plants, off-types 
cannot usually be identified at the in vitro stage. 
Some undesirable somaclones in commercial 
banana micropropagation are illustrated below 
in Figs. 21.1 and 21.2. 


Factors to Consider 

for Dealing with Off-Type 

Production 

e Off-types can arise at any time in culture. 

e They may arise from stimulation of adventi- 
tious buds. 

¢ Growth regulators do not directly induce 
mutagenesis (their effect is indirect). 

e Nursery screening procedures are available 
for rouging off-types. 

¢ Cavendish types can be identified by morpho- 
logical characters. 

¢ Molecular markers can be used. 


21.3.3.1 
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Fig.21.1 Dwarf off-type 
somaclonal variant banana 
plant showing abnormal, 
spindle-shaped leaf lamina 
(a) and leaf texture/colour 
variants (b) (Source: 

R. DoreSwamy, formerly 
at IIHR, Bangalore) 


LEATHERY LEAF WITH 
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21.3.3.2 Methods for Minimizing 
Variants 

e Correct mother-plant selection 

¢ Low-cytokinin medium for multicycles 

e Correct dissection protocols 

e Minimal subculture cycles 

e Correct subculture cycle time 

¢ Clonal conformity during multicycle 

¢ Culling out variants during primary and sec- 
ondary hardening cycles 

e Grading and removal of variants at planting 


21.3.4 Molecular Basis of Somaclonal 
Variation 


Molecular basis of somaclonal variation is not 
precisely known, but both genetic and epigenetic 
mechanisms have been proposed (Cullis et al. 
2007). Banana is a crop where in vitro culture is 
widely practised on a commercial scale. Available 
evidence points toward the existence of labile 
portions of the genome that can be modulated 
when cells undergo the stress of tissue culture. 
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Fig. 21.2 Variant: bunch morphology — an abnormal 
banana bunch showing split fingers 


Therefore, the hypothesis that there are identifi- 
able and predictable DNA markers for the early 
diagnosis of somaclonal variation has been 
tested. Representational difference analysis 
(RDA) was used to isolate unique fragments of 
DNA (difference products) between visible 
culture-induced off-type and normal banana 
plants. Markers generated from six difference 
products differentiated between some of the off- 
type and normal pairs. The genomic region 
around one of these difference products has been 
extensively characterized and has a high degree 
of polymorphism, with variation in up to 10 % of 
the nucleotides sequenced in the region. This 
same region has been shown to vary in other pairs 
of off-type and normal banana plants derived 
from tissue culture as well as in plants propa- 
gated commercially in vitro. The data are consis- 
tent with the hypothesis that there is at least one 
particularly labile portion of the genome that is 
especially susceptible to the stress imposed dur- 
ing tissue culture and that is associated with 
higher rearrangement and mutation rates than 
other portions of the genome. Consequently, the 
regions that are reported here have the potential 
to be used as early detection tools for identifying 
somaclonal variants. 

Lakshmanan Venkatachalam et al. (2007), in 
their study, used two PCR-based techniques, 
RAPD and ISSR, for identification of genetic 


variations in tissue culture-derived banana 
plantlets because of their simplicity and cost- 
effectiveness. The use of the two types of mark- 
ers that amplify different regions of the genome 
allows better chances for identification of 
genetic variation in the plantlets. Although this 
study did not detect any genetic change, it is 
possible that some changes may have occurred 
that went undetected as there is a possibility of 
point mutations occurring outside the priming 
sites. This study screened a large number of ran- 
dom primers common for higher plants, some of 
which are present in monocots. Since no changes 
were seen in the banding pattern of tissue cul- 
ture plants compared to that of the mother plant, 
they concluded that micropropagation protocol 
developed by them for banana, var. Nanjanagudu 
Rasabale, may be applicable for a considerable 
length of time without any significant risk of 
generating genetic instability. 

The extent of DNA methylation polymor- 
phisms was evaluated by Peraza-Echeverria et al. 
(2001) in micropropagated banana (Musa AAA 
cv. ‘Grand naine’) derived from either the vegeta- 
tive apex of the sucker or the floral apex of the 
male inflorescence using the methylation- 
sensitive amplification polymorphism (MSAP) 
technique. In all, 465 fragments, each represent- 
ing a recognition site cleaved by either or both of 
the isoschizomers, were amplified using eight 
combinations of primers. A total of 107 sites 
(23 %) were found to be methylated at cytosine in 
the genome of micropropagated banana plants. In 
plants micropropagated from the male inflores- 
cence explant, 14 (3 %) DNA methylation events 
were polymorphic, while plants micropropagated 
from the sucker explant produced 8 (1.7 %) poly- 
morphisms. No DNA methylation polymorphisms 
were detected in conventionally propagated 
banana plants. These results demonstrated the 
use of MSAP to detect DNA methylation events 
in micropropagated banana plants and indicate 
that DNA methylation polymorphisms are asso- 
ciated with micropropagation. 

Biochemical, isozyme activities and molecular 
marker, DNA fingerprinting were used to analyse 
somaclonal variations in tissue culture-derived 
banana plants BBTV and CMV tested. Variation 
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was found in glutamate oxaloacetic and polyphe- 
nol oxidase isozyme (DISC-PAGE) activities as 
well as peroxidase activity among in vitro regen- 
erated banana plants. Fifteen arbitrary-base prim- 
ers were successfully used to amplify DNA 
extracted from banana plants propagative in vivo 
and in vitro of these, four primers revealed char- 
acteristic. DNA fingerprinting revealed genetic 
variations and 25 % polymorphisms. The fre- 
quency of somaclonal variations was found to be 
dependent upon number of subcultures (times of 
micropropagation cycles). The genetic variations 
were only detected in 7-month-old cultures. It 
was observed also that morphologically abnormal 
shoots and change in chlorophyll showed genetic 
variations at the molecular level. 

Tissue culture propagation of banana has 
gained attention due to its potential to provide 
genetically uniform, pest-free and disease-free 
planting materials. A large number of varieties of 
banana are multiplied in vitro the world over for 
commercial cultivation. 

Yuval Cohen and co-workers (2007) analyzed 
somaclonal variants in date palm (Phoenix dacty- 
lifera). Propagation of date palms using tissue 
culture techniques is very advantageous. 
However, among normal trees, many off-type 
trees are generated. These abnormal phenotypes 
include variegation, ‘low level fruit setting’ and 
dwarfism. The last two tend to occur in mass 
numbers mainly in plantlets of specific tissue cul- 
ture laboratories. While some off-types are easily 
detected in young plantlets, others are commonly 
detected in the field only years after planting. 
Therefore, both preventing generation of and 
developing methods for early detection of these 
off-types are necessary. Their study applied 
molecular techniques to characterize abnormal 
phenotypes where they applied amplified frag- 
ment length polymorphism (AFLP) to character- 
ize all three, common date-palm_ off-type 
phenotypes. In the variegated trees, multiple 
mutations seemed to occur. In contrast, only rela- 
tively few mutations were detected among ‘low 
level fruit setting’ and dwarf trees. No single spe- 
cific mutation was found to be associated with 
these phenotypes. Differences in DNA methylation 
patterns were seen among the off-type trees. 
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Reduction in overall DNA methylation levels 
appeared to be associated with ‘low fruit setting’ 
phenotype. Changes in DNA methylation pat- 
terns have been previously suggested to be 
involved in the generation of somaclonal varia- 
tion. It seems that, similarly, changes in DNA 
methylation patterns occurring during the tissue 
culture process may generate a developmentally 
altered scheme. This may result in off-types 
which can be detected only long after planting of 
the trees. Specific sequence variations and/or 
DNA methylation assays can be useful in detecting 
off-type plantlets before they are field planted. 
Other studies on the use of molecular tech- 
niques for analyzing somaclonal variants in plant 
species of commercial importance are also avail- 
able (Sahijram et al. 2010). Goto et al. as early as 
in 1998 determined genetic stability of long-term 
micropropagated shoots of Pinus thunbergii 
Parl., using RAPD markers. RAPD assessment 
was also carried out by Carvalho et al. (2004) for 
the identification of clonal identity and genetic 
stability of in vitro propagated chestnut hybrids; 
Rahman and Rajora (2001) used microsatellite 
DNA for detecting somaclonal variation in 
micropropagated trembling aspen (Populus trem- 
uloides). Palombi and Damiano (2002) success- 
fully used and made comparison between RAPD 
and SSR molecular markers for detecting genetic 
variation in kiwifruit (Actinidia deliciosa). 


21.3.5 Genetic Stability 
and Molecular Profiling 


ISSR technique is also very simple, fast, cost- 
effective, highly discriminative and reliable. At 
present, RAPD and ISSR markers have been suc- 
cessfully applied to detect the genetic similarities 
or dissimilarities in micropropagated material in 
various plants. Molecular profiling has recently 
been done to assess genetic stability of three eco- 
nomically important banana cultivars of the 
lower Indo-Gangetic plains by Ray et al. (2006) 
using RAPD and ISSR markers. Venkatachalam 
et al. (2007) carried out genetic analysis of 
micropropagated and regenerated banana plantlets 
as assessed by RAPD and ISSR markers. 
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Lakshmanan et al. (2007) earlier studied 
banana cv. Nanjanagudu Rasabale (NR) (silk 
group, AAB) to assess the impact of protocol 
and long-term in vitro effects on induction of 
somaclonal variation. In this study, they adopted 
two PCR-based techniques, RAPD and ISSR, 
for the identification of somaclonal variants 
because of their simplicity and cost-effective- 
ness. The use of two types of markers, which 
amplify different regions of the genome, allows 
better chances for identification of genetic vari- 
ation in the plantlets. 

A large number of micropropagated plantlets 
of NR developed from axillary shoot bud explants 
over 10 years ago were screened for genetic vari- 
ation, if any, using RAPD (random amplified 
polymorphic DNA) and ISSR _ (inter-simple 
sequence repeats) markers. Of the 4,000 in vitro 
plantlets, 11 were used for screening that involved 
shoot cultures with distinct variation in morpho- 
logical characteristics (morphotypes). Similarly, 
the mother maintained in the field was also sub- 
jected to genetic analysis. Out of the 50 RAPD 
and 25 ISSR primers screened, 30 RAPD and 5 
ISSR primers produced a total of 424 clear, dis- 
tinct and reproducible band classes resulting in a 
total of 5,088 bands where the banding pattern 
for each primer was highly uniform and compa- 
rable to that of field-grown mother clone from 
which these cultures had been established. This is 
the first report on the use of genetic markers to 
establish genetic fidelity of long-term microprop- 
agated banana using RAPD and ISSR. Since 
there were no changes in the banding pattern 
observed in tissue culture plants as compared 
with that of mother plant, they concluded that 
that their micropropagation protocol for banana 
var. Nanjanagudu Rasabale can be carried out for 
long without much risk of genetic instability. 


21.3.6 International Scenario 


Two crops of major commercial importance have 
suffered in the past from susceptibility to soma- 
clonal variation. Both of these are monocots, viz. 
banana/plantain group and oil palm. Appearance 
of off-types during the in vitro multiplication 
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process is an important drawback for mass prop- 
agation of bananas. Visual screening at prehard- 
ening and hardening phases in the greenhouse 
helps detect putative off-types which can then be 
eliminated. In any micropropagation pro- 
gramme, 3—5 % somaclonal variation is permis- 
sible, but in bananas, up to 10 % variation is 
permitted owing to the flexible genetic make-up 
of the crop. 

There is an increasing demand for quality and 
uniformity in developing dessert banana TC prod- 
ucts for international trade (Altman and Loberant 
2000). Many commercial banana-producing 
countries use TC plants in an annual or biennial 
crop cycle to improve yield and to reduce disease 
pressure. Greenhouse production using TC plants 
in Canary Islands (Spain) resulted in an increase 
in yield from 47.3 t/ha/year under conventional 
production systems to 83.7 t/ha/year. 

Large-scale commercial production has devel- 
oped particularly in Taiwan, France, South 
Africa, Israel, etc. In vitro plantlets are sent to 
their destination where they are acclimatized, 
placed in shade nurseries and planted in prepared 
fields. By the 1990s, companies were providing 
millions of plantlets annually for commercial 
plantations in South Africa, in the continent of 
Africa and in Southeast Asia. In S. America 
alone, nearly 100 million micropropagated 
banana plants were planted in a decade since the 
1980s. Israel produces micropropagated banana 
for domestic consumption and export. With a 
highly developed micropropagation industry, 
coupled with an academic infrastructure and 
experienced nursery/field agronomists, Israeli 
companies have been in the forefront of provid- 
ing disease-free, selected clones of in vitro des- 
sert banana plantlets to the international market. 


21.3.7 Government Initiatives 
in India 


Like in any other trade, quality assurance and 
suppliers’ guarantee are sought by the buyer of 
micropropagated plants. One of the chief con- 
straints in TC product exports is lack of the con- 
cept of quality. To begin with, selection of 
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‘mother stock’ requires great care. Genetic insta- 
bility should be accorded high importance while 
selecting a particular genotype. Thus, quality 
control of the raw material becomes paramount. 

The Department of Biotechnology (DBT), 
Ministry of Science and Technology (GOD), 
New Delhi, India, set up a Consortium on 
Micropropagation Research and Technology 
Development. To prevent risks, it suggests 
effective testing (indexing) procedures prior to 
bulking up of cultures and adoption of standard 
guidelines as follows: 

e Careful selection of mother plants 

e Ensure establishment of virus-free culture 
through indexing of 100 % plants 

e Proper package and practices to be followed 
such as limiting the number of subculture 
cycles (multiplication cycles), grading cul- 
tures as well as plantlets, insect/pest monitor- 
ing in the hardening area, etc. 

Based on the needs of the TC industry, DBT 
has set up national facilities for virus diagnosis 
and quality control of TC plants at many institu- 
tions in the country. All these measures have 
contributed immensely to the TC industry. The 
crop-wise share of micropropagated plants in 
India pegs the sales of TC banana at 45 % of the 
total volume (i.e. quantity in terms of numbers), 
with crop-wise estimated sales value of TC 
banana in the domestic market at 52 %. A num- 
ber of progressive farmers and nurserymen in 
Andhra Pradesh, Maharashtra, West Bengal, 
Karnataka, Tamil Nadu, etc. are major consum- 
ers (customers) of TC plants. Banana is a prior- 
ity TC crop for several state agriculture 
departments (SADs) and for Kerala AEZ (agri- 
economic zone). 

In India, several plant tissue culture laborato- 
ries and commercial facilities have been set up 
recently, and they have been generating a large 
number of tissue culture-raised plants of com- 
mercial crops. However, the country lacks orga- 
nized testing of the quality of regenerants besides 
their freedom from viruses. The most deleterious 
variants in tissue culture-raised plants are those 
that affect yield and quality and carry infection of 
viruses difficult to diagnose. To bridge the gap, 
the Department of Biotechnology, Govt. of India, 
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has established recognized test centeres for test- 
ing tissue culture-raised plants for quality and 
freedom from viruses. 
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Abstract 

Biotechnology has brought about a revolution in the way that plant genetic 
resources can be utilized. Clonal crops cover a wide range of species from 
the root and tuber crops, such as potato, cassava, yam, taro and sweet 
potato, to fruits, such as apple, pear, citrus, banana and the cooking banana 
(plantain). Other miscellaneous crops including vanilla, ginger, turmeric, 
hops and sugarcane are also clonally propagated. In some of these cases, 
seed production is impossible due to sterility. In others, it is undesirable to 
produce seeds for conservation as this would break up highly heterozy- 
gous clonal genotypes. The foundation technologies that make up an 
in vitro conservation system are collection, disease eradication and index- 
ing, culture initiation, multiplication, storage and distribution. There are 
two basic options for in vitro storage, slow growth for the short to medium 
term and cryopreservation for the long term. (Since the definition of stor- 
age time-span and the concepts of active or base storage derive strictly 
from motivation rather than methodology, there is no reason why, in fact, 
cryopreservation should not have applications in short- to medium-term 
conservation.) Intensive conservation efforts are needed for clonally prop- 
agated crops, constituting about 1,000 species, and for difficult-to-store 
seeds, constituting about 88,250 species throughout the world. In vitro 
approaches, including tissue culture maintenance and cryopreservation, 
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are recognized as useful tools for medium- to long-term conservation of 
these groups of species. The in vitro techniques for conserving plant bio- 
diversity include shoot apical or axillary-meristem-based micropropaga- 
tion, somatic embryogenesis, cell culture technologies and embryo rescue 
techniques, and a range of in vitro cold storage will be discussed in this 


chapter. 


Keywords 


In vitro conservation * Clonal ¢ Slow growth ¢ Tissue culture 


22.1 Introduction 

Seed storage is a preferred method of conserva- 
tion, but it is not feasible for germplasm from 
crops that are either clonally propagated or that 
do not produce seeds. For some genotypes, elite 
genetic combinations are only preserved through 
clonal means as their conservation is dictated by 
breeding strategy; this is because heterozygosity 
does not permit the maintenance of desired char- 
acteristics. Clonally propagated plants thus 
require special conservation approaches. Options 
include maintenance in field gene banks and the 
conservation in cold stores of dormant vegetative 
propagules (Reed 2001); however, these methods 
have limitations regarding efficiency, costs, secu- 
rity and long-term maintenance. In vitro conser- 
vation is preferentially applied to clonal crop 
germplasm as it also supports safe germplasm 
transfers under regulated phytosanitary control 
(IBPGR 1988). 

Conservation in IVGBs combines tissue cul- 
ture and cryopreservation for medium-term 
(MTS) and long-term (LTS) storage, respectively. 
For MTS, subculture intervals are extended, 
reducing processing costs by arresting growth 
using reduced temperature treatments and/or 
growth retardants. For LTS, germplasm (usually 
shoot tip meristems) from in vitro propagated 
plants is cryobanked for long-term storage in liq- 
uid nitrogen (LN) to a minimal temperature of 
—196 °C in the liquid phase. 


22.1.1 Need for In Vitro Conservation 


There are a number of crops which are normally 
propagated vegetatively, such as potato, sweet 
potato, yams, cassava, several fruit tree species 
and many others. In this category, the clonal 
material carries variable gene combinations 
which have been maintained by the avoidance of 
sexual reproduction. When these clones are 
maintained in field gene banks, the traditional 
procedures tend to be expensive due to (1) high 
labour costs, (2) vulnerability to environmental 
hazards and (3) requirement for large amount of 
space. An even more serious problem is the vul- 
nerability of such clones to pests and pathogens 
or natural disasters to which they are almost con- 
tinuously exposed. This can lead to sudden loss 
of valuable germplasm or accumulation of sys- 
temic pathogens, especially viruses. In such 
cases, in vitro conservation is complementary to 
field gene banks, seed gene banks and pollen/ 
DNA preservation which along with in situ con- 
servation measures provide an integrated conser- 
vation strategy. In vitro gene banks, where plant 
material is stored in nutrient medium under artifi- 
cial conditions, are being increasingly used as 
alternatives to conserve vegetatively propagated 
species and threatened plant species (Fay 1994; 
Bhat et al. 1995; Sharma and Chandel 1996). 
Much of the world’s germplasm is currently 
maintained as breeders’ collections in gene 
banks, plantations, orchards or even in evolution 
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gardens primarily raised from seeds, such as rub- 
ber and coconut, and vegetatively propagated 
plant species such as citrus, cocoa, banana and 
many other fruits. These also include clonal col- 
lections of important staple food crops, such as 
cassava, sweet potato and yams, and aroids such 
as Colocasia and Xanthosoma. These field gene 
banks do not represent the entire range of genetic 
variability within the respective crop gene pool, 
and most of them represent only a fraction of the 
variability which should be conserved (Withers 
and Williams 1985). The IBPGR programme ini- 
tiated a move to include fruits, vegetables and 
forages. Any strategy for collection and conser- 
vation of samples of crops that are normally 
propagated vegetatively or that produce seeds 
which cannot be stored using normal procedure 
of storage may require alternative methods. This 
led to the consideration of in vitro techniques and 
cryopreservation of seeds for germplasm conser- 
vation (Withers and Alderson 1986). Problems of 
in vitro storage of such material, when solved, 
should also relate to cycling of the material 
through multiplication schemes, distribution of 
germplasm and also its characterization and eval- 
uation. Hence, the development of the full poten- 
tial of in vitro culture storage and associated 
biochemical techniques could revolutionize the 
handling of germplasm. 

A range of in vitro techniques have been 
developed in the last few decades. The organized 
culture systems have a high degree of genetic sta- 
bility and are more likely to be of importance for 
germplasm storage, especially the ‘shoot tips’ or 
meristem cultures. In vitro techniques are 
employed to eliminate diseases and _ pests. 
However, some viroids and viruses particularly 
are not necessarily eliminated or even detected 
and can readily multiply in tissue culture. These 
can be eliminated by meristem or shoot tip cul- 
tures possibly in combination with both heat and 
cold therapy. The International Potato Research 
Center (CIP) maintains the pathogen-tested 
potato germplasm in the form of in vitro plantlets 
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or tuberlets. Potato germplasm is being preserved 

at CIP through in vitro techniques using meri- 

stem and shoot tip culture for international 
exchange of germplasm. 

With in vitro techniques, it is now possible to 
provide a germplasm storage procedure which 
uniquely combines the possibilities of disease 
elimination and rapid clonal propagation 
(Henshaw and Grout 1977). Further, the virus- 
tested cultures could provide ideal material for 
international exchange and distribution of germ- 
plasm as they will be acceptable to plant quaran- 
tine authorities (Paroda et al. 1987) and comply 
with international quarantine regulations. 

In vitro conservation strategy offers an appro- 
priate alternative and would be discussed in detail 
in the present chapter. With in vitro techniques, it 
is now possible to provide a germplasm storage 
procedure which uniquely combines the possi- 
bilities of disease elimination and rapid clonal 
propagation (Henshaw and Grout 1977). 
However, field gene banks have the potential risk 
of germplasm being lost due to disease, stress or 
disaster and are labour intensive. Cryogenic pres- 
ervation of seeds or vegetative material is another 
potential mode of ex situ conservation which is 
still at experimental stages. 

Various aspects of in vitro conservation and 
cryopreservation were reviewed by Normah et al. 
(1996), Ashmore (1997), Engelmann and Takagi 
(2000), Reed et al. (2004), Sarasan et al. (2006) 
and Krishnan et al. (2011). Research require- 
ments identified by Reed et al. (2004) were: 

1. Germplasm health: virus surveys, indexing 
techniques, development of effective virus 
testing in vitro and whether viruses can be 
transmitted in vitro and development of 
indexing techniques for latent endogenous 
bacteria 

2. Slow growth: research into the effects of plant 
growth regulators and growth retardants, light 
and light-temperature interactions, propagule 
type, size, growth stage (microtubers, bulbs, 
rooted plantlets, unrooted shoots), statistical 
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rigour in experimental design and minimizing 
the use of growth retardants 

3. Cryopreservation: widening its applicability 
to more crops and genotypes, methods devel- 
oped for several localities, and use of 
cryotherapy 

4. Genetic stability: selection pressure of in vitro 
maintenance, genetic variation in field com- 
pared to in vitro, field evaluations on material 
with known instabilities and development of 
markers to monitor genetic stability 


In Vitro Conservation: 
An Overview 


22.2 


In vitro storage of germplasm first reported two 
decades ago (Henshaw 1975) offers promise for 
conservation of threatened species of known and/ 
or potential medicinal and aromatic value and for 
species clonally propagated. The material for 
such species could be available once the true 
potential of the species is realized. The funda- 
mental objectives of in vitro conservation 
technology are the maintenance and exchange of 
germplasm in disease-free and genetically stable 
state through tissue culture. The essential prereq- 
uisites for an in vitro conservation programme 
are: 

1. Creation of special facilities including tissue 
culture facility, green-/glasshouse facilities, 
storage facility, computer facility and facility 
for monitoring genetic stability 

2. Presence of trained scientists and technicians 

3. Linkage with farmers’ fields 
Information on the in vitro multiplication and/ 

or conservation of the plant species is also desir- 

able. Many laboratories and institutes in India 
have been engaged mainly in developing proto- 
col for micropropagation of various threatened 
endemic species. The Department of 

Biotechnology established India’s first national 

facility of plant tissue culture repository in 1986 

at National Bureau of Plant Genetic Resources 

(NBPGR), New Delhi, which has made con- 

certed efforts towards developing in vitro tech- 

nology for conservation of several vegetatively 
propagated agri-horticultural and several threat- 
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ened/rare species, especially of medicinal and 
aromatic value. The number of species being 
worked upon has increased appreciably with the 
initiation of DBT funded G-15 project operative 
at three centres, namely, NBPGR, Central 
Institute of Medicinal and Aromatic Plants 


(CIMAP), Lucknow, and Tropical Botanic 
Garden and Research Institute (TBGRI), 
Thiruvananthapuram. 


Any in vitro conservation programme mainly 
comprises of two stages: 
1. In vitro multiplication to build up a large num- 

ber of plants 
2. In vitro storage/preservation 

The cultures may be conserved for either 
short, medium or long term, depending on the 
requirement as well as the technique applied and 
infrastructure availability. For short-term mainte- 
nance of cultures, regular subculture (4-8 weeks’ 
interval) may suffice. To conserve cultures for a 
longer period of time, two strategies normally 
adopted include slow growth and cryopreserva- 
tion. The use of artificial seeds in combination 
with the above two is a more recent approach in 
the conservation programmes. 


22.3 Type of Culture Systems 


In vitro techniques rely on the concept of ‘toti- 
potency’ of plant cells. Cultures could be initi- 
ated from two types of explants: first, explants 
that retain developmental integrity such as meri- 
stem shoot tips and axillary buds and, second, 
explants that differentiates to a more or less 
organized state such as somatic embryos and 
adventitious buds through a disorganized callus 
phase. One of the important requirements of 
in vitro conservation is to get high-frequency 
regeneration of plantlets from organized 
explants such as meristem/shoot tips, embryos, 
embryonic axes and plantlets as they offer the 
lowest frequency of genetic variation during 
conservation (Karp 1989). In contrast, callus, 
cell suspension and protoplast culture are pre- 
ferred systems only when endowed with special 
attributes or required for biotechnological 
applications. 
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Germplasm of threatened plants is collected 
from difficult areas and may be available in the 
form of either seeds or cuttings or vegetative 
propagules such as bulb, corm or tubers in lim- 
ited number. In such cases, the ‘less preferred’ 
culture system may be the only choice for 
conservation. 


22.4 InVitro Propagation 

Development of efficient plant regeneration pro- 
tocols for clonally propagated species and threat- 
ened plants of medicinal and aromatic value is a 
recent phenomenon. The methods for microprop- 
agation include stimulation of axillary bud, pro- 
liferation from shoot tip and nodal explants, 
induction of somatic embryogenesis using 
explants from juvenile or mature plants depend- 
ing on the availability of material and inducing 
adventitious bud directly from explants or 
through intervening callus. Protocol for rapid 
multiplication involves five stages: 

Section and preparation of stock plants 
Establishment of aseptic cultures 
Multiplication of propagules 

Preparation for re-establishment in soil 
Transfer to greenhouse and acclimatization 
Since multiplication is carried out under artifi- 
cial conditions on a nutrient medium, plants can 
be produced round the year if photoperiod and 
temperature are properly maintained. As plants 
are produced under aseptic conditions, they are 
free from pest and pathogen. Even virus can be 
eliminated by meristem culture technique, that is, 
regeneration of plants from 0.1 to 0.2 mm shoot 
meristem. 

Various techniques have been used for micro- 
propagation of plants. From conservation, the 
most useful is the propagation from existing mer- 
istem as by this method plants with desired traits 
are obtained. Micropropagation protocols have 
been developed in an increasingly large number 
of species. However, certain species are recalci- 
trant to tissue culture (Coptis), and this is a major 
obstacle in using tissue culture for germplasm 
conservation. The rate of shoot multiplication 
varies from 3.5-fold per 3 weeks in Saussurea 
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lappa (Arora and Bhojwani 1989; Bhojwani et al. 
1989) to as high as 150 shoots every 4 months in 
Coleus (Sen and Sharma 1991). High multiplica- 
tion rate has major advantages for raising plans 
for nurseries and commercial plantings; however, 
for conservation programmes, very high multipli- 
cation rate is not desirable. The mode of regen- 
eration has been either direct or through callus, in 
the form of shoots or somatic embryos. It is evi- 
dent that in most of the cases, the propagation is 
through axillary branching. 

It is worthwhile to mention here that tissue 
culture methods also provide potential means of 
multiplying threatened species and clonally prop- 
agated with possible reintroduction into their 
original habitats for throated species. However, it 
is too early to predict the survival and fate of rein- 
troduced tissue culture-raised endangered plants 
in their natural/native habitats. Sustainable utili- 
zation of this germplasm depends on the develop- 
ment of appropriate in vitro conservation 
procedure to ensure its availability for future 
utilization. 


22.5 Conservation of Tissues 


Six major steps defined in the conservation use 
cycle are collection, quarantine, propagation, 
characterization, evaluation, monitoring, storage 
and distribution. The role of in vitro conservation 
techniques in the overall conservation strategies 
should be indicative of the fact that it should 
complement other conservation strategies within 
the total programme of a given species or popula- 
tion. The methods chosen should be carefully 
considered taking into account the feasibility, 
practicality, economy and security. 

Generally, field conservation of plants requires 
more space and is labour intensive and expensive. 
They also run the risk of being damaged by natural 
calamities and biotic stress factors. Techniques to 
conserve such species in vitro have recently been 
developed. For some species, while in situ conser- 
vation is the only option available, tissue culture 
systems offer advantages, which are listed below: 
1. Very high multiplication rates 
2. Aseptic system 
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— Free from fungi, bacteria, viruses and 
insect pests 
— Production of pathogen-free stocks 

3. Reduction of space requirements 

4. Genetic erosion reduced to zero under optimal 
storage conditions 

5. Reduction of the expenses in labour costs 
In vitro collections of species could be main- 

tained at the same or separate site, but should 

have clear linkages with field gene banks. The 
properties required for a successful in vitro con- 
servation system as defined by Grout are: 

The ability of the biological system to: 

1. Minimize growth and development in vitro 

2. Maintain viability of stored material at the 
highest possible level along with minimum 
risk of genetic stability 

3. Maintain full developmental and functional 
potential of the stored material when it is 
returned to physiological temperatures 

4. Make significant savings in labour input, 
materials and commitment of specialized 
facilities 
Some of the advantages favouring this 

conservation strategy are: 

1. Collection may occur at anytime, independent 
of flowering periods for each species. 

2. There is potential for virus elimination from 
contaminated tissue through meristem 
culture. 

3. Clonal material may be produced. 

4. Rapid multiplication. 

5. Germination of ‘difficult? immature seed/ 
embryo rescue may be facilitated for 
breeding. 

6. Distribution across borders may be safer. 
Issues of concern that could limit potential 

application of in vitro techniques for conserva- 

tion of plants include: 

1. The whole programme can be initially expen- 
sive, but with low recurring cost. Technology 
inputs as adopted by the developed nations 
will have to play a major role for successful 
implementation of this conservation strategy. 

2. In vitro storage techniques, particularly cryo- 
preservation procedures, are not yet well opti- 
mized for routine application across a wide 
range of species or genotypes. Although cryo- 
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preservation techniques are currently being 
tested and protocols optimized for gene pool 
components of several plant species, the rate 
of success is limited to only a few, where the 
conditions need to be carefully monitored to 
ensure viability, minimize genetic damage 
and prevent contamination by diseases and 
pests. 

3. Somaclonal variation can be a major limita- 
tion among tissue culture regenerated plants; 
some of the methodological basis for variation 
is explant source, age of culture, hormone 
used, genotype, ploidy status, etc. 

4. The problems of genetic stability manifested 
by loss of cellular integrity among most tissue 
culture systems pose a major obstacle in using 
this technique as a conservation strategy. 
Variation can be observed at different levels, 
such as morphological, karyotypic or bio- 
chemical. The use of axillary or apical meri- 
stem for micropropagation reduces the 
probability of genetic variation among plant 
tissue culture systems. The genetic stability at 
all stages of an in vitro conservation pro- 
gramme should be monitored. No _ well- 
defined techniques are _ available for 
conservation of endangered medicinal plants. 
Therefore, special attention is required in this 
regard, since it is a question of species extinc- 
tion, and it is essential to retain the quality and 
quantities of secondary metabolites contained 
in the species 

5. There is a need to establish the basic tissue 
culture competence of the plant species in 
question; difficulties can be encountered dur- 
ing culture initiation, micropropagation, root- 
ing and establishment of plants extra vitrum. 
All these stages for any given medicinal plant 
species must be optimized. Some species 
show recalcitrance in culture system for which 
may require special attention. 


22.6 InVitro Collection 

In vitro collection involves initial disinfesta- 
tions and placement of plant explants in sterile 
culture medium, before transport to a tissue 
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culture laboratory for further in vitro proce- 
dures. In vitro collection is particularly useful 
for species that are vegetatively propagated and 
for those with recalcitrant seeds or embryos, 
which deteriorate rapidly. The technique has 
much potential to facilitate the collection of 
germplasm of tropical and subtropical fruit spe- 
cies, as has already been demonstrated with cas- 
sava and coconut. Recently, 300 Musa 
accessions were collected in Papua New Guinea 
using this technique; before being transported to 
a collection in Australia, an added advantage of 
this exercise 1s that it complied with quarantine 
regulations that are in place to stop the spread of 
Fusarium and other diseases. 


22.7  InVitro Storage 

As with short-term storage, there have been 
very few attempts to apply cryopreservation 
techniques to tropical and subtropical fruit spe- 
cies, with the exception of Musa spp. (Panis 
1995) and Citrus spp. (Pérez-Molphe-Balch and 
Ochoa-Alejo 1997). Withers (1992), in a review 
article, reported successful cryopreservation of 
the recalcitrant tropical species, Theobroma 
cacao (cocoa), Artocarpus heterophyllus (jack- 
fruit), Cocos nucifera (coconut) and Nephelium 
lappaceum (rambutan), but provided no details. 
Very low survival rates have been reported when 
excised embryos from seeds of jackfruit, rambu- 
tan and coconut were cryopreserved (Chin 







Germplasm in growing state 


Maintenance under normal growth conditions 


Technical approaches to in vitro storage 
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1988). No survival was achieved when excised 
embryos from partially dehydrated seeds of 
rambutan, durian and cempedak (Artocarpus 
integer) were cryopreserved (Hor et al. 1990). 
However, before cryopreservation can be uni- 
versally applied to woody perennial fruit spe- 
cies, there is still much research, development 
and field testing that need to be done. For exam- 
ple, the issue of genetic stability is rarely men- 
tioned. Because growth is suspended, the 
potential to store material for long periods with- 
out genetic variation is assumed. However, any 
system based on cell suspension or callus 
(including embryogenesis) is prone to soma- 
clonal variation and should be _ field-tested 
before being accepted unreservedly. Field test- 
ing of tropical and subtropical fruits should be 
continued through to the fruiting stage, as fruit 
production is the primary reason for their col- 
lection and use. Unfortunately, this requires 
long-term projects for many _ species. 
Nevertheless, a research effort into cryopreser- 
vation of tropical and subtropical fruit species 
should be encouraged because of its potential 
for long-term preservation of germplasm. At the 
current rate of development, it is reasonable to 
assume that routine protocols for cryopreserva- 
tion and subsequent regeneration of explants 
will eventually become available for most plant 
species. However, protocols must be repeatable 
and result in high percentages of preserved tis- 
sue being viable after thawing, before they can 
be used routinely for storage of germplasm. 


Germplasm under suspended growth 


f 


Maintenance under growth limitation 


Cryopreservation 
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In Vitro Conservation: 
Strategies 


22.8 


Once cultures have been established and multi- 
plied in sufficient number, an effective method 
for conservation is required. Conservation can 
partly be achieved by regular subculture on fresh 
media. However, it may not be practical due to 
the danger of microbial contamination and equip- 
ment failure and may be uneconomical in terms 
of labour, physical resources and time require- 
ment. Additionally a few systems may have con- 
straints such as loss of morphogenic capacity and 
occurrence of somaclonal variation. 

The main aim of in vitro conservation pro- 
grammes is to reduce frequent demand for sub- 
culture, which can be accomplished in two ways: 
by maintaining cultures under normal growth 
(SCC) or by subjecting them to growth limiting 
strategies (for detailed reviews see Grout 1995). 
The latter includes slow growth and suspended 
growth (cryopreservation). Normal growing cul- 
tures along with those in slow growth comprise 
the active collections whereas those cryopre- 
served constitute the in vitro base collection. 

The expectations are high about tissue culture 
methods providing sound strategy for both clonal 
propagation and medium-term storage. Literature 
survey revealed that till date there is very limited 
documented information on in vitro conservation. 


22.8.1 Normal Growth 


It is possible to maintain cultures virtually indefi- 
nitely under normal growth conditions provided 
nutrients are supplied and accidents avoided. 
This method is preferred for inherently slow- 
growing, stable systems and for cultures for 
which there is no other method though it is labo- 
rious and abounds with risks of genetic altera- 
tions with time, contamination or loss through 
human errors; in specific cases such as tropical 
germplasm, it can be useful because of the fol- 
lowing advantages: 
1. It minimizes requirement of low temperature 
facility (particularly for developing countries’ 
saves inputs). 
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2. It promises avoidance of stress-induced 
variability. 
3. It helps in getting cultures for instant multipli- 


cation and exchange. 


22.8.2 Slow Growth 


The aim of this method is to reduce requirement 
for subculture without causing any damage to the 
tissue. It 1s the most direct way of restricting 
growth and development of in vitro materials and 
is usually applied to differentiated plantlets or 
shoot cultures. Slow growth involves one or a 
combination of the following techniques: 
Type of enclosure 
Temperature and/or light reduction 
Use of minimal media and osmotic 
Use of growth retardants 
Other approaches: 

(i) Reduction of oxygen pressure 

(ii) Mineral oil overlay 
(iii) Encapsulation 
(iv) Desiccation 
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22.8.2.1 Type of Enclosure 

Type of enclosure seems to have direct influence 
on subculture requirement of growing cultures. 
One of the simplest and cost-effective approaches 
for slowing growth rate of cultures has been 
replacement of the commonly used cotton plugs 
with polypropylene caps as culture tube enclo- 
sures (Balachandran et al. 1990; Sharma and 
Chandel 1992). The increase in storage time is 
attributed to the reduction in evaporation of water 
from the medium in culture tubes. 

Shoot cultures of Coleus forskohlii, Rauvolfia 
serpentina and Tylophora indica have been con- 
served for 12—20 months at 25 °C without requir- 
ing any intermittent subculture (Chandel and 
Sharma 1992; Sharma and Chandel 1992; Sharma 
and Chandel 1996). In Allium tuberosum and 
Dioscorea too, the shelf life of shoot cultures was 
extended for up to 9 months at 25 °C. Encouraging 
results have been obtained in other species also in 
our laboratory. This technique seems to work well 
with species belonging to subtropical or tropical 
region probably due to their inherent property of 
growing at higher temperature. In most of the 
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species of temperate region, optimum subculture 
period could be extended only to 5 months. The 
added advantage of this approach is that mostly 
the cultures can be visibly assessed for viability 
and can readily be brought back to fresh culture 
medium to produce plants on demand. 

Excised roots to Rauvolfia serpentina could 
be conserved for 16 years, and these retained the 
regenerative capacity to produce plants 
(Chaturvedi et al. 1991). This approach needs to 
be tested for other important threatened species. 


22.8.2.2 Reduction in Temperature 
and/or Light 

The most commonly used method for reducing 
growth of tissues in cultures is by decreasing the 
temperature in which cultures are maintained. 
This may also be accompanied with reduction of 
light intensity. The basic principle in this method 
is that incubation at temperature lower than that 
required for optimum growth would reduce or 
decrease the metabolic activities, thereby restrict- 
ing the growth of the plants. However, care must 
be taken to avoid temperature below freezing or 
where chilling injury could occur (Lyons et al. 
1979). In most cases, the storage temperature is 
species specific. Usually, the most suitable stor- 
age temperature for temperate species 1s between 
1 and 10 °C, whereas tropical species are stored 
in the range of 10—22 °C. The light intensity can 
be reduced by 60 % from standard requirement, 
1.e. 1,000 1x and 16th photoperiod. This method 
is very simple and easy to use and may be appli- 
cable to a wide range of genotypes. However, 
uninterrupted maintenance of reduced tempera- 
ture, required for the purpose, may be difficult 
and uneconomical particularly for tropical and 
developing countries. 

Low temperature incubation of in vitro cultures 
appears highly promising. This may also be 
accompanied by reduction in light intensity. 
Cultures are kept at ambient culture conditions for 
1 week and subjected to screening for detection 
and elimination of contamination prior to low tem- 
perature incubation. Cultures exhibit reduction in 
growth when transferred to low temperature. 

Shoot culture/nodal cultures of Rauvolfia ser- 
pentina have been successfully conserved for over 
15 months at 15 °C, while 10 °C and 5 °C were 
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found deleterious to growth of cultures (Sharma 
and Chandel 1992). These in vitro conserved cul- 
tures showed shoot multiplication comparable to 
the cultures maintained at 25 °C. The plantlets 
obtained from these conserved cultures could be 
established in soil and no morphological differ- 
ences were observed in these plants. 

Tylophora indica have been successfully con- 
served from more than 12 months tested so far at 
10 °C (Sharma and Chandel 1996). These in vitro 
conserved cultures exhibited normal growth and 
multiplication on rejuvenation. 

Reduction of light along with temperature has 
extended shelf life of cultures in Saussurea lappa. 
According to Arora and Bhojwani (1989), shoot 
cultures of Saussurea could be successfully 
stored at 4 °C in the dark for 12 months, with 
100 % viability. Shoot tip cultures of the same 
species have been conserved for more than 
15 months at 4 °C in the dark (Sharma et al. 
1995). Further experiments have indicated that 
cultures can be maintained for 22 months at 
10 °C under reduced light. However, the result 
has to be repeated for confirmation. An interest- 
ing observation has been that some of the cul- 
tures that were subjected to reduced temperature/ 
light regime exhibited a better rate of multiplica- 
tion following storage as compared to cultures 
maintained at normal temperatures and light. 

In Picrorhiza kurroa, shoot cultures have 
been successfully conserved for 6 months at 
25 °C and over 9 months at 10 °C. Shoot cul- 
tures of this species are reported to have been 
stored for 10 months at 5 °C in the dark, with 
70 % survival. Cold-stored shoots, on transfer to 
25 °C, multiplied at rates comparable to cultures 
maintained under normal conditions (Upadhyay 
1989). 

Embryogenic calli of Podophyllum hexan- 
drum cold stored for 7 months at 5 °C exhibited 
95 % viability and differentiated globular 
embryos (Bhojwani et al. 1989). However, simi- 
lar studies carried out in our laboratories indi- 
cated that cold-stored calli failed to regenerate 
beyond the globular stage. 

Gentiana kurroo, another threatened medici- 
nally important species, has been conserved for 
11 months at 4 °C (Sharma et al. 1993; Sharma 
and Chandel 1996). In Allium tuberosum, shoot 
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cultures stored at 10 °C survived for 18 months 
without subculture. 


22.8.2.3 Minimal Media/Osmotica 

Inclusion of osmotica with or without low tem- 
perature incubation has also proved to be an 
important technique to prolong subculture 
period. Change of carbon source may have a 
very remarkable effect on growth rate. Use of 
either increased or decreased doses of sucrose 
yielded interesting results in some species. 
Inclusion of non-metabolizable, inert sugar 
alcohol, particularly mannitol and sorbitol, in 
the range of 3-6 % (w/v) has been quite effec- 
tive in restricting the growth of many plant spe- 
cies. The major limitation of this method is that 
different genotypes may react differently under 
these conditions. However, when used in combi- 
nation with reduced temperature incubation, it 
becomes the most realistic and cost-effective 
method because the combinations of these two 
treatments show synergistic effects on in vitro 
conservation of many species. Inclusion of 
osmotica such as sucrose and mannitol, with or 
without low temperature incubation, has also 
proved to be an important technique to prolong 
subculture period. There is no documented 
information regarding its use in case of threat- 
ened plant species. Effect of addition of manni- 
tol in the medium to cultures of Rauvolfia, 
Picrorhiza, Gentiana and Saussurea has been 
studied in extending shelf life. Addition of man- 
nitol led to maintenance of these species for 
more than 9-12 months at 25 °C. Reduction of 
temperature further resulted in prolongation of 
shelf life of the same cultures up to 16 months. 
Experiments are underway to study the effect of 
mannitol and other osmotica, in other species 
too. Incorporation of high concentration of 
sucrose has also been beneficial in extending the 
storage time of cultures of Allium tuberosum. In 
Rauvolfia serpentina, half-strength medium and 
full-strength medium without hormones were 
used to test their effect on survival of cultures at 
25 °C (Sharma and Chandel 1992). None of 
these were effective in improving the shelf life 
of cultures in comparison to low temperature 
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storage, with survival rates of only 33-50 % 
after 6 months of storage. An important obser- 
vation made in author’s lab is that rooted cul- 
tures survive longer probably due to roots being 
capable of absorbing water and nutrients from 
the medium more efficiently than shoot cultures 
especially at later stages. 


22.8.2.4 Growth Retardants 

The basic principle of using these compounds is 

simply to reduce the overall growth rate of the 

in vitro plantlets and thereby enhance the subcul- 
ture interval. The choice of inhibitory growth 
regulator necessarily depends upon the circum- 
stances and species. Abscisic acid, N-dimethyl- 
aminosuccinamic acid, maleic hydrazide (MH), 
trans-cinnamic acid (TCA), chlorocholine chlo- 
ride (phosphon-D), daminozide (B 995) and 

Cycocel (CCC) at a concentration range 

2—50 mg/l have been reported to extend subcul- 

ture period for 6—12 months. 

This method is simple, efficient and cheaper. 
Using these chemicals, cultures can be stored at 
normal culture room, thereby eliminating the 
requirement of low temperature facility. However, 
its use may pose serious problems in the germ- 
plasm conservation: 

1. The plant may become stunted and show 
abnormal growth, thereby posing problems in 
regenerating normal plants. 

2. The presence of these retardants may lead to 
selection of lines with resistance or tolerance 
to growth retardants. 

3. Chances of induction of genetic alterations 
increase because some of the growth retar- 
dants have mutagenic properties. 


22.8.2.5 Other Approaches 

Growth rates of in vitro culture are influenced by 
composition and volume of atmosphere inside 
the culture vessel (Gould and Murashige 1985). 
Mineral oil overlay and reduction of oxygen 
pressure in the culture vessel may extend shelf 
life cultures from a few weeks to months. The 
main problem with the mineral oil overlay is the 
recovery growth of shoot tips. The limitation of 
modification of gaseous environment is the diffi- 
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culty of maintaining gas atmosphere of individ- 
ual culture vessel. However, the use of gas 
permeable culture vessel such as star packs (Reed 
and Abdelnour 1991) offers the possibility of 
controlling the gas atmosphere of the entire 
incubation chamber which may be more practi- 
cal. This technique therefore needs further inves- 
tigation. Storage of encapsulated buds and 
somatic embryos at room temperature for 
8 months when water content was reduced down 
to 15 %. Though the result of these methods is 
encouraging and may be useful in specific cases, 
the methods need further standardization for their 
acceptability and wider use. Thus, these may be 
viewed as experimental techniques with future 
application. It is worthwhile to mention here that 
the technique of applying slow growth to reduce 
subculture frequency is not without drawbacks, 
mainly of somaclonal variation. Moreover, 
reduction of temperature has to be used with 
caution for tropical species which require higher 
temperatures for growth. Besides cost-effective- 
ness of low temperature storage, maintenance of 
constant low temperature for desired long peri- 
ods may pose a practical constraint, especially in 
subtropical/tropical regions. 


22.9 Principles of Medium-Term 
Storage 


Standard culture conditions can only be used for 
medium-term storage of slow-growing species. 
However, in most cases, environmental condi- 
tions and culture medium have to be modified to 
induce growth reduction. This is most frequently 
achieved by temperature reduction, often in 
association with a decrease in light intensity or 
even its complete suppression. Temperatures in 
the range of 0-5 °C are employed with cold-tol- 
erant species, but higher temperatures are 
required in the case of tropical species, which 
are generally cold sensitive. It is also possible to 
limit growth by modifying the culture medium, 
mainly by reducing the sugar and/or mineral ele- 
ment concentration. Finally, the type of culture 
vessel, its volume as well as the type of enclo- 
sure influence survival of cultures. Notably, it is 
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important to limit the evaporation of the culture 
medium by using a type of enclosure, which is 
tight enough. 

Medium-term conservation techniques have 
been developed for a wide range of plant species, 
but they are used routinely for the conservation of 
only a limited number of species such as Musa, 
potato or cassava (Engelmann 1997). 

New medium-term conservation techniques 
include reduction of the oxygen level available to 
cultures (achieved by covering explants with a 
layer of liquid medium or mineral oil or by plac- 
ing them in controlled atmosphere), desiccation 
and encapsulation of explants in alginate beads 
(Engelmann 1997). However, these techniques 
are still at the experimental stage. 

The objective of slow growth (or minimal 
growth) is to reduce subculture intervals to a crit- 
ical level that does not impose a long-term delete- 
rious effect on germplasm or put at risk the 
stability of regenerated/regrown plants. However, 
slow growth treatments incur some level of stress, 
and it is essential to optimize MTS with respect 
to the timing of subculture regimes and regenera- 
tion. When this is achieved, slow growth is a suc- 
cessful method of securing plant germplasm in 
MTS (Cha-um and Kirdmanne 2007). Minimal 
growth storage is useful for genotypes that can- 
not be cryopreserved, and it is a key part of 
in vitro gene banks that clonally propagate crops 
for distribution services. Several MTS treatments 
are applied, either singularly or in combination to 
retard growth: 

e Physical growth limitation: 

— Low temperature 

— Low light/restricted photoperiod 

— Minimal containment 

— Minimal O, 

— Osmotic (water) stress 
¢ Chemical growth limitation: 

— Growth regulator retardation 

— Growth inhibitors 
e Nutrient limitation: 

— Low macronutrient levels 

— Low micronutrients levels 

Ashmore (1997) listed key issues and actions 
for future consideration. 


428 


22.10 In Vitro Conservation: 
Practical Considerations 


The real utilization of any conservation 
programme lies in carefully selecting the species, 
designing work plan and sincerely executing it. 
The first and foremost and essential step, once 
the infrastructure is available, is prioritization of 
species. In deciding the priority, the main points 
to be considered include threat of genetic ero- 
sion, importance of the species and potential 
demand for distribution/exchange for commer- 
cial utilization, besides size of collection and cost 
of field maintenance. It is equally important to 
select a strategy after assessing its economy and 
safety. 
An ideal strategy should meet the following 
requirements: 
1. Wide and easy applicability of protocols and 
techniques over a range of species 
2. Economy (establishment and running cost) in 
operation/management 
3. Ensure high viability after storage 
. Low risk of genetic alteration 
5. Technically less demanding techniques 
The material needs to be well documented and 
conserved under safe condition and duplicated in 
at least one other location/country to minimize 
accidental loss and to ensure its availability. 
Before embarking on in vitro conservation, the 
potential approach should be balanced against 
other conservation strategies. Experience with 
in vitro maintenance is limited; thus, time and 
cost compliment other conservation strategies for 
the same crop species (seed, field gene banks, 
etc.) including in situ conservation. Issues of con- 
cern in potential application of in vitro techniques 
for germplasm conservation of medicinal and 
aromatic plants include: 
1. Genetic stability 
2. Basic tissue culture competence 
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22.11 Genetic Stability 


Maintenance of specific gene combination (geno- 
types) is vital for any germplasm conservation 
programme. The phenomenon of somaclonal 
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variation associated with certain culture systems 
iS perceived as major obstacle in using plant tis- 
sue culture for germplasm conservation. The 
instability has been documented to occur at the 
morphological, karyotypic and biochemical lev- 
els. The variations primarily result from the 
change in ploidy level, structural changes in 
chromosome morphology and mitotic aberra- 
tions. It has been reported that these changes may 
occur during culture, though genetic variation 
occurs spontaneously in nature also. Long preser- 
vation in culture media composition can also give 
rise to variations. Somaclonal variation can be 
enhanced, particularly where regeneration is 
adventitious. It has also been observed that cer- 
tain in vitro storage conditions can increase the 
risk of integrity because of directional change in 
response to selection. The use of axillary bud for 
multiplication reduces the probability of genetic 
variation. Therefore, formation of callus and 
adventitious shoots should be avoided (Roca 
et al. 1989). 

Monitoring of genetic stability is an impor- 
tant aspect of in vitro conservation. Technique 
of monitoring of stability will depend on the 
need, type and nature of plant species and its 
economic value. The characterization may be 
for morphological, cytological, histological 
and/or biochemical traits. Some of the tech- 
niques like field testing for morphological traits 
cannot be avoided. Other techniques have to be 
chosen judiciously depending on the crop, the 
economic product, the life cycle of the plant and 
the resources available. Use of sophisticated 
high-technology monitoring (molecular mark- 
ers/DNA polymorphism) has value when there 
is sufficient basis to assume instability, either 
intrinsic or due to the culture system and storage 
conditions. In the case of medicinal and aro- 
matic plants, however, it is more relevant to 
monitor the potentially useful metabolites of 
plant species. 

Greater understanding of the causes and 
nature of somaclonal variation (SCV) is needed, 
particularly after prolonged storage in culture 
and cryopreservation and in establishing safe 
storage procedures, developing improved mark- 
ers and methods for characterizing SCV and 
monitoring genetic instability. Encouragement of 
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research aimed at understanding causal factors of 
SCV and comparative assessments of genetic sta- 
bility in germplasm conserved in the field and 
IVGBs is recommended. 


22.11.1 Reproducible and Wider 
Use of Slow-Growth 
and Cryopreservation 


Methods 


Improved, reproducible and robust protocols 
required for slow growth and cryopreservation 
optimized across genotypes held in gene banks, 
greater provision for species that have received 
limited attention, application of in vitro methods 
for safe movement of germplasm and prioritizing 
problem species. 

Developing methods using simple facilities, 
with general application, optimized, and tested in 
IVGBs; more information on in vitro distribution 
and transportation of in vitro material. 


22.11.2 Management Issues 


Database of in vitro collections and their activi- 
ties and operations are required including routine 
methods and guidelines, particularly addressing 
what are acceptable ranges for survival and 
amount of material to be stored per accession for 
in vitro storage techniques. 


22.12 Performance Indicators 
for Slow Growth 


Performance indicators include (1) plant 
health, (2) extension of subculture interval, (3) 
contamination frequency and (4) capacity 
[viability, vigour, health status] to recover 
from stress treatments. Reed et al. (1998b, 
2003) use descriptive scales of 0 and 1-5 to 
rate the performance of in vitro Pyrus and 
AHumulus cultures maintained under minimal, 
low temperature growth: 

0 = all of the plantlet is brown and no visible 

indication of growth. 
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1 = very poor, questionable viability, brown, 
necrotic shoots, only extreme shoot visibly 
green, plantlet mostly brown. 

2 = poor, much browning, most shoot tips 
necrotic, shoot tip green, leaves and stems 
mostly brown, base may be brown. 

3 = fair, some browning, some shoot tips necrotic, 
shoot tips and upper leaves green, etiolation 
present, base green. 

4 = good, elongated shoots, shoot tips generally 
healthy, green leaves, stem and limited 
etiolation. 

5 = excellent condition, dark green leaves and 
shoots, no etiolation. 

A widely used method for conservation 1s 
seed storage. Several categories of crop present 
problems with regard to seed storage. At pres- 
ent, the most common method to conserve the 
genetic resources of these problem species is as 
whole plants in the field (field gene banks — 
FGBs). However, several serious problems such 
as labour-intensive field maintenance, shrink- 
ing land resources, etc. are associated with field 
gene banks. The conservation of rare and 
endangered plant species has also become an 
issue of concern. 

During the last 20 years, in vitro culture 
techniques have been extensively developed 
and applied for more than 1,000 different spe- 
cies (George 1996). Tissue culture techniques 
are of great use for the collection, multiplica- 
tion and storage of plant germplasm. Storage 
techniques, which allow to reduce the proce- 
dures and to preserve the genetic integrity of 
the plant material, are urgently needed. 
Different in vitro conservation methods are 
employed depending on the storage need 
(Engelmann 1991). For short- and medium- 
term storage, the aim 1s to reduce growth and to 
increase the interval between subcultures. For 
long-term storage, cryopreservation in liquid 
nitrogen is the only available method. 

Tissue culture techniques are of great interest 
for collection, multiplication and storage of 
plant germplasm (Engelmann 1991). Tissue cul- 
ture systems allow propagating plant material 
with high multiplication rates in an aseptic envi- 
ronment. Virus-free plants can be obtained 


430 


through meristem culture in combination with 
thermotherapy, thus ensuring the production of 
disease-free stocks and simplifying quarantine 
procedures for the international exchange of 
germplasm. The miniaturization of explants 
allows reducing space requirements and conse- 
quently labouring costs for the maintenance of 
germplasm collections. 

The methods employed vary according to the 
storage duration required (Engelmann 1991). 
For medium-term conservation, the aim is to 
reduce growth, thus increasing intervals between 
subcultures. For long-term conservation, cryo- 
preservation, i.e. conservation at ultralow tem- 
perature, usually that of liquid nitrogen 
(-196 °C), is the only method currently avail- 
able. Indeed, all metabolic processes as well as 
cell divisions are stopped at this temperature. 
Plant material can thus be conserved without 
modification or alteration for extended periods, 
under a reduced volume and with reduced 
maintenance. 


22.13 Stability: Optimal Storage 


In vitro gene bank practices need to ensure their 
biological resources maintain their special char- 
acteristics and are not changed because of stor- 
age and associated tissue culture practices. 
Genetic instability includes the risks of in vitro 
generated instability termed somaclonal varia- 
tion (SCV), defined by Scowcroft (1984) as heri- 
table genetic variability in plants generated 
through tissue culture. Genetic changes can also 
arise because of epigenetic processes, stress and 
selection pressure. The consequences of SCV 
are significant for genetic resource conservation 
as it is manifest in the regenerated plant; there- 
fore, reducing the potential risk of SCV is neces- 
sary. Scowcroft (1984) suggested that in vitro 
storage protocols should avoid practices that 
increase the risks of genetic variation occurring. 
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These are (1) avoiding germplasm propagation 
via dedifferentiated (callus) and adventitious 
routes, (2) limiting the use of plant growth regu- 
lators that increase the possibility of dedifferen- 
tiation and adventitious development and (3) 
selecting germplasm from young cultures as 
SCV increases and totipotency decreases during 
prolonged culture. 

Genetic instability arising from tissue culture 
is particularly significant for clonally propa- 
gated crops as compared to sexually propagated 
species in which chromosomal abnormalities are 
eliminated by gametogenesis and fertilization 
(Cassels and Curry 2001). Ideally, germplasm 
with a higher risk of manifesting genetic insta- 
bility should be monitored at the genetic level, 
as recessive genetic changes occurring during 
the tissue culture of asexually propagated spe- 
cies will have no phenotypic expression. In this 
context (Scowcroft 1984), clonal crops may 
thus be expected to display a potentially higher 
frequency of SCV than those propagated by 
seed; however, as off-types can arise in field- 
grown, clonally propagated plants, some varia- 
tion may be unrelated to culture conditions. In 
vitro conservation can help safeguard against 
the genetic changes that occur in field-propa- 
gated materials that have a predilection to pro- 
ducing off-types. 


22.13.1 General Status of In Vitro 
Conservation 
22.13.1.1 Guidelines 


Pence et al. (2002) described in vitro collecting 
techniques for Musa, Citrus, avocado, taro, coco- 
nut, tropical rainforest trees and wild and endan- 
gered species. Standards for in vitro culture and 
slow growth storage are developed by FAO. Reed 
et al. (2004) developed technical guidelines for 
management of field and in vitro germplasm 
collection (Tables 22.1 and 22.2). 
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Table 22.1 Examples of in vitro conservation in representative species of different categories of crops 


Species Storage regime 
Forest species 


Castanea sativa 8 °C and 23 °C 
(chestnut) 


Cedrela odorata 12°C, 20 °C, 
L., Guazuma 25 °C 

crinita Mart. and 

Jacaranda 

mimosaefolia 


Conditions 


Slow growth storage has been achieved 48 months 
for Castanea sativa (cv. ‘Montemarano’ ) 

shoot cultures over a duration of 

48 months at a temperature of 8 °C, 

where 82 % of explants survived and 

were able to resume normal growth after 

transfer to standard culture conditions at 


2a °C 


Shoot tips excised from in vitro plantlets 6 and 12 months 


were encapsulated in calcium alginate 
beads and stored on different substrates 
at 12, 20 and 25 °C. Percent viability 
when encapsulated shoot tips were stored 
on substrate containing only water 
solidified with | % (wt/vol) agar was 

80 % after 12 months at 12 °C for C. 
odorata, 90 % after 12 months at 25 °C 
for G. crinita and 70 % after 6 months at 
20 °C for J. mimosaefolia 


Storage period 


Reference 


Capuana and 
Di Lonardo 
(2013) 


Maruyama 
et al. (1997) 


Coffea arabica L. 1 °C Immature zygotic embryos of Coffea 2 years Naidu and 
(coffee zygotic arabica L. can be preserved with a little Sreenath 
embryos) loss of viability in the presence of ABA (1999) 

even at the normal room temperature (25 

+ 1 °C) up to 2 years without any transfer 
Drosophyllum 25 °C Double-node cuttings were maintained 4,8 and Gongalves 
lusitanicum (L.) for 4, 8 and 12 months at 5 or 25 °C in 12 months and Romano 
Link the dark. The cultures could effectively (2007) 

be conserved under minimum growth at 

5 °C for 8 months on Murashige and 

Skoog’s medium supplemented with 60 g 

dm-3 sucrose, 20 g dm-3 mannitol and 

0.91 PM zeatin 
Elettaria 2522 °C In vitro conservation by slow growth 18 months Tyagi et al. 
cardamomum method was achieved on 1/2 MS (major (2009) 
Maton. salts) +5 uM BAP +0.7 % agar 
(cardamom) (conservation medium); about 70 % of 

the cultures survived up to 18 months at 

25+2 °C. Some 96 % of the plants 

survived the hardening treatment and 

grew normally in a greenhouse 
Eucalyptus 24-28 °C Slow growth storage, for up to 10 months Watt et al. 
grandis 10 months, has been achieved for (2000) 


Eucalyptus grandis shoot cultures by 
either the addition of 10 mg I"! abscisic 
acid to the growth medium or by the 
halving of nutrient supply (half MS) and 
removal of exogenous plant growth 
regulators. Reduction of light intensity or 
addition of mannitol to the media was 
less effective in reducing growth rate. 
Isolated in vitro axillary buds 
encapsulated in calcium alginate and 
stored under low temperature and low 
light intensities survived for up to 

3 months without loss in viability 
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Table 22.1 (continued) 


Storage regime 
20 °C or 12 °C 


Species 
Jacaranda 


mimosaefolia 
D. Don. 


Medusagyne Ob iaal tad © 


oppositifolia 


Olea europaea L. 
(olive) 


Pyrus L. (pear) 


Quercus suber L. 1-5 °C 


(cork oak shoot) 


Ornamental crops 


Ipsea malabarica 2 °C 


Conditions 


When encapsulated shoot tips were 
stored on substrate containing only water 
solidified with | % (wt/vol), agar was 

70 % 


Long-term in vitro cultures of M. 
oppositifolia were slow growing and 
eventually lost the ability to multiply and 
root. Because of the lack of new seed and 
vegetative materials, efforts were made to 
rejuvenate the existing cultures 


Italian local cultivar showed a notable 
difficulty for the in vitro establishment 
due to heavy pathogen contamination. 
Mercury chloride and sodium 
hypochloride in the sterilization step and 
antibiotics in culture media allowed 
overcoming the problem. Proliferation of 
shoot apical bud on olive culture medium 
with 36 g dm? mannitol and 4.56 uM 
zeatin appeared very satisfactory 


Pear (Pyrus L.) germplasm may now be 
stored as seeds (species), dormant buds 
or pollen from field-grown trees or shoot 
tips from in vitro grown plants 
(cultivars). Pear germplasm may now be 
cryopreserved and stored for long periods 
(>100 year) utilizing slow freezing or 
vitrification of in vitro grown shoot tips. 
Dormant bud freezing, pollen and seed 
cryopreservation of other lines are being 
developed to complete the base collection 
for Pyrus 


Cultures were stored in vitro on 
multiplication medium at 5+ 1 °C 
without an intervening subculture for 
2 years 


Half-strength MS medium with 3 % 
sugar and 1.5 mg | kinetin in jam bottles 
facilitated storage of shoots up to 

14 months without subculture which 
developed 25 shoots. Medium facilitated 
storage up to 20 months. Half-strength 
MS growth regulator and sugar-free 
(photoautotrophic) medium were best for 
in vitro conservation, and the storage 
period on this medium was 27 months 


P.E. Rajasekharan and L. Sahijram 


Storage period Reference 

6 months Watt et al. 
(2000) 

6 months Marriott and 
Sarasan 
(2010) 

1 month Zacchini and 
De Agazio 
(2004) 

More than Reed et al. 

100 years (1998a) 

6, 12 or Romano and 

24 months Martins- 
Lougao 
(1999) 

20—27 months Martin and 
Pradeep 
(2003) 
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Table 22.1 (continued) 


Species 
Lilium L. (lily) 


Spiranthes 
brevilabris 
(Orchidaceae) 
Vegetable crops 
Cassava 
germplasm 


Cucumis sativus 
L. (Haploid 
cucumber) 


Potato 
microplants 


Storage regime 
—2 °C and 25 °C 


SCC 


24 °C—25 °C 


25°C 


Conditions 


In vitro regenerated bulblets of 10 lily 
(Lilium L.) genotypes (Asiatic hybrids, 
Oriental hybrids, L. longiflorum and 

L. henryi) were stored for 28 months at 
—2 °C and 25 °C on four different media: 
1/4 or full-strength Murashige and Skoog 
nutrients with 9 % (w/v) or 6 % sucrose. 
Sprout growth, bulb growth and viability 
were determined. The combination of 1/4 
strength MS nutrients and 9 % sucrose 
gave the highest reduction in sprout and 
bulb growth, the highest viability and the 
highest percentage of regrowth after 

28 months of storage. At 25 °C, all lily 
genotypes survived 28 months of storage 
under these conditions. At —2 °C, Asiatic 
and Oriental hybrids survived 28 months 
of storage, whereas genotypes of 

L. longiflorum and L. henryi survived 

6 months of storage, but died during 
prolonged storage 


It maintains a high degree of mycobiont 
specificity under in vitro symbiotic seed 
germination conditions 


Plant regrowth by encapsulated nodal 
cuttings and shoot tips was significantly 
affected by the duration of storage period 
as shoot recovery decreased from almost 
100—73.3 % for encapsulated nodal 
cuttings and 94.4—60 % for shoot tips 
after 28 days of storage 


During the first year of storage, between 
30 and 80 %, the clones were lost, due to 
disturbances in plant development, 
increased levels of endogenous bacteria 
and physiological changes resulting in 
continuous flowering. After 2 years of 
storage, haploids showed reduced vigour. 
Therefore, plants were regenerated 
directly from primordial leaf 
microexplants 


At 8 °C, culture in the presence of either 
acetylsalicylic acid or 4 % mannitol 
retarded microplant stem growth and 
required intervals between subcultures 
ranged from 8 to over 12 months, 
depending on the genotype. Prolonged 
culture on acetylsalicylic acid had no 
adverse effects on the yield of minitubers 
on plantlets transferred to glasshouse 
cultivation. At 18 °C also, intervals 
between subcultures could be prolonged 
to 5.5—6 months by culture on either 
mannitol or acetylsalicylic acid media 


Storage period 
28 months 


12 months 


16 h photoperiod 
for 3-5 weeks 


8 to over 
12 months 
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Reference 


Bonnier and 
Van Tuyl 
(1997) 


Stewart and 
Kane (2007) 


Danso and 
Ford-Lloyd 
(2003) 


Niemirowicz- 
Szczytt et al. 
(2000) 


Lopez- 
Delgado et al. 
(1998) 
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Table 22.1 (continued) 


Species 
Solanum 
tuberosum 
(potato) 


Solanum 
tuberosum L. 
(potato) 


Solanum 
tuberosum L. 
(potato 
germplasm) 


Solanum 
tuberosum 
(potato) 


Solanum 
stenotomum 
(potato) 


Fruit crops 
Apple genotypes 


Phoenix 
dactylifera (Date 
palm) 


Storage regime 
24/22 °C 


4°C and 10 °C 


6+1 °C 


SCC 


5 °C in the dark 


P.E. Rajasekharan and L. Sahijram 


Conditions 


Culture growers can successfully use 6—12 weeks 
their existing facilities, small refrigerators 

and coolers with low light intensity, set at 

4 °C, for short-term storage of potato, 

choke cherry and Saskatoon berry 


cultures 


‘Regrowth’ of 93-100 % was recorded 
for non-stored encapsulated shoot tips, 
directly transferred on soil in the 
greenhouse after 2 weeks pre-culture on 
MS solid medium with an added 
fungicide (Carbendazim) in the 
encapsulating gel. The ‘regrown’ shoot 
tips produced plants showing normal 
development 


Microtubers produced on media devoid 12 months 
of ABA and containing high sucrose 
concentrations and N6-benzyladenine 
(44.38 mM) could be stored for 

12 months under diffused light at 

6-18 °C 

Poor microplant quality at standard Ca 
[ext] over prolonged storage under 
minimal growth was due to limiting Ca 
nutrition, and this could be improved by 
using Ca [ext]-enriched (5.0—7.0 mM) 
minimal growth medium for conservation 
of potato microplants 


Not given 


More than 
5 years 


In both in vitro and greenhouse 
conditions, potato and the somatic 
hybrids produced few bigger tubers, 
while many small tubers were obtained 
from the relative. The hybrid tubers were 
morphologically intermediate. The starch 
content of hybrid tubers was much lower 
than that of potato, but similar to that of 
the relative species. Interestingly, the 
level of bacterial resistance, introgressed 
from S. stenotomum into potato, was 
shown to be very stable and remained as 
high as that of the relative after a 
long-term period of in vitro conservation 


6, 8,12 and 
18 months 


Microvessels appeared to be suitable for 
storing single node cuttings under slow 
growth conditions up to a year at least 
3, 6, 9 and 
12 months 


High percent of cultures remained viable 
without serious signs of senescence. The 
growth rate decreased as storage period 
increased. The role of sorbitol as osmotic 
agent in storage was examined. Healthy 
shoot bud cultures were obtained after 

6 months of storage on medium 
containing 40 g dm™ sorbitol 


Storage period 


Up to 390 days 


Reference 
Pruski (2001) 


Nyende et al. 
(2003) 


Gopal et al. 
(2004) 


Sarkar et al. 
(2005) 


Fock et al. 
(2007) 


Negri et al. 
(2000) 


Bekheet et al. 
(2002) 
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Table 22.1 (continued) 


Storage regime 
15 °C and 18 °C 


Species 


Ensete 


Grapevine (Vitis 4°C 
vinifera L.) 


Mulberry (Morus 15-20 °C 
spp.) 


Raspberry 4°C 


Medicinal and spice crops/species 


Acorus calamus 10 °C and SCC 
Artocarpus 10 °C and SCC 
heterophyllus 

Lam. 


Bacopa monnieri 2 °C 


(L.) 


Conditions Storage period 


In vitro cultures can be effectively 6 months 
maintained for 6 months at 15 °C and 

18 °C on MS medium supplemented with 
10 pM BAP, in the presence of mannitol 
at concentrations of 0 %, 1% or2 % asa 
growth retardant. Shoots were 
subsequently recovered and multiplied on 
MS medium supplemented with 10 and 
20 uM BAP at 25 °C, and rooted shoots 
were successfully transferred to the 
greenhouse 


After 42 months of dehydrated storage, 
90 % of the somatic embryos regenerated 
into plants. To test utility of this storage 
method, dehydrated embryos stored for 
12 and 26 months cool temperature 
storage of dehydrated somatic embryos is 
a simple and inexpensive method of 
clonal germplasm preservation when 
compared to alternatives such as 
cryopreservation 


42 months 


A low temperature of 0-5 °C is used for Not mentioned 
cold-tolerant species, but plants from 

tropical and subtropical regions may not 

survive such low temperatures, in which 


case they are stored at 15—20 °C 


In the first, subculture after storage was 
lower than that of non-stored cultures. In 
the second, subculture was lower, and in 
the third, subculture was similar to that in 
non-stored cultures 


3 and 6 months 


MS medium devoid of growth regulators 6 months 
for medium-term storage (under standard 
culture conditions). These cultures were 
incubated at 26+2 °C with 31.55 m-2s-2 
light intensity. In vitro established 
cultures of Acorus calamus were 
relocated to chambers having reduced 
light and temperature conditions (reduced 
temperature of 10 °C and reduced light of 
intensity of 2.97 um-2s-1) to study the 
effect of storage 


Protocols have been optimized forI[VAG 4 years 
for this species 


3,6 and 
24 months 


A slow growth protocol was developed 
for medium-term conservation using 
mineral oil (MO) overlay. Nodal 
segments of B. monnieri (two genotypes; 
1C249250, IC468878) were conserved 
using MO for 24 months. Normal plants 
regenerated from conserved cultures were 
successfully established in soil 
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Reference 


Negash et al. 
(2001) 


Jayasankar 
et al. (2005) 


Vijayan et al. 
(2011) 


Lisek and 
Orlikowska 
(2004) 


Rajasekharan 
et al. (2010) 


Ganeshan 
et al. (2010) 


Sharma and 
Anwar (2012) 
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Table 22.1 (continued) 


Species 
Coleus forskohlii 


Coleus sp. 


Curcuma longa 
L. (turmeric) 


C. malabarica 
and C. aromatica 
(Curcuma sp.) 


Decalepis 
hamiltonii 


Eleutherococcus 
senticosus 
Maxim. 


Storage regime 


10 °C and SCC 


SCC 


Conditions 


In vitro plants could be successfully 
conserved in full MS under SCC for 
6 months without any subculture 


Vitro plants could be successfully 
conserved under SCC for 6 months 
without subculture and full potential to 
regenerate 


Genetic stability of 12-month-old in vitro 
conserved plants was assessed using 25 
random amplified polymorphic DNA 
(RAPD) primers; no significant variation 
was observed in RAPD profiles of mother 
plants and in vitro conserved plantlets on 
CM and low-cost media 


Plantlet regeneration and medium-term 
genotype conservation phenomena were 
genotype dependent and influenced 
significantly by type and concentration of 
cytokinins used. The number of shoots 
per culture ranged from 1.3 to 7.2 and 
conservation period from 264 to 

379 days. In 30-day-old cultures, highest 
frequency of shoot regeneration could be 
obtained in C. malabarica (7.2 shoots per 
culture) on MS + 11.4 uM zeatin. 
Curcuma sp. (unidentified wild species) 
could be conserved for maximum period 
(379 days) on MS + 24.6 uM 2iP 
followed by C. aromatica (363 days) on 
MS + 22.8 uM zeatin 


Microshoots, recovered from 
encapsulated nodal segments (capsule), 
were best rooted on half-strength MS 
medium containing 2.5 pM a-naphthalene 
acetic acid (NAA). Complete plantlets 
(with shoot and root) were successfully 
acclimatized and established in field 
where they grew well without any 
detectable variation 


An average of 115,370 germinated 
somatic embryos (SEs) developed from 
an initial 400 mg of embryogenic callus, 
and 64.7 % of germinated SEs converted 
into plantlets after a 4-week culture on 
agar medium. During the bioreactor 
culture process, secondary SEs were 
induced directly from SEs at various 
stages, a phenomenon that rarely 
occurred in suspension culture. These 
secondary SEs developed quickly and 
germinated during the bioreactor culture 
process 


Storage period 
6 months 


6 months 


Up to 12 months 


264-279 days 
(C. malabarica) 
379 days (C. 


aromatica 


6 weeks 


4 weeks 


P.E. Rajasekharan and L. Sahijram 


Reference 


Rajasekharan 
et al. (2004) 


Rajasekharan 
et al. (2010) 


Tyagi et al. 
(2007) 


Tyagi et al. 
(2004) 


Sharma and 
Anwar (2012) 


Yang et al. 
(2012) 
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Table 22.1 (continued) 


Species 
Kaempferia 
galanga 


Medicinal plants 


Momordica 
sahyadrica 


Nothapodytes 
nimmoniana 


Pelargonium 
sidoides DC. 
(Geraniaceae) 


Piper longum 


Rauvolfia 
serpentina 


Splachnum 
ampullaceum 
(moss buds) 


Taxus media 


Storage regime 
10 °C and RCC 


10 °C and SCC 


10 °C and SCC 


10 °C and SCC 


SCC 


10 °C and RCC 


1 °C in darkness 


Conditions 


Reduced culture conditions (RCC) of 
light, temperature and media showed that 
vitro plants retained the capacity to 
regrow after storage for a duration of 

6 months without intervening subculture 


Protocols for conserving plant diversity 
for 22 species of medicinal plants have 
been optimized. SCC (standard culture 
conditions) and RCC (reduced culture 
conditions: light and temperature) 


In vitro multiplication and conservation 
of wild Momordica sahyadrica 


Under reduced temperature and light, 

in vitro plants grew well (without any 
subculture) even after 6 months of 
incubation. Cultures in MS basal medium 
showed shoot elongation of 5-10 mm 
compared to MS with TDZ (3-50 mm) 


Based on the pharmacological data 
generated in this study, aerial parts can be 
used to substitute the commonly 
preferred underground tubers for the 
preparation of herbal formulations and 
antioxidant properties 


Only shoots that indexed negative for 
endogenous bacteria were used for 
proliferation and in vitro conservation 
studies. The shoot cultures could be 
maintained without subculturing for as 
long as 8 weeks in MS medium 
supplemented with 1 mg/l paclobutrazol 
(PBZ) and 40 mg/I ascorbic acid 


Lengthening the time between 
subcultures up to 1 year and consequently 
reducing maintenance requirements could 
be useful for in vitro germplasm 
collections 


Moss regeneration was evaluated at 
different periods of time to examine the 
efficiency of the technique for moss 
germplasm conservation 


Paclitaxel levels increased in the cells 
after conservation during the first 
recovery subculture cycle and then 
decreased during the subsequent recovery 
subculture cycle. The results of high- 
performance liquid chromatography 
indicated that DNA methylation 
increased during the course of repeated 
subculture. A decrease in DNA 
methylation level caused by treatment 
with 5-Aza- 2’ -deoxycytidine coincided 
with an increase in paclitaxel levels 


Storage period 
6 months 


Different periods 


6 months 


6 months 


6 weeks and 


12 months 


4—8 weeks 


1 year 


Long term (2 and 


2.5 years) 


180 days 
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Reference 


Rajasekharan 
et al. (2009) 


Rajasekharan 
et al. (2010) 


Rajasekharan 
and Ganeshan 
(2010) 


Rajasekharan 
et al. (2010) 


Moyo et al. 
(2013) 


Rani and 
Dantu (2012) 


Rajasekharan 
et al. (2007) 


Li et al. 
(2013) 
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Table 22.1 (continued) 


Species Storage regime 
Tylophora indica 10°C and RCC 
Vanilla (Vanilla  2°C 
planifolia) 

Zingiber sp. — 


Pulse crops 


Wild Arachis 10 °C and 45 % 


relative humidity 


Commercial crops 


C. malabarica 
(Curcuma sp.) 


14 °C and zero 
irradiance 


Corymbia 
torelliana x C. 
citriodora and 
Khaya 
senegalensis 


Conditions 


Vitroplants could be successfully 
conserved in full-strength MS medium 
(FMS) under SCC for 6 months without 
subculture with full potential to 
regenerate, producing viable shoots and 
nodes 


The conserved material could be 
retrieved and multiplied normally in MS 
medium with 1.0 mg I"! BA and 0.5 mg 
l' IBA 

A total of 33 genotypes of cultivated and 
wild species of Zingiber were 
subsequently tested for conservation 
through in vitro rhizome formation on 
CM under 16 h light condition. All 
genotypes produced rhizomes of varying 
size with numbers ranging from 3 to 15 
per culture and were conserved for at 
least 12 months; some genotypes could 
be conserved even up to 16—20 months 


Arachis seeds display a sub-orthodox 
behaviour due to their high lipid content 
and the fragile tegument, which may lead 
to oxidative damages. It leads to loss of 
germinative potential, which occurs even 
under optimal storage conditions. Plant 
multiplication under field conditions can 
be limited by specific soil and 
environmental requirements or by low 
seed yielding 


Plantlet regeneration and medium-term 
conservation phenomena were genotype 
dependent and were influenced 
significantly by type and concentration of 
cytokinins used. The tissue culture-raised 
plantlets were morphologically similar to 
their parents. The in vitro conserved 
plants multiplied rapidly in tissue 
cultures and produced normal rhizomes 
upon transfer to soil in net house 


Encapsulated explants of both trees were 
preserved most effectively on high- 
nutrient (half-strength MS medium 
containing | % sucrose), which provided 
very high frequencies of shoot regrowth 
(92-100 % for Corymbia and 71-98 % 
for Khaya) and excellent shoot 
development after 12 months of storage. 
This technique provides an extremely 
efficient means for storage and exchange 
of eucalypts and mahoganies, ideally 
suited for incorporation into plant 
breeding and germplasm conservation 
programmes 


P.E. Rajasekharan and L. Sahijram 


Storage period Reference 


6 months Rajasekharan 


et al. (2009) 


10 months to Divakaran 


7 years et al. (2006) 

2—3 weeks Tyagi et al. 
(2006) 

15 years Pacheco et al. 
(2009) 

264-379 days Tyagi et al. 
(2006) 

0, 3, 6 and Hung and 

12 months Trueman 
(2012) 
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Table 22.1 (continued) 


Species Storage regime 
Hibiscus 5 °C calcium 
moscheutos alginate 

microencapsulation 
Olive (Olea — 


europaea L.) 


Saccharum spp. 2 °C 


Saccharum spp. 2-18 °C 


hybrid (genotype 

88H0019) 

Sequoia 4 °C encapsulated 

sempervirens and 

(D. Don.) Endl nonencapsulated 
shoots 


Strawberry beads 4 °C 


Conditions Storage period 


80 % of encapsulated hardy hibiscus 
nodal segments survived refrigerated 
storage for 1.5 years (78 weeks). After 

3 months on proliferation medium, the 
nodal segments produced nearly the same 
length axillary shoots with the same 
number of axillary nodes per shoot as 
compared to encapsulated segments 
either not stored at 5 °C or stored for 

24 weeks at 5 °C 


Italian local cultivar showed a notable 1 month 
difficulty for the in vitro establishment 

due to heavy pathogen contamination. 

Mercury chloride and sodium 

hypochloride in the sterilization step and 

antibiotics in culture media allowed 

overcoming the problem. Proliferation of 

shoot apical bud on olive culture medium 

with 36 g dm? mannitol and 4.56 uM 

zeatin appeared very satisfactory 


For storage of whole plantlets, the 2 weeks 
highest survival rates and shoot regrowth 

after 8 months were observed at 18 °C on 
half-strength MS and 10 g I" sucrose, 

with or without 30 g I"! sorbitol 


For storage of whole plantlets, the 8 months 
highest survival rates and shoot regrowth 

after 8 months were observed at 18 °C on 
half-strength MS and 10 g I" sucrose, 

with or without 30 g I"! sorbitol 


Survival and regrowth of encapsulated 15 months 
shoots declined within 3 months, 
regardless of the storage medium 
composition. No significant decrease in 
survival and regrowth was noted with 
nonencapsulated shoots after 12 months 
of storage on Quoirin and Lepoivre 
medium supplemented, or not, with 

1 mgl"' benzyladenine. Regrowth 
dropped to 60-61 % after 15 months of 
storage on the same media 


In the first, subculture after storage was 9 months 
lower than that of non-stored cultures. In 

the second, subculture was generally 

higher than in non-stored cultures, and in 

the third, subculture was similar to that in 
non-stored cultures 


SCC standard culture conditions, RCC reduced culture conditions 
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Reference 


West et al. 
(2006) 


Zacchini and 
De Agazio 
(2004) 


Watt et al. 
(2009) 


Watt et al. 
(2009) 


Ozudogru 
et al. (2011) 


Lisek and 
Orlikowska 
(2004) 
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Table 22.2 Tissue culture and cryopreservation at 
National Gene Bank of India (NBPGR, New Delhi) as on 
28 February 2014 


Crop/crop group Present status (no.) 
Tropical fruits 420 
Temperate and minor tropical fruits 305 
Tuber crops 619 
Bulbous crops 171 
Medicinal and aromatic plants 153 
Spices and industrial crops 225 
Total 1,893 
Recalcitrant 0 
Intermediate 6,113 
Orthodox 3,253 
Dormant bud (mulberry) 387 
Pollen 446 
Total 1,0199 
References 
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Abstract 

Seed storage is one of the most widespread and valuable ex situ approaches 
to conservation. Extensive expertise has been developed in this field by 
agencies and institutions involved with plant genetic resources over the 
past 30 years. Seed banking has considerable advantages over other meth- 
ods of ex situ conservation such as ease of storage, economy of space, rela- 
tively low labour demands and, consequently, the capacity to maintain 
large samples at an economically viable cost. Depending on the species, 
seeds are dried to suitably low moisture content according to an appropri- 
ate protocol. Typically this will be less than 5 %. The seeds then are stored 
at —18 °C or below. Because seed DNA degrades with time, the seeds need 
to be periodically replanted and fresh seeds collected for another round of 
long-term storage. There are about six million accessions, or samples of a 
particular population, stored as seeds in about 1,300 gene banks through- 
out the world as of 2006. The procedure for seed storage along with the 
latest development on gene banking 1s discussed in this chapter. 
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responsible for the extensive and valuable 
collections assembled at the All-Union Institute 


The former Soviet Union gained an early lead in 
collecting and conserving plant genetic resources 
as a result of the work of Vavilov, who was 


PE. Rajasekharan (Db) 

Division of Plant Genetic Resources, 

Indian institute of Horticultural Research (IIHR), 
Hessaraghatta Lake P.O., Bangalore, 

Karnataka 560089, India 

e-mail: rajasekharan.pe @ gmail.com 


Bir Bahadur et al. (eds.), Plant Biology and Biotechnology: Volume II; Plant Genomics 


of Plant Industry at St. Petersburg (formerly 
Leningrad). The Vavilov All-Union Institute of 
Plant Industry (VIR), as it was to be known later, 
became the central nationwide institution respon- 
sible for collecting and conserving global plant 
diversity and studying it for the purposes of plant 
breeding and crop improvement in the Soviet 
Union (Vavilov 1997) In the United States, col- 
lection and evaluation of germplasm has been 
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performed since the 1900s, but the first reposi- 
tory of crop seeds was not established until 1947 
when the Regional Plant Introduction Station in 
Ames, IA, began operation. In 1958, the first 
national facility for storage of germplasm at low 
temperatures in cold rooms was constructed at 
Fort Collins, CO. These facilities were recently 
upgraded (Qualset and Shands 2005). An IBPGR 
survey in 1975 revealed that there were only 
eight long-term genetic resources’ conservation 
centres globally (almost all of them in the 
industrialised countries). But only 7 years later, 
the total had jumped to 33, and today there are 
over 1,000 major germplasm collections in gene 
banks all over the world (Qualset and Shands 
2005). Crop genetic resources grown from true 
seed are stored in three main types of gene banks. 
In long-term gene banks, whose aim is to store 
material for 50-100 years, samples are kept at 
—10 °C to —20 °C in airtight containers. Usually 
before such storage, the seed samples are dried to 
a moisture content of 5-7 %. In medium-term 
facilities, as most working collections are stored, 
temperatures of 0—5 °C are maintained and seeds 
may last up to 10-15 years. The short-term col- 
lections or breeder’s collections are usually kept 
in paper envelopes or cloth bags or tin cans at 
5-15 °C without any seed drying. Samples are 
constantly withdrawn for evaluation. Much of the 
germplasm of major crops and their wild and 
weedy relatives already resides in gene banks. 
The effect of the vast increase 1n accessions since 
the 1970s on germplasm utilised for crop produc- 
tion has thus far been modest (Damania 2008). 
More than a decade ago, a US survey found 
around three-quarters of soy and wheat breeders 
and around three-fifths of cotton and sorghum 
breeders, but only 45 % of corn breeders used 
gene banks more than ‘rarely’. 

Since the 1970s, work on the conservation of 
crop genetic resources has increasingly become a 
large-scale independent activity detached from 
crop improvement efforts. A substantial germ- 
plasm collecting effort was launched in the 1970s 
in response to concerns about genetic erosion and 
crop vulnerability. Over 1,000 gene banks have 
been established, holding about six million acces- 
sions (FAO 1998). The germplasm holdings in 
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major international agricultural research centres 
are provided in Table 23.1. Seed banks of a global 
network of international agricultural research 
institutions, coordinated by the Consultative 
Group on International Agricultural Research 
(CGIAR), Washington, are focused on crops and 
have extensive seed collections for such crops as 
rice, maize, wheat, barley, millets, pulses, oil 
seeds, tuber crops, banana, tropical forage and 
fruits. The collections in these seed banks are 
well documented, and the institutions are net- 
worked among themselves and with several other 
institutions. The Millennium Seed Bank Project 
(MSBP) at the Royal Botanic Gardens, Kew, 
England, is one of the largest conservation proj- 
ects. MSBP’s 47 partner organisations in 17 
countries intend to store 25 % of the world’s 
plant species by 2020. The Seed Information 
Database (SID) at Kew is an ongoing compila- 
tion of seed characteristics and traits worldwide, 
targeted at >24,000 species. 

What makes seed banks such an effective ex 
situ conservation technique is that the methodol- 
ogy can be applied to a wide range of species in a 
universal and straightforward way and that large 
amounts of intraspecific diversity can be con- 
served and for long periods of time without inter- 
vention. Additionally, germinating seeds to 
obtain fully grown plants is relatively simple 
compared with obtaining plants from in vitro- 
stored material. Plants recovered from banked 
seeds can also be compared with natural popula- 
tions from which the material was harvested 


Table 23.1 PGR holdings at [ARCs (Source: SINGER) 


Name of IARC Number of accessions 
AVRDC 52,845 
Bioversity International 1,208 
CIAT 72,246 
CIMMYT 120,527 
CIP 15,092 
ICARDA 140,189 
ICRAF 1,785 
ICRISAT 114,865 
IITA 27,596 
ILRI 20,177 
IRRI 108,272 
WARDA 21,752 


23. Gene Banking for Ex Situ Conservation of Plant Genetic Resources 


years before and which may have been subjected 
subsequently to environmental change (e.g. as a 
result of global warming). 


23.2 Current Status of Seed Banks 
for Food and Agriculture 


Just under half of the six million accessions are 
held in 12 national collections. To some degree, 
this is a function of the early establishment of 
their genetic resource collections. The collec- 
tions include those in Russia and the United 
States (noted earlier), Japan (established in 
1966), Germany (1970), Canada (1970) and 
Brazil (1974). Of the 1,308 national or regional 
collections currently noted by FAO (1996), only 
397 within 75 countries are held in long- or 
medium-term seed banks. Medium-term storage 
might be assumed to be in the order of 10 or 
more years. 


23.3 Scientific Principles 
Underlying Seed Banking 


Viable seeds of many species when maintained in 
a dry and cold state are capable of being germi- 
nated many years later. This capability means 
that the long-term ex situ conservation of many 
higher plants is a realistic possibility. 


23.3.1 Seed Storage Conditions 


The science of seed storage is not a new one and 
dates back, at least, to China in the sixth century. 
Advances in the quantification of seed longevity 
under different storage conditions were made in 
the second half of the twentieth century through 
the work of Harrington in the United States and 
Ellis and Roberts in the United Kingdom (see 
review in Hong, Jenkins et al. 1998). Critical fac- 
tors that determine seed longevity are the seed’s 
moisture content, temperature, and gaseous envi- 
ronment; its initial viability; and its genetic back- 
ground. With respect to the latter, differences 
between species would appear to be much greater 
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than those within. Genetics particularly influ- 
ences the relationship between seed longevity 
and seed moisture content. Most species produce 
seeds that can be dried to low moisture contents 
(e.g. where less than 5 % of the seeds’ fresh 
weight is water) without loss of viability. The 
seeds of such species are termed ‘orthodox’. 


23.3.2 Genetic Considerations 


The genetics of seed bank storage is an important 
issue. A criticism occasionally levelled at seed 
banks is that there is selection in storage. 
Selection can occur during collection through a 
biassed sample (e.g. for early- or late-flowering 
genotypes). It can also occur when samples are 
grown out under conditions that differ dramati- 
cally from those where the seeds were harvested. 
There is, however, little evidence to show that, 
compared with, say, room conditions, the seed 
bank environment does other than slow down the 
(normally distributed) times for individuals to 
die. In other words, there is no greater risk of 
selection out of any individual by the conditions 
applied than under natural conditions. 


23.4 Other Forms of Gene Banks 
23.4.1 Conservation of Pollen 


Conservation of nuclear genetic diversity (NGD) 
using pollen is desirable in horticultural species 
for a variety of reasons. Cryopreserved pollen 
can be a major access point for pre-breeding 
germplasm lines, hybrid seed production and bio- 
technological and other basic studies. In the case 
of tree species, germplasm can be easily received 
and exchanged through pollen, eliminating a long 
juvenile phase. The objective of a useful pollen 
cryostorage protocol is to collect mature pollen 
from plant and treat it so as to retain its normal 
function, ultimately assessed by its ability to ger- 
minate in vivo and effect normal fertilisation 
(Hanna and Towill 1995). Alexander and 
Ganeshan (1993) reviewed the work on pollen 
storage in fruit crops. Hoekstra (1995) has 
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assessed the merits and demerits of pollen as 
genetic resource. Ganeshan and Rajasekharan 
(1995) reviewed work on ornamental crop pollen 
storage. Grout and Roberts (1995) detailed the 
methodology for pollen cryopreservation. 
Recently, Barnabas and Kovacs (1997) and 
Berthoud (1997) stressed the importance and 
need for pollen conservation. Response to cryo- 
preservation of pollen of 45 species belonging to 
15 families is presented in Table 23.1. In some of 
the recently cryostored pollen, only feasibility 
tests were carried out. In most species, protocols 
are optimised for establishing pollen cryobanks. 

Besides the already existing role of pollen 
banks in breeding, there are many promising 
applications which have come to focus with the 
recent advances in allied bioscientific areas. 
Some of the practical utilities are discussed 
below: 


23.4.2 Advantages to the Use 
of Pollen 


¢ Gene-banked pollen can be made available to 
breeders upon request. For tree species, this 
obviates the need for growing the male parents 
in the breeding orchards. It allows for wide 
hybridisation across seasonal and geographi- 
cal limitations and reduces the coordination 
required to synchronise flowering and pollen 
availability for use in crosses. With adequate 
pollen available, one can also load additional 
pollen onto stigmas to increase pollination 
and yield. 

e Pollen is available for research programmes. 
As single cells, pollen provides a simple 
model system for research on conservation. 
Storage of pollen within gene banks also 
ensures its availability year-round for basic 
biology and allergy research programmes 
(Shivanna 2003). 

e Pollen captures diversity within small sample 
sizes, and documentation 1s available for long- 
term survival of pollen from many diverse 
species. Pollen also serves as a source of 
genetic diversity in collections where it is hard 
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to maintain diversity with seeds (species of 
low fecundity, large seeds, or seeds that 
require an investment of labour to store). 

e Pollen can also be shipped internationally, 
often without threat of disease transfer 
(Hoekstra 1995) (http://cropgenebank.serp. 
cgiar.org/). 

Detailed protocols for pollen cryopreservation 
have been given by Ganeshan et al. (2008) and 

Rajasekharan et al. (2013). 


23.4.3 Conservation of DNA 


Advances in genomics have provided technolo- 
gies for high-throughput analysis of plant 
genomes with potential for use in gene discovery 
in germplasm collections. Biotechnology has 
made possible DNA conservation of plant spe- 
cies, in the form of extracted DNA or as genomic 
DNA libraries. However, this technique needs 
advanced technological inputs to match with the 
importance and requirement for the species in 
question, to justify its need as a practical conser- 
vation strategy. For any given plant species, DNA 
conservation could be advantageous for: 
1. The study of molecular phylogenetics and sys- 
tematics of extant and extinct taxa 
2. Production of previously characterised sec- 
ondary compounds in recombinant DNA- 
mediated transgenic cell cultures 
3. Production of transgenic plant using genes 
from gene families 
4. In vitro expression and study of enzyme struc- 
ture and function, synthesising genomic 
probes for research laboratories 
The establishment of DNA banks facilitates 
this screening by making DNA from large num- 
bers of plant accessions widely available. DNA 
banks require the development of appropriate 
policies for access and benefit sharing. Tools for 
automating sample and data handling are 
essential. Standard molecular methods for finger- 
printing DNA accessions for international com- 
parisons need to be determined. New screening 
technologies are required to take advantage of the 
emerging availability of large DNA collections. 


23. Gene Banking for Ex Situ Conservation of Plant Genetic Resources 


The Australian Plant DNA Bank aims to collect 
DNA from all Australian plant species and to 
sample the diversity within each species. DNA 
from all individuals of the species is being stored 
for rare species. Domesticated or economically 
important species from all countries are also 
being collected and stored. The international net- 
working of DNA banks will be a key step in link- 
ing genomics tools to global plant diversity (Rice 
et al. 2006). 


23.4.3.1 The Use of DNA Banks 

Molecular techniques are becoming increasingly 
important in the study and management of 
genetic resources. DNA has been routinely 
extracted and stored from the nuclei, mitochon- 
dria and chloroplasts of many plant species, 
together with derivatives such as RNA, cDNA 
and genes. Technologies are available to allow all 
these to be stored quickly and at low cost in DNA 
banks as an insurance policy against loss of crop 
diversity. DNA storage has so far been under- 
taken with objectives other than conservation in 
mind, usually to allow genetic material to be 
made readily available for molecular applica- 
tions, for distribution or for training (http://crop- 
genebank.serp.cgiar.org/). 


23.4.3.2 Advantages 

DNA banking is an efficient, simple and long- 
term method to conserve the _ genetic 
information. 


23.4.3.3 Disadvantages 

There are problems with subsequent gene isola- 
tion, cloning and transfer of DNA back to a plant, 
and it currently does not allow the regeneration of 
the same genotype as the original sample. 


23.4.3.4 Storage 

There is little information on the long-term sta- 
bility of extracted DNA during frozen storage, 
but most repositories consider several years to 
decades as realistic. Information on the stability 
of purified DNA dissolved in buffer suggests that 
the overall fragment size decreases with storage 
time and that the usefulness of the specimen for 
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PCR-based assays may be 1—2 years when stored 
at 4 °C, 4-7 years when stored at —18 °C and 
greater than 4 years when stored at —80 °C 
(Madisen et al. 1987; Visvikis et al. 1998). The 
choice of temperature usually depends on the 
moisture level within the sample. 


23.5 Community Seed Banks 
(CSBs) 


Community seed banks fulfil diverse purposes of 
sustainable agriculture for small and marginal 
farmers. These seed banks serve as focal point in 
maintaining indigenous genetic diversity on 
farms involving farmer community. CSBs serve 
local farmers to form an informal seed distribu- 
tion system prevailing in villages since ancient 
time at no or very low cost. Community partici- 
pation in maintaining local genetic diversity pro- 
vides pride to farmers and sense of belonging for 
local landraces. This system is run, maintained 
and promoted by farmers to facilitate good qual- 
ity seeds and input (Malik et al. 2013). There are 
no set guidelines available to establish and man- 
age community seed banks as they form an 
important part of informal seed distribution sys- 
tem in villages since ancient time. The farming 
community as per their convenience has devel- 
oped this system and the same is being continued 
by the farmers. 


23.6 Soil Seed Bank 


The soil seed bank is the natural storage of seeds, 
often dormant, within the soil of most ecosys- 
tems. The study of soil seed banks started in 1859 
when Charles Darwin observed the emergence of 
seedlings using soil samples from the bottom of a 
lake. The first scientific paper on the subject was 
published in 1882 and reported on the occurrence 
of seeds at different soil depths. Weed seed banks 
have been studied intensely in agricultural science 
because of their important economic impacts; 
other fields interested in soil seed banks include 
forest regeneration and restoration ecology. 
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23.7 Seed Bank Management 
23.7.1 Procedure 


The basic elements of the seed banking proce- 

dure (more or less in order) are as follows 

(Linington et al. 2001): 

¢ Collection planning and permission seeking 

e Seed (and pressed specimen) collecting and 
field data recording 

e Shipment of seeds 

e Creation of a data record about the accession 

e Seed cleaning (sometimes preceded by initial 
drying and sometimes accompanied by X-ray 
analysis and quantity determination) 

e Main drying 

e Seed moisture determination 

e Initial germination test (sometimes left until 
after banking) 

e Packaging and _ banking 
duplication 

e Characterisation (including verification of 
identity in the case of wild species) and evalu- 
ation (where appropriate) 

e Distribution of stocks to users (through time) 

e Germination retests (through time) 

e Regeneration/multiplication (as required) 


and security 


23.7.2 Seed Collection 


Set against the background of the CBD and the 
International Undertaking, collecting should only 
be carried out with the permission of the national 
and local authorities. In India as per the 
Biodiversity Act, one has to take permission from 
local bodies for collection. Foreign collectors 
should work collaboratively with local scientists, 
and clear agreements on benefit sharing should 
be in place. One immediately tangible benefit is 
the sharing of collections and the information 
relating to them. Other international regulations 
need to be adhered to. These include the 
Convention on International Trade in Endangered 
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Species (CITES) and national quarantine laws. 
Seed collecting methodology and _ genetic 
resource exploration have been thoroughly cov- 
ered by Guarino et al. (1995). In most instances, 
random and even sampling of wild plant or crop 
populations is recommended, including careful 
note taking of the sample method (in often less 
than perfect conditions). Objective data record- 
ing 1s essential as is accurate recording of loca- 
tion. This latter aspect is now facilitated by the 
use of Global Positioning System receivers that 
help fix latitude, longitude and even altitude 
using satellites. 


23.7.3 Seed Cleaning 


Seed cleaning is the removal of debris, inert 
material, damaged and infested or infected seeds 
and seeds of different species (e.g. weeds) to 
achieve clean and pure samples of seeds of high 
physiological quality for storage. 

Seed cleaning is necessary to: 

e Reduce bulk during transportation. 

e Improve purity of the sample. 

e Optimise storage space and reduce costs. 

e Prevent seed from going mouldy and help 
reduce ‘damping off’ or fungal contamination 
after germination. 

e Allow precise regulation of seed moisture 
content during storage. Seeds should be 
cleaned immediately after harvest or soon 
after they arrive at the gene bank. Cleaning 
methods vary according to the type of seed. 
Cleaning should not cause damage to samples 

or lead to waste. It can be done manually or by 

machines, but gene banks are strongly advised to 
clean accessions by hand for the following 
reasons: 

¢ Mechanical cleaning could result in selection 
within genetically heterogeneous accessions 
(due to exclusion of very small and very large 
seeds passing through mechanical apertures). 

e Equipment requires rigorous cleaning and 
often careful adjustment between accessions. 
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23.7.4 Seed Bank Storage Standards 


Seeds can be classified into the following types 

according to seed storage behaviour: 

1. Orthodox — seeds that can withstand low seed 
moisture content and can be stored at low tem- 
perature without losing viability 

2. Recalcitrant — seeds that lose viability if mois- 
ture content is less than 12-13 % 

3. Intermediate — seeds that exhibit storage char- 
acteristics between orthodox and recalcitrant 
seeds 
For orthodox seeds, the storage potential is 

influenced by inherent and external factors; for 
example, some legumes that are hard-seeded are 
long-lived, while seeds with high oil content are 
short-lived. There is also observed variation at 
the eco-geographic races (e.g. indica vs. japonica 
rice) and cultivar levels. Seeds harvested at their 
physiological maturity generally store better. The 
environment (temperature, moisture, nutrition 
and light) under which the crop is planted can 
also affect storability. The longevity of seeds 
depends on the initial seed quality, moisture con- 
tent and temperature during storage. In general, 
low moisture content and low temperature reduce 
the loss of seed viability. Harrington’s rule of 
thumb can be applied as rough estimate of length 
of storage in reference to temperature and seed 
mc. The rule states that beginning at 14 % mc, for 
every | % decrease, the lifespan of the seeds dou- 
bles, and beginning at 50 °C, for every 5 °C 
decrease in storage temperature, the lifespan of 
the seeds doubles. 

No two banks are the same. Traditionally, seed 
banks have been classified into the following 
categories: 

e Base collections that are for the long-term 
storage of seed lots and from which seeds are 
not normally sent to users (though this is not 
always the case). 

¢ Active collections from which seeds are 
made available to users. Often such stores 
are maintained under less optimal storage 
conditions compared to those holding base 
collections though this need not be the 
case. FAO/IPGRI (1994) recommends that 
storage lives of 10-20 years might be 
appropriate. 
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23.7.5 Seed Drying 


Seed drying is the reduction of seed moisture 
content to the recommended levels (which should 
not be lower than the critical seed moisture con- 
tent) for storage using techniques which are not 
detrimental to seed viability. Dry seeds retain 
viability for longer periods during storage. Seeds 
are hygroscopic and absorb or desorb moisture 
depending on the relative humidity of the sur- 
rounding air or the gradient in water potential 
between the seed and surrounding air. If the water 
vapour pressure of the seed is greater than the 
surrounding air, the seed will lose moisture and 
becomes drier (desorption). If the water vapour 
pressure of the seed 1s lower than the surrounding 
air, the seed will gain moisture (absorption). 
Absorption or desorption occurs until the water 
vapour pressure in the seed and the surrounding 
air is balanced. The water content of seeds at 
equilibrium with the RH of surrounding air is 
referred to as equilibrium moisture content. 
Understanding the relation between equilibrium 
seed moisture content and relative humidity is 
important for the gene bank technician to decide 
on the drying regime for seeds. Several methods 
are available for drying seeds. Methods that mini- 
mise loss of viability during drying should be 
used. The most common and safe methods used 
for drying are dehumidified drying and silica gel 
drying. Drying rate depends on seed size, shape, 
structure, composition, initial seed moisture con- 
tent, amount of seeds and layers, air movement, 
temperature and relative humidity. 


23.7.6 Packaging 


Seed packaging is the placing of a counted or 

weighed sample of seeds of an accession into a 

container which is then hermetically sealed for 

subsequent storage. 
Seeds are packaged to: 

e Prevent absorption of water from the atmo- 
sphere after drying. 

¢ Keep each accession separate and avoid the 
mixing of accessions. 

e Prevent contamination of the seeds 
insects and diseases. 


from 
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The best time to package seeds is immediately 
after the moisture content has been determined 
and found to be within the required limits for safe 
storage. Dry seeds will reabsorb moisture from 
ambient air. Therefore, seeds should be packaged 
into containers and hermetically sealed without 
delay, soon after removal from the drying room 
or cabinet. Different types of containers are avail- 
able for packaging. The choice depends on stor- 
age conditions and species. The most important 
thing is that the packing material should be com- 
pletely impermeable to water and suitable for 
long-term use. Some frequently used containers 
in gene banks are glass bottles, aluminium cans, 
aluminium foil packets and plastic bottles. These 
different types of containers all have advantages 
and disadvantages. Glass bottles are good but 
fragile and can easily break. Aluminium cans are 
difficult to reseal once they are opened. 
Aluminium foil can be resealed and occupy less 
space in storage room. However, seeds with sharp 
projection can pierce the packets and moisture 
can leak inside. Plastic bottles are moisture resis- 
tant but not moisture proof. They should be used 
with caution if relative humidity of the storage 
room is not controlled. Packaging is best carried 
out in an air-conditioned room where the relative 
humidity is controlled. It is important to ensure 
that seeds taken from the drying room are exposed 
to the ambient air for the shortest possible time so 
that they do not reabsorb water. 


23.7.7 Storage Temperature 


Many long-term seed banks store seed under 
deep-freeze conditions using either purpose-built 
prefabricated cold rooms or domestic deep freez- 
ers. To reduce staff time at subzero temperatures, 
a few seed bank cold rooms, such as one at the 
National Institute of Agrobiological Resources 
(NIAR), Japan, have mechanised banking/ 
retrieval systems. The uses of such systems have 
implications to energy consumption by the bank. 
Use of permafrost has been considered for long- 
term duplicate storage of seed in places such as 
Svalbard. Stores are usually unable to match the 
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lowering of temperatures possible in conven- 
tional base storage conditions. In 1997, the 
Japanese-based Biological and Environmental 
Specimen Time (BEST) Capsule 2001 Project 
discussed the possibilities of long-term storage of 
flagship samples under Antarctic ice at 58 °C 
(which incidentally is not sufficiently low for ani- 
mal tissue preservation) or even on the dark side 
of the moon at 230 °C. 


23.7.8 Monitoring Seed Lot Viability 


Perhaps one of the most important parameters of 
seed bank effectiveness is the result of germina- 
tion monitoring. Germination is the preferred test 
for seed lot viability. Providing such information 
to those using the seed is helpful. Additionally, 
other viability tests such as vital staining using 
tetrazolium solution have a greater element of 
subjectivity about them. This staining test is, 
however, sometimes used to help distinguish 
between dead and dormant seeds among those 
that did not germinate under a given test regime. 
Two problems relate to the germination monitor- 
ing of seed bank accessions: First, because seeds 
are tested soon after arrival at the bank and then 
at regular intervals (often every 5—10 years) dur- 
ing their storage life, the tests need to be repeat- 
able and operator independent. Second, in order 
to recover aS many genotypes as possible repre- 
sented within a seed lot, it is necessary to break 
seed dormancy. This can be a particular problem 
in the seed of wild species and where the seed is 
freshly harvested. Key techniques include scarifi- 
cation of hard seed coats to facilitate water or 
oxygen permeation, imbibed chilling at 5—10 °C 
and later incubation at diurnal alternating tem- 
peratures with fluorescent light (.e. rich in red 
light) provided only in the higher temperature 
phase. Tests have to be seed lot specific as 
most seed dormancy is not strongly genetically 
inherited and the form it takes depends on the 
conditions under which the seed matured. 
Once determined, the same treatments can be 
used during the monitoring of that seed lot 
through time. 


23. Gene Banking for Ex Situ Conservation of Plant Genetic Resources 


23.7.9 Duplication 


One of the main advantages of seed banks 1s that 
they centralise collections of genetic material 
making them more easily accessed and studied. 
Indeed, some seed banks might be seen as some 
of the world’s greatest plant diversity hot spots 
with more individuals and, in some banks, more 
species per square metre than anywhere else on 
the planet. This centralisation poses a risk to all 
but the most carefully located and constructed 
facilities. Potential catastrophic loss, which of 
course threatens plants, conserved both ex situ 
and in situ, means that duplication of collections 
and their associated data is an important element 
of seed bank safety. The FAO report (1996) indi- 
cates that the level of security duplication of plant 
genetic resources for food and agriculture still 
needs to be improved and is at best uncertain. 


23.7.10 Characterisation 
and Evaluation 


Characterisation can vary from accurate naming 
of the species or subspecies represented by the 
collection through to more detailed recording of 
characters governed by genes that are little modi- 
fied by environmental factors (major genes). 
Such information, published in the form of 
descriptor lists, 1s of great value to plant breeders 
wishing to narrow their choice of material from, 
often, vast collections. Similarly, the concept of 
core collections has been established to facilitate 
use by breeders. A core collection genetically 
represents a limited set of accessions of a crop 
gene pool with the minimum repetition. 
Increasingly, characterisation is taking the form 
of more detailed molecular techniques such as 
screening by amplified fragment length polymor- 
phism. By contrast to characterisation, evaluation 
records data on traits such as yield that are 
strongly influenced by the environment in which 
the plants are grown. Such data are thus site and 
year specific and are perhaps of less use to 
breeders. 
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23.7.11 Distribution to Users 

A very important element of seed bank work is to 
make the seed available wherever possible. In 
evidence of the scale of such dispatch, Kerry Ten 
Kates and Laird (1999) quote an annual distribu- 
tion of nearly 120,000 samples by the US 
National Plant Germplasm System, of which 
65 % are sent abroad, many requested by refer- 
ence to the Germplasm Resources Information 
Network (GRIN) available on the Internet. 
Furthermore, the usage rate of crop banks by 
plant breeders is probably less than the rate of 
request from banks holding broader plant diver- 
sity collections where uses include a wide array 
of pure and applied research in addition to field 
trials. During 1994-1996, there was a 50 % 
request rate for seeds offered through an exten- 
sive list offered by the Kew Seed Bank. 


23.7.12 Regeneration 


Seed bank accessions are grown out for the pur- 
poses of regeneration of seed stock (either when 
seed numbers are low or when viability has 
reduced), for characterisation and for evaluation. 
Many banks have a regeneration standard below 
which the germination of a seed lot should not 
fall. This is usually set at 85 %. This high value 
limits the risk of accumulated genetic damage 
that is associated with seed ageing. Even though 
falling levels of seed germination are correlated 
with falling levels of field establishment, many 
banks have adopted lower standards. This may in 
part be due to the backlog of regeneration work 
that in some national facilities highlighted by 
FAO (1996) is nearly 100 % of the collection. By 
collecting high-quality seed lots in good quantity, 
other banks have reduced the necessity for regen- 
eration that can be time- and labour-consuming 
and that can have adverse effects on the genetics 
of the collection. Samples regenerated under con- 
ditions different from where they originated can 
experience selection. [If too small a sample is 
regenerated, genetic drift may occur in which 
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rarer alleles are lost through chance. Under some 
circumstances, recollection, if possible, may be 
the more desirable option. 


23.8 Seed Bank Design 


Having considered the aspects of seed bank man- 
agement, a brief consideration of seed bank 
design is appropriate (also see Cromarty et al. 
1985). The location of the bank is important from 
political, practical and security aspects. Potential 
risks have to be considered be they earthquake, 
flooding or radiation fallout. Some facilities are 
placed underground such as the seed bank at 
Krasnodar in Russia and the Millennium Seed 
Bank in the United Kingdom. Others such as the 
NSSL are located on the first floor to limit possi- 
ble impact from structures above resultant from 
seismic activity. The size of most banks should 
be dictated by peak annual intake (seed drying 
and cleaning facilities), projected capacity before 
a rebuild which is practical (seed storage) and 
annual collection maintenance (germination, 
field and greenhouse facilities). Cold storage 
facilities vary from a few domestic deep freezers 
up to large rooms such as one of 140 m? (with 
capacity for 150,000 samples) at NIAR in Japan. 


23.9 Gene Bank Standards 


The Genebank Standards for Plant Genetic 
Resources for Food and Agriculture is intended as 
a guideline for gene banks conserving plant collec- 
tions (seeds, live plants and explants). They were 
developed based on a series of consultations with a 
large number of experts in seed conservation, cryo- 
preservation, in vitro conservation and field gene 
banks worldwide. The standards are voluntary and 
nonbinding and have not been developed through 
standard-setting procedure. They should be viewed 
more as targets for developing efficient, effective 
and rational ex situ conservation in gene banks that 
provides optimal maintenance of seed viability and 
genetic integrity, thereby ensuring access to and 
use of high-quality seeds of conserved plant 
genetic resources (FAO 2014). 
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The standards, as described in this document, 
define the level of performance of a routine gene 
bank operation below which there is a high risk 
of losing genetic integrity. Each section is 
divided into: 

A. Standards 

B. Context 

C. Technical aspects 
D. Contingencies 

Specific standards have been developed for 
eight key areas and include the following 
sections: 

e Acquisition and initial handling 

e ‘Testing for nonorthodox behaviour and assess- 
ment of water content, vigour and viability 

¢ Hydrated storage for recalcitrant seeds 

e Jn vitro culture and slow growth storage 

e Cryopreservation 

e Documentation 

e Distribution and exchange 

e Security and safety duplication 

An operation manual is also available for gene 
bank (Rao et al. 2006) (Tables 23.2, 23.3, 23.4 
and 23.5). 

Promoting genetic resources’ utilisation, pre- 
cise evaluation and documentation of plant 
genetic resources is a prerequisite for their utili- 
sation. The following areas of research need to be 
paid more attention for promoting effective utili- 
sation of PGR gene resources: 


Table 23.2 PGR holdings in the National Gene Bank, 


India 

Crop group No. of accessions 
Cereals 145,765 
Minor millets 53,466 
Pseudo-cereals 6,619 
Grain legumes 56,870 
Oilseeds 54,994 
Fibre crops 11,483 
Vegetables 24,112 
Fruits 382 
Medicinal and aromatic crops 6,304 
Spices and condiments 2,708 
Agroforestry 2,433 
Safety duplicates 10,235 
Total 375,371 


Source: NBPGR website 
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Table 23.3 Details of various ex situ conservation sites 
for PGRFA in India 


Type of conservation and nodal Number of 
ministry department facilities 
Seed gene bank (long-term ICAR 1 
collections, —18 °C), Ministry of 

Agriculture 

Seed gene bank (medium-term ICAR 28 
collections, 4 °C), Ministry of 

Agriculture 

Seed gene bank (short-term ICAR 13 
collections at around 10 °C), 

Ministry of Agriculture 

Botanical gardens, Ministry of BSI 150 
Environment 

In vitro conservation (4—25 °C), ICAR 5 
Ministry of Agriculture 

Field gene bank, Ministry of ICAR, SAU 25 
Agriculture 

Cryopreservation (using liquid ICAR 2 


nitrogen), Ministry of Agriculture 


Table 23.4 Number of samples of PGR distributed over 
the last 10 years 


Year No. of samples 
1996 20,775 
1997 27,022 
1998 23,313 
1999 11,064 
2000 9,714 
2001 10,771 
2002 12,274 
2003 15,487 
2004 15,543 
2005 9,366 
2006 9,537 
Total 164,866 


e Development of core collections 

¢ Focused Identification of Germplasm Strategy 
(FIGS) 

e Pre-breeding 
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e Gene prospecting and allele mining for a trait 
of interest from genetic resources (Tables 23.6 
and 23.7) 

On 18 April 2013, the Commission on Genetic 
Resources for Food and Agriculture (CGRFA) of 
the United Nations Food and Agriculture 
Organization (FAO) endorsed and adopted the 
revision of the Genebank Standards, \ast pub- 
lished in 1994. The revised Genebank Standards 
take into account the changes in ex situ conserva- 
tion conditions, diversity in storage requirements 
and purpose and period of germplasm conserva- 
tion, ranging from temperate to tropical prove- 
nances. Field gene banking is the most commonly 
used method for nonorthodox seed-producing 
plants, for plants that produce very few. 

Seeds are vegetatively propagated and/or have 
a long life cycle, and their standards have been 
defined accordingly. The standards for in vitro 
culture and cryopreservation are broad and 
generic in nature, due to the marked variation 
among nonorthodox seeds and vegetatively prop- 
agated plants (Tyagi and Agrawal 2013). 


23.10 Economic Considerations 


The annual cost (in year 2000 US$) of conserv- 
ing and distributing the genetic material pres- 
ently held in all 11 CGIAR gene banks is 
estimated to be 5.7 million US$ (m US$), which 
could be maintained for all future generations by 
setting aside a fund of 149 m US$ (invested at a 
real rate of interest of 4 % per annum) (Virchow 
2003). This would be sufficient to underwrite the 
costs for the CGIAR’s current conservation activ- 
ities in perpetuity (estimated to be 61 m US$), as 
well as the cost of maintaining the distribution 
activities (88 m US$) that provide germplasm to 
breeders, scientists, farmers and others world- 
wide (Koo et al. 2003). 
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Table 23.5 Total accessions conserved ex situ and the number of safety duplicates for various categories of crop spe- 
cies (1996-2005) 


Sl. no. 


de toad Fae (aaa (a 


Table 23.6 Genetic diversity collection and utilisation 


% in 
Commodity Landraces collections 
Rice 140,000 90 
Maize 65,000 90 
Sorghum 45,000 80 
Millets 30,000 80 
Soybean 30,000 60 
Chickpea 22,230° 90 
Pigeon pea 8,220° 80 
Groundnut 6,374* 90 
Potato 30,000 95 


Modified after Evenson et al. (1998) 
‘ICRISAT and ICARDA holdings 


Status 

Traditional cultivar/landrace 
Wild 

Weedy 

Breeders line 
Mutant/genetic stock 
Advanced/improved cultivar 
Others 


No. of 
crop spp. No. of accessions 
280 121,274 
314 15,881 
70 267 
37 14,661 
26 7,898 
59 9,080 
73 27,662 
Wild In situ 
species collections 
20 Few 
- Few 
20 Few 
- None 
- None 
19 None 
57 Few 
45 None 
65 Few 


Safety duplicates 
as active Safety duplicates 
collections (%) 
84,931 70 
4,745 30 
1] 4 
2212 15 
4,880 62 
4,867 54 
2,378 9 
Ex situ 
collections Utilisation 
(acc.) distribution 
420,000 High 
277,000 High 
169,000 Low 
90,000 Low 
174,000 Low-medium 
BO;102° High 
13,628? Medium-high 
15,419 Medium-high 
31,000 High 
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Table 23.7 Status of base collections in the National 
Gene Bank of India [NBPGR, New Delhi (—18 °C)] (as of 
28 February 2014; Source: NBPGR website) 


Number of accessions 


Crop/crop group conserved 
Paddy 97,314 
Wheat 40,340 
Maize 9,639 
Others 12,276 
Cereals 159,569 
Sorghum 20,542 
Pearl millet 8,930 
Minor millet 22,614 
Others 5,437 
Millets and forages my gees) 
Amaranth 5,579 
Buckwheat 886 
Others 389 
Pseudo-cereals 6,554 
Chickpea 17,373 
Pigeon pea 11,432 
Mung bean 3,705 
Others 26,246 
Grain legumes 58,756 
Groundnut 14,937 
Brassica 10,759 
Safflower 8,081 
Others 24,700 
Oilseeds 58,477 
Cotton 7,329 
Jute 2,915 
Others 2,214 
Fibre crops 12,458 
Brinjal (eggplant) 4,119 
Chilli 2,012 
Others 19,199 
Vegetables 25,330 
Custard apple 59 
Papaya 23 
Others 448 
Fruits 530 
Opium poppy 350 
Ocimum 473 
Tobacco 1,490 
Others 4,559 
Medicinal and aromatic plants 6,872 
and narcotics 

Coriander 911 
Sowa (dill) 9] 
Others 2,845 


(continued) 
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Table 23.7 (continued) 


Number of accessions 


Crop/crop group conserved 
Spices and condiments 3,847 
Pongamia (oil tree) 395 
Others 2,048 
Agroforestry 2,443 
Lentil 7,712 
Pigeon pea 2,525 
Duplicate safety samples 10,235 
Total 402,894" 


No. of crop species conserved: 1,586 
‘The figure includes 4,305 released varieties and 2,300 
genetic stocks; regenerated accession not included 


23.11 Svalbard 


Svalbard Global Seed Vault: On 26 February 
2008, the Svalbard Global Seed Vault (SGSV) 
opened near Longyearbyen (Norway), 600 miles 
from the North Pole. SGSV is designed to hold 
4.5 billion batches of seeds of the world’s main 
crops. The SGSV is a glazed cave-like structure, 
drilled 500 ft below permafrost, in the middle of 
a frozen Arctic mountain topped with snow, with 
the goal to store and protect samples from every 
seed collection in the world, which will stay fro- 
zen. An automated digital monitoring system 
controls temperature and humidity and provides 
high security. The SGSV is an insurance against 
natural disasters, such as earthquakes and tsuna- 
mis, or deliberate attacks like bomb blasts or 
human errors such as nuclear disasters or failure 
of refrigeration that may erase the seeds of any 
important species in the other seed banks or in 
the wild, in the other countries. Such seed can be 
re-established using seeds from SGSV. NBPGR 
facilitated safe transfer of about 40,000 acces- 
sions of ICRISAT mandate crops to SSGV dur- 
ing the past 2 years. 

The Svalbard Global Seed Vault was estab- 
lished with the ‘objective to provide a safety net 
for the international conservation system of plant 
genetic resources, and to contribute to the secur- 
ing of the maximum amount of plant genetic 
diversity of importance to humanity for the long 
term in accordance with the latest scientific 
knowledge and most appropriate techniques’. 
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Ensuring that the genetic diversity of the world’s 
food crops is preserved for future generations is 
an important contribution towards the reduction 
of hunger and poverty in developing countries. 
This is where the greatest plant diversity origi- 
nates and where the need for food security and 
the further development of agriculture is most 
urgent. The Svalbard Global Seed Vault, which is 
established in the permafrost in the mountains of 
Svalbard, is designed to store duplicates of seeds 
from seed collections around the globe. Many of 
these collections are in developing countries. If 
seeds are lost, e.g. as a result of natural disasters, 
war or simply a lack of resources, the seed collec- 
tions may be re-established using seeds from 
Svalbard. The loss of biological diversity is cur- 
rently one of the greatest challenges facing the 
environment and sustainable development. The 
diversity of food crops is under constant pressure. 
The consequence could be an irreversible loss of 
the opportunity to grow crops adapted to climate 
change and new plant diseases, and the seeds of 
an expanding population of more than 200,000 
crop varieties from Asia, Africa, Latin America 
and the Middle East—drawn from vast seed col- 
lections maintained by the Consultative Group on 
International Agricultural Research (CGIAR)— 
will be shipped to a remote island near the Arctic 
Circle, where they will be stored in the Svalbard 
Global Seed Vault (SGSV), a facility capable of 
preserving their vitality for thousands of years. 


23.12 Conservation Benefits 


The value of genetically coded information can 
never be determined a priori but rather only from 
a posterior observation as a result of their success 
on the market. To assign monetary value to the 
benefits generated by PGR conserved in ex situ 
collections is exceedingly difficult because they 
have multiple dimensions. The use of germplasm 
in plant breeding leads to changes in crop output 
and breeder’s requests represent the demand for 
germplasm in terms of crop in production and 
trade. Little information is available on the germ- 
plasm movement. Only some gene banks and 
national programmes have drawn up some statis- 
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tics. Over the past 3 years, for instance, the 
CGIAR centres have distributed an annual aver- 
age of over 50,000 accessions to national pro- 
grammes all over the world (SGRP 1996). 
Similarly between 1992 and 1994, the Unites 
States distributed over 100,000 samples each year 
(FAO 1996). In developing countries, 34 % of all 
accessions are stored in public gene banks, 49 % 
in developed countries, | % private companies 
and 0.2 % local conservators (Iwanaga 1993). In 
India, Ethiopia and China, the expected value for 
PGRFA conservation is assessed as being very 
high (Virchow 1999). The benefits of conserva- 
tion must be analysed more in depth to guide the 
conservation investment on national and interna- 
tional levels. It is necessary to define the conser- 
vation objectives for adjusting these to some 
cost-efficiency criteria. The conservation invest- 
ments must follow the criteria of cost efficiency. 
Objectives must be quantified. In the accessibility 
of germplasm for present and future use, the link 
between conservation and utilisation and distribu- 
tion and characterisation has to be strengthened 
(http://cropgenebank.serp.cgiar.org/). 
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Abstract 

Endemic and threatened plant species are a vital component of plant bio- 
diversity which require immediate human intervention to ensure their 
long-term survival. Natural and anthropogenic factors are putting 1n jeop- 
ardy such vulnerable plant groups in India, which on the one hand is one 
of the mega-diverse nations in terms of plant diversity while on the other 
hand is one of the most populous nations of the world with a rapidly grow- 
ing economy. Strategies for conservation of plant diversity include in situ 
approaches in the form of protected areas such as national parks, biosphere 
reserves and gene sanctuaries and ex situ approaches such as seed gene- 
banks, field genebanks and in vitro conservation. A judicious integration 
of diverse conservation approaches can not only help in rescuing plant 
species in danger but also in widening our knowledge base of these species 
which can in turn pave way for their sustainable use for the benefit of man- 
kind. The present chapter reviews the current status of endemic and threat- 
ened plant species in India, followed by the various conservation strategies 
which are being or which can be employed for the conservation of such 
species in India along with the future perspectives in this field. 
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24.1 Introduction 

The single most important factor that makes our 
planet unique is perhaps the existence of life. The 
sum total of the multitudes of forms in which life 
exists 1s known as biodiversity. From the point of 
its origin, around three to four billion years ago, 
life has been shaped by the dynamic process of 
evolution to form the diversity in living organ- 
isms we see today (Dawkins 2005). The balance 
between speciation and extinction dictates this 
diversity. Planet earth has undergone five periods 
of mass extinctions due to a variety of reasons. It 
is suggested that today, we are in the middle of 
the sixth period which is predominantly driven 
by anthropogenic reasons including climate 
change, unlike the previous five (Barnosky et al. 
2011; Root et al. 2003). High extinction rates 
coupled with the fact that only 86-91 % of the 
estimated 8.7 million species on earth have been 
described lead to the conclusion that many spe- 
cies will go extinct before we have a chance to 
describe them (Mora et al. 2011). With a goal of 
stemming this biodiversity loss through conser- 
vation and policy interventions, the United 
Nations has declared 2010-2020 as the decade 
on biodiversity (CBD c2014a). 

Conservation in any form involves consider- 
able costs, so prioritization of conservation 
efforts is of prime importance. The identification 
of priority areas and species for conservation is 
undertaken on the basis of extent of biodiversity, 
level of past, present of future threat, species 
richness, percentage endemism and potential end 
use (Plucknett et al. 1983; Wilson et al. 2006). 
Among the 298,000 estimated plant species on 
earth (Mora et al. 2011), around 7,000 have been 
grown for human consumption in human history, 
and presently, we depend on only 30 species for 
95 % of food energy needs (FAO c2014). 
Considerable success has been achieved in the 
conservation of diversity of such species of 
immediate utility particularly under ex situ con- 
ditions. Most of the accessions held in the gene- 
banks around the world such as the National 
Genebank (NGB) in National Bureau of Plant 
Genetic Resources (NBPGR), New Delhi, are of 
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such species. The conservation of the vast major- 
ity of other plant species has been largely limited 
to in situ conservation in protected areas, even 
though specialized genebanks such as Kew’s 
Millennium Seed Bank have achieved consider- 
able success in ex situ conservation. 

In the present chapter, the status and manage- 
ment efforts for plant species of prime concern in 
India, one of the richest nations in the world with 
respect to floral diversity, are discussed with 
emphasis on management of endemic and threat- 
ened taxa. 


24.1.1 India: A Hub of Biodiversity 


India, the seventh largest country in the world, is 
considered as one of the most biodiverse regions 
of the planet. The multifarious combinations of 
topography, geology and climate have led to the 
evolution of highly diverse life forms in the coun- 
try. This rich biological heritage has led to the 
recognition of India as one of the elite mega- 
diverse nations of the world, a group of 17 coun- 
tries estimated to hold ~70 % of the Earth’s 
biodiversity (Mittermeier et al. 1997). Occupying 
only 2.4 % of world’s land area, India harbours 
7-8 % of recorded species of the world which 
includes 7.43 and 11 % of global faunal and floral 
diversity, respectively (Goyal and Arora 2009). 


24.1.2 Floristic Diversity of India 


India is bestowed with a huge diversity has made 
in the plant species. Hooker and Thompson 
(1855) in one of the earliest attempts and to 
inventorize the Indian floral diversity had 
observed the presence of representatives of 
almost every family of the world. In terms of 
plant diversity, India ranks tenth in the world and 
fourth in Asia. There are over 45,500 recorded 
floral species in India, including 17,527 angio- 
sperms, 67 gymnosperms, 1,200 pteridophytes, 
2,00 bryophytes, 2,223 lichens, 14,500 fungi, 
7,175 algae and 850 virus/bacteria. 7 % of world’s 
documented flowering plants are from India with 
the major families being Poaceae, Orchidaceae, 


24 Conservation and Management of Endemic and Threatened Plant Species in India: An Overview 


Leguminosae, Asteraceae, Rubiaceae, 
Cyperaceae, Euphorbiaceae and Acanthaceae 
(Goyal and Arora 2009). Four of the 234 global 
centres of plant diversity exist in India (UNEP- 
WCMC 2013). Furthermore, India is one of the 
Vavilovian centres of origin and diversity of cul- 
tivated plants possessing 811 cultivated crop spe- 
cies and 902 crop wild relatives (Singh et al. 
2013). 


24.1.3 Endemism and Threat Status 
of Indian Flora 


A taxonomic category entirely restricted to a 
given biogeographical unit 1s referred to as an 
endemic. Such a restricted range is enforced 
through a combination of geographical factors 
such as unique edaphic, geological, ecological 
and climatic conditions and biological factors 
such as reduced seed production, reduced polli- 
nation, limited dispersal and interspecies interac- 
tions (Hobohm et al. 2014). 

The migration of the Indian plate northward 
from Gondwanaland to Eurasia has left their 
mark, and the present-day Indian flora is an amal- 
gamation of several global genera (Axelrod 
1971). Notwithstanding the unique geological 
history, the relatively recent isolation of Indian 
flora owing to the Himalayas in the north, hot and 
arid desert in the west and the Arabian Sea, Bay 
of Bengal and the Indian Ocean in the south has 
aided the evolution of several endemic taxa 
(Nayar 1996). The diverse ecosystems and habi- 
tats of India have made it a home of several 
endemic plant species. Species endemism is one 
of the criteria for defining biodiversity hotspots 
for setting conservation priorities. The geograph- 
ical boundaries of India overlap with 4 of the 34 
such biodiversity hotspots identified around the 
world, viz., the Himalaya, Indo-Burma, the 
Western Ghats and Sri Lanka and Sundaland 
(Mittermeier et al. 2004). Further, Nayar (1996) 
has identified three mega-centres of endemism 
and 27 micro-endemic centres in India (Fig. 24.1). 

Around 33.5 % of Indian flora have been iden- 
tified as endemic and are distributed predomi- 
nantly in the Indian Himalaya, Peninsular India 
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and the Andaman and Nicobar Islands (Nayar 
1996). Even though several of the 20,074 India 
angiosperm species are endemic, only 49 endemic 
genera and no angiosperm family strictly endemic 
to India have been identified (Irwin and 
Narasimhan 2011). Among the 101 species of 
gymnosperms existing within India, seven are 
endemic (Srivastava 2006). Even though not suf- 
ficiently surveyed in comparison to higher plants, 
49 pteridophyte taxa out of 900—1,000 and 340 
bryophyte taxa out of 2,489 are endemic to India 
(Chadha et al. 2008; Dandotiya et al. 2011). 

This rich and unique floral diversity of India is 
extensively under threat as India is also one of the 
most populous countries in the world with a fast- 
growing economy. Furthermore, the very nature 
of endemics makes them unusually vulnerable to 
extinction (Myers 2003). Global assessment of 
species threat status to highlight species threat- 
ened with extinction and prioritize conservation 
efforts is carried out globally by the International 
Union for Conservation of Nature (IUCN) since 
1966 in the form of red lists (The IUCN Red List 
of Threatened Species, Version 2013.2 2013). 
The IUCN Red List 2013.2 lists 5,512 Indian 
angiosperm species. Most of the assessed species 
(4,511 species) are of least concern status (LC) 
unlike the world status of assessed plants 
(Fig. 24.2). However, 137 are not threatened 
(NT), 242 are vulnerable (VU), 232 are endan- 
gered (EN) and 77 are critically endangered 
(CR). Six species have been declared extinct 
(EX) and one species is extinct in the wild (EW) 
(Fig. 24.2). The state-wise distribution of these 
assessed angiosperm taxa shows their non- 
uniform distribution according to the threat status 
across India (Fig. 24.3). 

In India, in addition, national level assess- 
ments have been undertaken by the Botanical 
Survey of India (BSI) leading to the publishing of 
three volumes of the Red Data Book of Indian 
Plants following the pre-1994 IUCN Red data 
book threat status categories. Assessment was 
undertaken for 940 taxa, of which 99 are indeter- 
minate (I), 6 are indeterminate/potentially extinct, 
397 are rare (R), 178 are vulnerable (V), 32 are 
endangered or potentially extinct (E/PEx), 175 
are endangered (E) and 47 are extinct or poten- 
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Fig. 24.1 Regions of high plant endemism in India—the geographic zones of India (Rodgers et al. 2000). 
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Fig.24.2 Proportion of living plant species from India and the world under different threat categories (The IUCN Red 
List of Threatened Species, Version 2013.2 2013) 
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Fig. 24.3 State-wise distribution of threatened angiosperm flora in India (The IUCN Red List of Threatened Species, 
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tially extinct (Ex/PEx) (Nayar and Sastry 1987, 
1988, 1990). 

However, these assessments of endemism and 
threat are far from adequate, as only a tiny frac- 
tion of Indian flora have been systematically 
investigated for threat status. Furthermore, the 
state-wise distributions of threatened angiosperm 
flora in Fig. 24.3 may partially be a reflection of 
the regional differences in botanical research 
intensity. Similar differences are also revealed 
when the number of assessed and threatened 
plant species in different angiosperm families are 
compared (Fig. 24.4) as several major Indian 
families are underrepresented. Other plant groups 
such as pteridophytes and bryophytes are also 
grossly underrepresented in such assessments. 


For the current chapter, we have largely fol- 
lowed the assessments of Red Data Book of 
Indian Plants (Nayar and Sastry 1987, 1988, 
1990) and The IUCN Red List of Threatened 
Species, Version 2013.2 (2013) in describing the 
threat status of species. 


24.2 Conservation Strategies 


Germplasm conservation is defined as the man- 
agement and use of biosphere resources which 
may yield the sustainable benefit to present gen- 
erations while maintaining its potential to meet 
the needs of future generations (IUCN/UNEP/ 
WWE 1980). Anthropogenic forces have been 
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Fig. 24.4 The top ten (a) assessed and (b) threatened 
Indian angiosperm families according to the number of 
species (The IUCN Red List of Threatened Species, 


largely detrimental to the survival of a great 
number of plant species including several unique 
and important taxa. The conservation of these 
plant species on priority is the need of the hour. 
The conservation of plant genetic resources has 
long been realized as an integral part of biodiver- 
sity conservation, and the systematic efforts 
towards its comprehensive execution are gather- 
ing pace albeit under several limitations such as 
limited information and resources and the lack of 
adequate guidelines for implementing the new 
and evolving policy requirements under the 
framework of the Convention on Biological 
Diversity (CBD). CBD is the umbrella multilat- 
eral treaty under the aegis of the United Nations 
for ensuring conservation, sustainable use and 
equitable sharing of benefits of biodiversity 
around the world (UNEP 1992). 
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Version 2013.2 2013); numbers in parentheses indicate 
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Conservation strategies broadly involve two 
approaches — in situ and ex situ based on the site 
of implementation. They are complementary to 
each other and can be judiciously integrated 
based on the species to be conserved and whether 
medium- or long-term preservation is required. 
Conservation methods to be employed depend on 
the objective of conservation, the breeding sys- 
tem of species, its threat status, seed physiology 
and storage behaviour and the available resources 
for conservation. /n situ conservation involves 
maintenance of genetic resources within the 
existing natural habitat of the species, which is 
highly susceptible to natural calamities, whereas 
ex situ conservation involves conservation out- 
side the natural environment of the species and is 
used to safeguard populations of species which 
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are in immediate danger of destruction or as a 
safety duplicate of the existing in situ collection. 


24.2.1 In Situ Conservation 


In situ conservation 1s the management of species 
within their natural ecosystems and _ habitats. 
Viable populations of the target species are main- 
tained in their natural surroundings with minimal 
human intervention (UNEP 1992). Generally to 
achieve this, whole forest ecosystems and the 
containing species are preserved in protected 
areas through managed use (Iwanaga 1995). This 
is particularly preferred for those taxa which are 
less amenable to external disturbances. Moreover, 
for several species with highly specialized adap- 
tations, it is the only viable conservation strategy 
available. Apart from being the most convenient 
and economical conservation approach, it allows 
the dynamic forces of evolution to continue to act 
on the natural populations enabling them to con- 
tinue to adapt to the prevailing conditions 
(Saunier and Meganck 1995). Furthermore, as 
habitats and ecosystems as a whole are protected, 
aside from the target species, several associated 
species are simultaneously conserved in the natu- 
ral environment. 

In situ conservation is generally practised for 
forestry species, wild relatives of crop species, 
medicinal and aromatic plants and the species 
that are under various levels of threat of extinc- 
tion, because they cannot be grown outside their 
habitat or they are the members of complex eco- 
systems, like tropical forest trees, have high dor- 
mancy or possess highly specialized breeding 
system. 

Depending on the ecological hierarchy tar- 
geted for conservation, in situ conservation 
approaches are of species, habitat or ecosystem 
kind. 


24.2.1.1 Species Approach 

The conservation efforts are concentrated towards 
a single species in this approach with the inten- 
tion that along with it, the associated species in 
the ecosystem are conserved. These may be ‘key- 
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stone’ species, may be those which have a crucial 
role in the ecosystem and may be whose disap- 
pearance may lead to the collapse of the same. 
They can also be ‘umbrella species’ with large 
home ranges for survival of a breeding popula- 
tion, whose protection ensures that a huge area 
with all the concomitant species is protected. 
Moreover, to engage the society in conservation 
efforts, certain threatened ‘flagship species’ with 
features that appeal to the general public are used 
as rallying points (Caro 2010). 


24.2.1.2 Habitat Approach 

Conservation efforts in this approach are focused 
towards the maintenance of the quality of strate- 
gic habitats on a landscape scale in an ecosystem 
which harbour multiple species. As the habitats 
and the dynamic ecological processes existing 
within are conserved as a whole, thriving popula- 
tions of multiple interacting species are con- 
served. Such protected areas include managed 
forests, sacred groves, nature reserves, preserva- 
tion plots, wildlife sanctuaries and national parks 
(Griffiths and Vogiatzakis 2011). 


24.2.1.3 Ecosystem Approach 

This approach has become the foundation of con- 
servation efforts under CBD. It is a strategy for 
the integrated management of land, water and liv- 
ing resources that promotes conservation and 
sustainable use in an equitable way. In addition to 
the different processes, functions and interactions 
among different organisms and environment and 
the integral role of humans in ecosystems are 
acknowledged under this. Biosphere reserves of 
the United Nations Educational, Scientific and 
Cultural Organization-World Network of 
Biosphere Reserves (UNESCO-WNBR) come 
under this approach. Initiated under UNESCO’s 
Man and Biosphere (MAB) programme, a bio- 
sphere reserve involves the conservation of vari- 
ous representative ecosystems of earth which are 
large enough to ensure self-perpetuation and evo- 
lution of organisms. It also envisages national 
and international cooperation for harmonious 
interaction between people and environment 
(Arora and Paroda 1991). 
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24.2.1.4 Protected Areas (PAs) 
in India 

The term ‘protected area’ is defined in the 
Convention on Biological Diversity (Article 2) as 
‘a geographically defined area, which is 
designated or regulated and managed to achieve 
specific conservation objectives’. Protected areas 
(PAs) are the important strength of biodiversity 
conservation as they maintain key habitats, pro- 
vide refugia, allow for species migration and 
movement and ensure the maintenance of natural 
processes across the landscape. They not only 
ensure biodiversity conservation but also secure 
the well-being of humanity itself. PAs provide 
livelihoods for nearly 1.1 billion people, are the 
primary source of drinking water for over a third 
of the world’s largest cities and are a major factor 
in ensuring global food security. PAs, when suit- 
ably managed, are recognized as the key to both 
mitigation and adaptation responses to climate 
change. They are set aside for the protection of 
biological diversity along with the natural and 
associated cultural resources and are managed 
through legal or other effective means (CBD 
c2014b). They include national parks and nature 
reserves, sustainable use reserves, wilderness 
areas and heritage sites (Chape et al. 2008). For 
the effective conservation in PAs, IUCN has cat- 
egorized them according to management 
objectives. 

In India, a network of 668 PAs has been estab- 
lished comprising of 102 national parks, 515 
wildlife sanctuaries, 47 conservation reserves 
and 4 community reserves (MoEF c201]4a) 
(Fig. 24.5). Generally, PAs are constituted and 
governed under the provisions of the Wild Life 
(Protection) Act, 1972, enforced through the 
Ministry of Environment and Forests. The act has 
been amended several times according to the 
changing requirement of PAs management. 

Moreover, in India, two important land mam- 
mals, the tiger and the Asiatic elephant, have 
been used as flagship species for conservation, 
and today, there are 43 tiger reserves and 26 ele- 
phant reserves with a cumulative protecting area 
of over 60,000 km? for each species. In addition, 
sanctuaries in Tura range in Garo Hills of 
Meghalaya for the conservation of Citrus and 
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Musa species and for Rhododendrons and orchids 
in Sikkim have been established. 

With an objective of an ecosystem-level con- 
servation, the government of India has estab- 
lished 18 biosphere reserves (MoEF c2014b), 
which protect large land and/or coastal environ- 
ments and often include several protected areas 
within. People being an integral component of 
the system, their rights are also protected to 
ensure sustainable economic use to a limited 
extent. Nine of these are part of UNESCO- 
WNBR international network of biosphere 
reserves. 

Being party to several major international con- 
ventions such as the Convention on [International 
Trade in Endangered Species of wild fauna and 
flora (CITES), the Convention on Biodiversity 
(CBD), the International Union for Conservation 
of Nature (IUCN), the International Convention 
for the Regulation of Whaling, the UNESCO 
World Heritage Convention, the Ramsar 
Convention on Wetlands, the United Nations 
Framework Convention on Climate Change 
(UNFCCC), the United Nations Convention to 
Combat Desertification (UNCCD), the United 
Nations Commission on Sustainable Development 
(UNCSD), the International Treaty on Plant 
Genetic Resources (ITPGR), the United Nations 
Convention on the Law of the Sea (UNCLOS) 
and the Convention on Migratory Species (CMS), 
the biodiversity protection efforts in India are in 
tune with the global goals of conservation. 

In addition to these official efforts, there are 
several non-governmental organizations (NGOs) 
involved in the conservation in India (WWE India 
2008), for example, in situ conservation of 
medicinal plants by the NGO Foundation for 
Revitalisation of Local Health Traditions 
(FRLHT) in medicinal plant conservation areas 
established in the South Indian states (FRLHT 
2006). 


24.2.2 Ex Situ Conservation 
Efforts for ex situ conservation of plant biodiver- 


sity have so far mainly focused on cultivated crop 
species. Such activities for wild, rare and endan- 
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Fig.24.5 Protected areas in India (Data sourced from The World Database on Protected Areas (WDPA) (2010)) 


gered species have largely been modest in com- 
parison which is a key issue of concern today. 
After inclusion in Article 9 of CBD, ex situ con- 
servation has gained greater focus globally 
(Glowka et al. 1994). At the same time, the 
world’s biodiversity is declining at an alarming 
rate (Sarasan et al. 2006). During the period 
1996-2004, a total of 8,321 plant species have 
been added to the Red List of Threatened Species 
(Baillie et al. 2004), and the number of plants 
recorded as critically endangered has increased 
by 60 %. In the case of wild species, the tradi- 
tional conservation approach is in situ; however, 
itis now recognized that ex situ techniques can be 
efficiently used to complement in situ methods, 
and they may represent the only option for con- 


serving certain highly endangered and rare spe- 
cies (Ramsay et al. 2000). The different ex situ 
conservation approaches are: 

e Field Genebank 

e Seed Genebank 

¢ In Vitro Genebank 

e Cryo-Genebank 

e Pollen bank 

e DNA Bank 


24.2.2.1 Field Genebank 

Field genebanks provide an easy and ready access 
to the plant genetic resources for characteriza- 
tion, evaluation or utilization, while the same 
material conserved in seed in vitro or cryo- 
genebanks must be germinated or regenerated 
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and grown before it can be used. Field genebanks, 
however, are generally more expensive to main- 
tain, requiring more labour, more inputs and 
more space (land) than other methods of conser- 
vation. Moreover, they also have higher levels of 
risk from natural disasters and adverse environ- 
mental conditions like drought, floods or attacks 
from pests and diseases. They include botanical 
gardens, arboreta, herbal gardens and clonal 
repositories. 


24.2.2.1.1 Botanical Gardens 

Botanical gardens hold adequate and accurately 
identified samples of plants as living collections, 
including collections from arboreta, other field 
genebanks, seed genebanks and in vitro propa- 
gated material. In addition, they possess both 
propagation facilities and the scientific and tech- 
nical skills for species recovery and establish- 
ment. UNEP (1995) has estimated that botanical 
gardens conserve more than one third of the 
world’s flowering plants. Such botanical gardens 
present worldwide have been classified based on 
their roles by Wyse Jackson and Sutherland 
(2000) (Table 24.1). 

Botanical gardens play a vital role in ex situ 
conservation and protection of endemic and 
threatened plants. Bringing plants into cultivation 
is the primary initiative towards saving them 
from extinction followed by ensuring their sur- 
vival in suitable protected sites in the wild 
(Heywood and Jackson 1991). The International 
Union for Conservation of Nature and Natural 
Resources (IUCN) has greatly emphasized the 
need of developing botanical gardens into a 
global site for ex situ conservation of plants 
(Bramwell et al. 1987). The network of botanical 
gardens in every country plays a significant role 
in ex situ conservation of plant genetic resources 
(Nayar 1987). In India, there are over 140 botani- 
cal gardens, including those attached to univer- 
sity botany departments, public parks and 
agri-horticultural gardens (Chakraverty and 
Mukhopadhyay 1990). Over 38 botanical gar- 
dens conserve around 246 rare and threatened 
plants of India as living plant collections, and 
among these, 32 species are catalogued in the 
Indian Red Data Books (Chakraverty et al. 2003). 
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A few such examples are given in Table 24.2. 
Furthermore, there are several plants species of 
botanical interest such as Rafflesia, several 
orchids, Bromeliads, Nepenthes and other insec- 
tivorous taxa and aquatic species including 
Victoria amazonica, Nymphaea _ gigantea, 
Nelumbo nucifera and Euryale ferox which can 
only be protected in botanical gardens under ex 
situ conditions. 


24.2.2.1.2 Arboreta 

Arboreta are special spaces set apart for the culti- 
vation and display of a wide variety of different 
kinds of trees and shrubs. Many tree collections 
have been established as arboreta. Arboreta differ 
from pieces of woodland or plantations because 
they are botanically significant collections with a 
variety of species. For example, in India, the 
Ooty arboretum has both indigenous and exotic 
tree species such as Hypericum hookerianum and 
Alnus nepalensis. In addition, the Regional Plant 
Resource Centre (RPRC), Bhubaneswar, Odisha, 
has an arboretum with 1,430 species of trees. 


24.2.2.1.3 Herbal Gardens 

There are about 71 herbal gardens with 4,024 
species in India which have been established with 
the objective of conservation of local biodiversity 
in medicinal and aromatic plants (Rao and Das 
2011). These are basically maintained to 
strengthen ex situ conservation of medicinal and 
aromatic plants and provide access to quality 
planting material for utilization. 


24.2.2.1.4 Clonal Repositories 

Plant genetic resources of different fruit plants/ 
tree species and many ornamentals are main- 
tained by vegetative propagation in order to keep 
their genetic makeup true to the type. In the case 
of fruit germplasm, usually, conservation is done 
in such field genebanks which are also known as 
varietal collection or living collection. A clonal 
repository is a facility where clonal materials are 
conserved as living collections in a field, orchard 
or plantation. Here, the plant genetic resources 
are kept as live plants that undergo continuous 
growth and require continuous maintenance. 
They are often used when the germplasm is either 
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Table 24.1 Types of botanical gardens (Wyse Jackson and Sutherland 2000) 


Type 
‘Classic’ multipurpose gardens 


Ornamental gardens 


Historical gardens 


Conservation gardens 


University gardens 


Combined botanical and zoological 
gardens 


Agro-botanical and germplasm gardens 


Alpine or mountain gardens 


Natural or wild gardens 


Horticultural gardens 


Thematic gardens 


Community gardens 


difficult or impossible to conserve as seeds (i.e. 
when no seeds are formed, seeds are recalcitrant 
or seed production takes many years, as for many 
plant 


tree species) the 


vegetatively. 


Or 


Description 


These institutions have a broad range of activities in horticulture and 
related training; research, particularly in taxonomy with associated herbaria 
and laboratories and public education and amenity 


These are establishments with diverse plant collections that are documented 
They may or may not have any role in research, education or conservation 


They are earliest gardens developed for the teaching of medicine and for 
religious purpose. A number of these gardens are still active in medicinal 
plant conservation and research and today are primarily concerned with the 
collection and cultivation of medicinal plants and increasing public 
awareness about them 


They are developed in response to local needs for plant conservation. Some 
have associated areas of natural vegetation in addition to their cultivated 
collections. Included in this category are native plant gardens. Most 
conservation gardens play a role in public education 


Many universities maintain botanical gardens for teaching and research. 
Many are open to the public 


These are currently reassessing the roles of their botanical collections. 
Plants collections are being researched and developed that provide habitats 
for the displayed fauna, and interpretation of these habitats to the general 
public is an important element 


These function as an ex situ collection of plants of economic value or 
potential for conservation, research, plant breeding and agriculture. Several 
are experimental stations associated with agricultural or forestry institutes 
and contain associated laboratories, plant breeding and seed testing 
facilities, but many are not open to the public 


These are most frequently in mountain regions of Europe and some tropical 
countries. They are specifically designed for the cultivation of mountain 
and alpine flora or in the case of tropical countries for the cultivation of 
subtropical or temperate flora. Some alpine and mountain gardens are 
satellite gardens of larger lowland botanical gardens 


Contain an area or natural or seminatural vegetation, which is protected and 
managed. Most are established to play conservation and public education 
roles and include areas where native plants are grown 


These are often owned and maintained by horticultural societies and open 
to public. They exist primarily to foster the development of horticulture 
through the training of professional gardeners and are involved in plant 
breeding, registration and conservation of garden plant varieties 

These specialize in growing a limited range of related or morphologically 
similar plants or plants grown to illustrate a particular theme generally in 
support of education, science, conservation and public display. These 
include orchids, rose, Rhododendron, bamboo and succulent gardens or 
gardens established on such themes as ethnobotany, medicine, bonsai, 
butterfly gardens, carnivorous plants and aquatic plants 


These are generally small gardens with limited resources, developed by a 
local community to fulfil its particular needs, such as recreation, education, 
conservation, horticultural training and the growth of medicinal and other 
economic plants 


24.2.2.2 Seed Genebank 

A large number of plant species particularly crop 
species have seeds with orthodox seed storage 
behaviour which can be dried down to low mois- 
ture content (3-7 %) and thus can be stored at 


reproduces 
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Table 24.2 Some important rare and endangered species maintained at botanical gardens in India (BGCI c2014) 


Sl. no. Botanical/Latin name Family 

1. Adhatoda beddomei Acanthaceae 
2 Bentinckia condapanna Arecaceae 
ee Cycas beddomei Cycadaceae 
4. Eremostachys superba Lamiaceae 
5. Frerea indica 

6. Peucedanum dehradunensis Apiaceae 

ie Sophora mollis Fabaceae 

8. Trachycarpus takil Arecaceae 


subzero temperature for a long term (Roberts 
1973). Since, handling of dry seeds is much eas- 
ier over other plant materials and they can remain 
viable for long periods under controlled condi- 
tions, the conservation of orthodox seeds is con- 
sidered to be the cheapest and safest method for 
ex situ conservation. Today, there are about 7.4 
million accessions, or samples of a particular 
population, stored as seeds in about 1,750 gene- 
banks throughout the world (FAO 2010). 

The seeds of intermediate and recalcitrant 
seeds cannot be conventionally stored for a lon- 
ger time. Seeds of intermediate category such as 
papaya, gooseberry, oil palm, etc. can be dehy- 
drated to low moisture levels but are sensitive to 
freezing, so can be desiccated and stored only for 
a medium term (Ellis 1991), whereas recalcitrant 
seeds which are intolerant to both desiccation 
and freezing can be stored only for a short term, 
few days to few weeks. Cryopreservation is the 
only option wherever the standardized protocols 
are available in recalcitrant seeds. 

Seeds can be stored under different storage 
conditions depending on seed types and time 
period to be stored. Seeds can be shade dried and 
stored up to 2 years in sealed plastic containers, 


Conservation status 


Rare 
Rare 
Critically endangered 


Critically endangered 


Asclepiadaceae Critically endangered 


Critically endangered 


Endangered 


Critically endangered 


Method of conservation (remarks) 


Seedlings established in botanical 
garden 


Seedlings transplanted and 
establishment in botanical garden 


Saplings acclimatized and established 
as seedlings in botanical garden 


Seedlings from its origin transplanted 
in botanical garden (poor seed viability 
was observed) 


Seedlings from its origin multiplied by 
in vitro propagation and established as 
seedlings in botanical garden 


Saplings established from seeds 
collected from natural habitat in 
botanical garden (poor seed 
germination was observed) 


Seedlings established in botanical 
garden 


Saplings established from seeds 
collected from natural habitat in 
botanical garden (good seed 
germination was observed) 


paper bags or muslin cloth bags at 18—20 °C and 
45 % RH for short term, while for medium term, 
the seeds can be shade dried and stored up to 
5 years in cloth bags, metal cans and plastic jars 
at 4-10 °C and 35 % RH, and for long term, seeds 
can be dried and stored in sealed aluminium foil 
pouches at —18 °C with 6-8 % moisture content 
on fresh weight basis. At NBPGR, New Delhi, 
active collections are kept at medium-term stor- 
age which are immediately available for distribu- 
tion, utilization and multiplication, while base 
collections are stored under long-term conditions 
for posterity. The minimum standards followed 
for long-term seed storage in NGB are untreated 
seeds, seeds with maximum viability >85 % min- 
imum 2,000 seeds in self-pollinated and 4,000 in 
cross-pollinated species, seed moisture content 
3-7 %, seed storage temperature —18 °C for long- 
term and +4 °C for medium-term storage and 
monitoring of viability — after 10 years in long- 
term and 5 years in medium-term storage. 

The operational sequence to integrate an 
accession into the genebank involves cleaning, 
seed health testing, determination of moisture 
content, drying, viability testing and packaging. 
The management of seed collections requires that 
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germplasm accessions be maintained with a high 
proportion of viable seeds, and this involves stor- 
age under appropriate conditions, periodic moni- 
toring of seeds for viability and quantity and their 
regeneration as and when required. 


24.2.2.2.1 Storage of Orthodox Seeds 

The methodology followed for processing of 
orthodox seeds for long-term conservation in the 
NGB at NBPGR, New Delhi, is as follows. In 
case of species with known seed storage behav- 
iour, the corresponding standard protocols for 
viability testing are followed, while for species of 
unknown category, the seed storage behaviour is 
predicted based on the seed size, family it belongs 
to, reproductive biology, desiccation and freezing 
sensitivity, etc. (Hong and Ellis 1996, Fig. 24.6). 
Similarly, the germination protocols for viability 
testing are worked out based on the seed size and 
ISTA (2003) rules. 


Receipt, verification of samples and seed 
health testing Seed cleaning is done by removal 
of debris, inert material, damaged and infested or 


473 


infected seeds and seeds of other species (e.g. 
weeds) to obtain clean and pure samples of seeds 
of high physiological quality for storage. To 
achieve the pest-free status of the germplasm, 
visual examination and X-ray radiography are 
done to detect infection and/or infestation, and 
those samples which may be salvaged without 
compromising the seed health status are passed 
on for conservation in NGB wherever possible. 


Moisture testing and accessioning For estima- 
tion of moisture content, the low constant oven 
drying method for oily seeds and high constant 
oven drying method for the other species are 
used. The moisture content is expressed as a per- 
centage of the weight of the original sample. 
Qualified seed samples are given a national 
unique identity — exotic code (EC) numbers or 
indigenous code (IC) numbers. 


Drying and viability testing Seeds are dried in 
specially designed walk-in seed dryers main- 
tained at 15 % RH and 15 °C temperature. The 
initial seed viability test is conducted after drying 


Determine seed moisture and viability 


to 18-30% moisture content 


Most seeds die 


RECALCITRANT 


Viability test 


INTERMEDIATE 


Most seeds survive 


Dry to 6-8% moisture 
content 


Viability test 


Most seeds survive 


Stored at -20°C for 
3 months 


Viability tests 


All / almost seeds 
survive 


ORTHODOX 


Determine optimum Determine optimum 
conditions for moist storage air storage conditions 


Fig. 24.6 Protocol followed to determine the seed storage behaviour (Hong and Ellis 1996) 
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of the seed sample and before packaging (ISTA 
2003). The initial germination value should 
exceed 85 % for most seeds of cultivated field 
crop species. Exceptions may be granted for seed 
of endemic and threatened plant species. 


Seed packaging and storage After drying 
(desired moisture content of 3—7 %) and viability 
testing (>85 %), the seed samples are packed in 
tri-layered aluminium foil packets (outer polyester 
layer, 12 um; middle aluminium foil layer, 12 um; 
inner polythene layer, 250 gauge), which have 
been found most effective for long-term conser- 
vation of all types of seeds. Packaging is done in 
an air-conditioned room with controlled humid- 
ity (preferably 15 % RH). The passport data 
along with the national identity is documented. 
The minimum data is printed on the labels which 
are pasted on the respective seed packets. The 
processed accessions are arranged in plastic 
baskets serially and placed in cold storage vault 
at —18 °C at a defined location in the genebank. 


Germplasm documentation Correct and reli- 
able recording of data, its documentation and 
information transfer are as important as proper 
handling of germplasm itself. Correct and proper 
documentation allows us to retrieve and utilize 
the information effectively. Being the most 
important component of NGB, it is accomplished 
efficiently through a well-managed computer 
network. 

Today, the National Genebank holds a total of 
4,00,098 accessions of different agri-horticultural 
crops of various crop groups such as cereals, 
grain legumes, millets, oilseeds, vegetables, fibre 
and forages, medicinal and aromatic, spices, etc. 
There are ten modules running at a temperature 
of —18 °C for a long-term storage of seeds and 
two modules maintained at medium-term storage 
conditions for storage of active collections 
(Fig. 24.6). About 60 accessions belonging to 32 
families with orthodox seed storage behaviour 
holding the status under endemic and threatened 
category are available in the National Genebank 
which are listed in Table 24.3. All these species 
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are regularly monitored at an interval of 10 years 
for seed quality, health and quantity. NBPGR is 
strengthened with ten regional stations in differ- 
ent agro-climatic regions of India and 59 crop- 
based National Active Germplasm (NAG) sites 
for timely regeneration of the required germ- 
plasm and its utilization. 


24.2.2.2.2 Ultra-Desiccation: A Cost- 
Effective Conservation Strategy 

Ultradry storage technique is a ‘low-input’ alter- 
native method to the conventional cold storage of 
seed and allows preservation at room tempera- 
ture. It is considered a useful low-cost option 
where no adequate refrigeration is required 
(Theilade et al. 2004). Research on various 
aspects of ultradry seed storage, including drying 
techniques such as sun/shade drying (Probert and 
Hay 2000) or vacuum/freeze drying (Corbineau 
and Come 1988) and their applicability to a 
broader number of species, is under investigation 
to assess its feasibility in various species. 


24.2.2.2.3 Storage of Nonorthodox Seeds 
Seeds under this category cannot be maintained 
under conventional storage conditions. The pro- 
cessing of seeds under this category needs special 
attention due to the limitation in the availability 
of specific species and the seed quantity. So seed 
samples are acceptable even if quantity is limited. 
These seeds can be only maintained for short to 
medium term depending on its seed physiology. 
All seeds are hygroscopic in nature, and they tend 
to absorb and desorbs moisture depending on the 
surrounding humidity of the air. So depending on 
the seed moisture content, the seeds are equili- 
brated over the saturated salts to a safe and criti- 
cal moisture content at a safe temperature of 
12-15 °C which extends its storability to about 
24 weeks (Radhamani et al. 2003). Similarly, the 
storability could be extended for a period of 
3 months by sub-imbibed storage in high osmoti- 
cum salts of polyethylene glycol 6000 (PEG 
6000) alone or in combination with abscisic acid 
(Fig. 24.7). A few successful storage methods for 
nonorthodox seeds are as follows: 
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Table 24.3 Endemic and threatened species stored in NGB showing orthodox seed storage behaviour (NBPGR 2012) 


Sl.no. Family Scientific name 

1 Acanthaceae Barleria prionitis 

2 Alliaceae Allium stracheyi 

3 Amaranthaceae Achyranthes aspera 

4 Apiaceae Pinda concanensis 

5 Apocynaceae Tabernaemontana heyneana 

6 Araceae Arisaema caudatum 

7 Asparagaceae Chlorophytum borivilianum 

8 Asteraceae Inula racemosa, Saussurea costus 

9 Burseraceae Boswellia ovalifoliolata, Canarium strictum, Commiphora wightii 

10 Caesalpiniaceae §_ Hardwickia binata 

1] Calophyllaceae Calophyllum inophyllum 

12 Celastraceae Lophopetalum wightianum 

13 Colchicaceae Iphigenia magnifica 

14 Cycadaceae Cycas beddomei 

15 Dioscoreaceae Dioscorea deltoidea 

16 Dipterocarpaceae Shorea robusta 

17 Fabaceae Caesalpinia sappan, Gymnocladus assamicus, Mucuna pruriens, Pterocarpus indicus, 
Pterocarpus santalinus, Tephrosia purpurea, Vigna khandalensis, Vigna trilobata, 
Atylosia cajanifolia, Alysicarpus vaginalis, Dalbergia latifolia, Kingiodendron 
pinnatum 

18 Flacourtiaceae Flacourtia indica, Hydnocarpus pentandra 

19 Liliaceae Chlorophytum glaucoides 

20 Lythraceae Woodfordia fruticosa 

21 Myrtaceae Knema attenuata, Syzygium stocksii, Syzygium zeylanicum 

22 Papaveraceae Meconopsis robusta 

23 Poaceae Andropogon pumilus, Brachiaria reptans, Cenchrus biflorus, Cenchrus prieurii, 
Digitaria pennata, Oryza rufipogon 

24 Ranunculaceae Aconitum ferox 

25 Rhizophoraceae — Rhizophora apiculata, Rhizophora mucronata 

26 Rosaceae Prunus cornuta 

27 Rubiaceae Canthium dicoccum 

28 Santalaceae Santalum album 

29 Scrophulariaceae Picrorhiza scrophulariiflora 

30 Solanaceae Solanum vagum 

31 Vitaceae Cayratia pedata 

32 Zingiberaceae Hitchenia caulina 
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Short- to medium-term storage: 
Successful storage methods of recalcitrant seeds 


— Short-term storage has been found success- 
ful by moist/sub-imbibed storage in poly- 
ethylene glycol 6000. 

— Partial drying with the application of fungi- 
cide (0.2—05 % captan/thiram) and storage 
in perforated polyethylene bags. 

— Equilibration over saturated salt of potas- 
sium nitrate 94 % RH before storage. 

— Alternate drying and wetting of seeds every 
15 days during storage. 


Successful storage methods of intermediate seeds 
— Medium-term storage at 10—12 °C in screw 
cap plastic bottles 
— Medium-term storage at 10-12 °C in paper 
bags 
Seedling conservation. Under this technique, the 
growth of the young seedlings is arrested at 
low temperature (12-15 °C) in dark condi- 
tions with minimum water-soaked blotting 
sheets in petri plates (Fig. 24.8). Such seedlings 
at requirement are transferred to the pots, and 
further growth is continued at normal temperature 
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Fig. 24.7 A view of the 
National Genebank (NGB) 
at NBPGR showing 
storage modules on both 
sides (Pandey et al. 2010) 


and light conditions. This technique has been 

attempted in several tree species such as 

Symphonia globulifera, Dryobalanops aro- 

matica, Syzygium cuminii, Caryota urens, 

Saraca indica, etc. (Marzalina et al. 1992; 

Radhamani et al. 2003). 

Long-term storage: 

¢ Cryopreservation. With the advancement 
of new and innovative technologies in the 
field of cryobiology, several attempts in 
different crops have been made to success- 
fully cryopreserve the zygotic embryos and 
embryonic axes of intermediate and recal- 
citrant seed species. The embryonic axes are 
preferred as explants in most cases as they 
are highly meristematic well-organized small, 
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independent structures. Further details are 
discussed in Sect. 24.2.2.3. 


24.2.2.3 In Vitro Conservation 

In vitro techniques are useful tools for the conser- 
vation of plant biodiversity especially the genetic 
resources of nonorthodox seeds (where seeds are 
desiccation/freezing sensitive), vegetatively 
propagated crop species and rare and endangered 
plant species. In vitro culture techniques ensure 
the production and rapid multiplication of 
disease-free material. Medium-term conserva- 
tion of in vitro cultures is achieved by reducing 
growth (slow growth) of plant cultures by alter- 
ing the mineral concentration of the media, add- 
ing high osmotic solutions, changing light and 
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Seeds stored as seedlings in 
Syzygium cuminii 








Seeds stored as seedlings 
in Saraca indica 


Fig. 24.8 Conventional short-term storage methods in some recalcitrant seed species 


temperature conditions of growing cultures thus 
Increasing intervals between _ subcultures 
(Fig. 24.9). For long-term conservation, cryo- 
preservation (storage in liquid nitrogen at 
—196 °C) is the best option which allows the stor- 
age of germplasm by ceasing the complete meta- 
bolic activity, for extended periods, protected 
from contamination and with limited mainte- 
nance. In recalcitrant seeds, only limited success 
has been achieved due to their structural com- 
plexity and heterogeneity in seed characteristics 
and seeds being shed at high water content. 
Excised embryos/embryonic axes, somatic 
embryos, buds and apical meristems which can 


be desiccated easily to low moisture content are 
the alternative systems for low temperature stor- 
age. Under this technique, the protocol for the 
development of the whole plant from the embryos 
and subsequently their hardening and successful 
transfer to the field is an essential prerequisite 
(Radhamani and Chandel 1992; Engelmann 
1991). 


24.2.2.3.1 Slow Growth Storage 

Slow growth in tissue culture is a technique used 
for the preservation of in vitro grown plants 
(Engelmann 1997). However, the use is limited 
to few crop species. Minimal growth storage is 
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Fig. 24.9 A view of the national tissue culture repository at NBPGR (Pandey et al. | ) 


usually applied to differentiated plantlets and 
developing meristem cultures. The in vitro 
storage programme is influenced by various 
parameters such as the type of explants used, its 
physiological maturity, the type of storage mate- 
rial and post-collection storage conditions. These 
techniques are being developed and practised in 
several rare and endangered crop species as listed 
in Table (Fay ; Sarasan et al. yi 


24.2.2.3.2 Cryopreservation 
Cryopreservation is the storage of biological 
material at low temperature of —196 °C in the liq- 
uid phase of liquid nitrogen for the maximal phe- 
notypic and genotypic stability (Steponkus 
). This method being relatively convenient 
and economical, a large number of genotypes and 
variants could be conserved and thus maximize 
the potential for storage of genetically desirable 
material (Engelmann and Takagi ). Seeds of 
some orthodox species, embryos/embryonic axes 
and dormant buds display natural dehydration 
processes and do not require any pretreatment for 
cryopreservation. However, other systems such 


as cell suspensions, calluses, shoot tips, embryos, 
etc. contain high amounts of cellular water 
thereby making the system sensitive and prone to 
freezing injury. Under such conditions, cells have 
to be dehydrated artificially to protect them from 
the damages caused by crystallization of intracel- 
lular water (Mazur 1984). 
The techniques used are of two types — tradi- 
tional/classical and new (Withers and Engelmann 
). Under traditional techniques, system is 
slowly cooled down to a prefreezing temperature, 
followed by the rapid immersion in liquid nitro- 
gen. With temperature reduction during slow 
cooling, the cells and the external medium ini- 
tially supercool, followed by ice formation in the 
medium (Mazur +). Traditional freezing pro- 
cedures include pregrowth of samples, cryopro- 
tection, slow cooling to a determined prefreezing 
temperature and quick immersion of samples in 
liquid nitrogen, storage, rapid thawing and recov- 
ery, whereas the new advance novel cryopreser- 
vation techniques involve  vitrification-based 
procedures. Cell dehydration is performed prior 
to freezing by exposing the samples to concen- 
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Table 24.4 Jn vitro techniques available in endemic and threatened plant species (Fay 1992; Sarasan et al. 2006) 


Sl. no. 


O OH AYHD NM BH WwW 


13 


14 
15 
16 


17 
18 
19 
20 
21 
pH 
20 
24 
25 


26 


2. 
28 
29 


30 
31 
32 


33 


Explant used 
Axillary branching 
Axillary bud 


Bulb scale 

Chlorophyllous root, leaf, internode 
Corm 

Hypocotyl segment 

Hypocotyl, cotyledon 

Leaf 


Mature leaves and stem, petiole 
and roots of young seedlings 


Nodal segment (seedling explant) 
Nodal shoot 


Nodal segment, hypocotyl, cotyledon 
and leaf 


Node 


Node, internode, leaf 
Petiole 
Rhizome 


Root 

Root, hypocotyl, cotyledon, leaf 
Seedling 

Seedlings and shoots 

Shoot bases and immature floral buds 
Shoot segments 

Shoot tip 

Shoot tips, axillary buds 


Shoot tips, axillary buds and segment 
of young leaves 


Shoot tips, lateral buds, node, leaf calli, 


internode 
Shoot tips, leaf base, foliar leaves 
Shoot tips, nodes 


Shoot tips, nodes, internodes, hypocotyl, 


embryos, leaf 
Stem 


Terminal and nodal segment axillary bud 


Zygotic embryos 


Nodal segment/axillary bud 


Botanical name 
Feronia limonia 


Ceropegia candelabrum, Gerbera aurantiaca, Rumex 
acetosella, Trichopus zeylanicus, Valeriana wallichii, 
Wrightia tomentosa 


Sternbergia fischeriana 

Holostemma ada-kodien 

Gloriosa superba 

Pimpinella tirupatiensis, Coptis teeta 
Meconopsis simplicifolia 

Arnebia euchroma 


Psoralea corylifolia 


Coleus forskohlii, Anogeissus sericea 
Aconitum napellus, Aconitum noveboracense 


Gymnema sylvestre 


Decalepis hamiltonii, Pittosporum napaulensis, 
Nepenthes khasiana, Syzygium travancoricum, Adhatoda 
beddomei 


Rotula aquatica 

Nardostachys jatamansi 

Geodorum densiflorum, Ipsea malabarica, Kaempferia 
galanga 

Decalepis arayalpathra 

Saussurea obvallata 

Oncidium catharinensis 

Pterocarpus marsupium 

Chlorophytum borivilianum 

Caralluma edulis, Commiphora wightii 
Swertia chirata 

Aconitum carmichaeli, Rheum emodi 
Renanthera imschootiana 


Rauvolfia serpentina 


Vanda coerulea 
Gentiana kurroo 


Withania somnifera 


Celastrus paniculatus, Dendrobium moschatum 
Picrorhiza kurroa 

Ocotea catharinensis, Podophyllum hexandrum, 
Santalum album 


Ceropegia fantastica, Daphne cneorum, Coronopus 
navastii, Frerea indica, Huernia hystrix 
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trated cryoprotective media and/or air desicca- 
tion followed by rapid cooling. Several different 
vitrification-based procedures are being devel- 
oped: (a) encapsulation-dehydration, (b) vitrifi- 
cation, (c)  encapsulation-vitrification, (d) 
dehydration, (e) pregrowth, (f) pregrowth- 
dehydration and (g) _ droplet-vitrification. 
Similarly, the seeds/embryos of endemic and 
threatened species can be cryopreserved using 
different explants, where the simple known pro- 
tocols are possible to be implemented. 

Although methodology and freezing protocols 
have been developed for various higher plants 
(Bunn et al. 2007) including orchids (Hirano 
et al. 2006) and also bryophytes and fern spores 
(Pence 2008) using different explants, limited lit- 
erature is available for cryopreservation of rare 
and endangered species. Touchell and Dixon 
(1993) successfully cryopreserved 68 native 
Western Australian species with pretreatment of 
seeds with DMSO. Desiccation and freezing 
were undertaken in seeds of rare temperate 
orchids by direct immersion in liquid nitrogen 
(Nikishina et al. 2007). Studies were also con- 
ducted with endangered, rare, ancient and wild 
Citrus species (Malik and Chaudhury 2006; 
Lambardi et al. 2007; Hamilton et al. 2009). 
Protocorms of Cleisostoma arietinum, a rare Thai 
orchid species, have been cryopreserved follow- 
ing the method of encapsulation-dehydration 
(Maneerattanarungroj et al. 2007). Similarly, 
shoot tips of the endemic endangered plant 
Centaurium  rigualii were cryopreserved 
(Gonzalez-Benito and Perez 1997). Turner et al. 
(2001) and Tanaka et al. (2008) reported success- 
ful cryopreservation of shoot tips of endangered 
Australian and Japanese species, respectively, 
using the vitrification technique. The droplet- 
vitrification technique has been used for freezing 
shoot tips of wild potatoes (Yoon et al. 2007) and 
wild relatives of Diospyros (Niu et al. 2009). 

Presently, cryopreservation 1s employed on a 
large scale for different types of materials where 
conventional seed storage methods are not fea- 
sible, and established protocols are available. 
Cryopreservation is used mainly for storing 
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seeds with limited longevity and of rare or 
endangered species at several genebanks such as 
the National Center for Genetic Resources 
Preservation (NCGRP; Fort Collins, CO), the 
National Bureau for Plant Genetic Resources 
(NBPGR) New Delhi, India (Fig. 24.10), Kings 
Park and Botanic Garden in Perth, Australia, 
and the Center for Conservation and Research 
of Endangered Wildlife at the Cincinnati Zoo in 
the USA. 


24.2.2.4 Pollen Banks 

Cryopreservation of pollen is easy and cost- 
effective in supplementing the usual germplasm 
preservation techniques by seed and clonal stor- 
age particularly for enriching haploid gene pools. 
Pollen banks are efficient, economical and space 
saving, compared to the maintenance of live field 
collections. Pollen storage was developed as a 
tool for controlled pollination of asynchronous 
flowering genotypes, especially in fruit trees and 
agroforestry species (Alexander and Ganeshan 
1988) and is an emerging technology for genetic 
conservation (Roberts 1975; Omura and Akhima 
1980; Withers 1991). Pollen can easily be col- 
lected and cryopreserved in large quantities in a 
relatively small space. In addition, exchange of 
germplasm through pollen poses less quarantine 
problems compared with seed or other plant 
propagules. 


24.2.2.5 DNA Banks 

DNA from the nucleus, mitochondria and chloro- 
plasts can be routinely extracted and immobilized 
into the nitrocellulose sheets where it can be 
probed with numerous cloned genes. Since these 
techniques are simple, efficient and the samples 
occupy limited space, they are considered advan- 
tageous and may be considered as a practical 
method of conserving germplasm in the future. 
Such DNA banks for endemic and threatened 
species have been established at Kew Botanical 
Gardens, the UK, and Missouri Botanical Garden, 
the USA. In India, NBPGR, New Delhi has initi- 
ated a DNA banking facility for the conservation 
of genomic resources of India. 
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Fig. 24.10 A view of the cryobank repository at NBPGR (Pandey et al. 2010) 


24.2.3 Bridging In Situ and Ex Situ 
Conservation 


Unlike crop species where conservation purpose 
of a collection is to fuel crop improvement, for 
endemic and threatened wild plants, the objective 
is to ensure the survival of self-perpetuating via- 
ble populations in the wild. The ex situ collec- 
tions can act as backup and facilitate in situ 
conservation of wild populations (Husband and 
Campbell 2004). Several strategies to combine 
the strengths of the two basic approaches to con- 
servation while minimizing the weaknesses have 
been proposed. 

The basic method of such an inter situ 
approach envisages establishment and mainte- 
nance of an artificially managed off-site collec- 
tion maintained within the natural habitat of the 
species (Given 1987). Such sites may be within 
the current range of the distribution of the species 
(Guerrant and Pavlik 1998), in recent past range 
(Burney and Burney 2007) or completely new 
areas possessing similar environmental space as 
the original habitat (Maschinski et al. 2012), 
where a population can be reintroduced. Uma 


Shaanker and Ganeshaiah (1997) have proposed 
a modified methodology for conservation of 
threatened plant species, wherein by mapping the 
genetic diversity and geographical distribution of 
the target species, donor and recipient sites can 
be identified, and such recipient sites termed as 
‘forest genebanks’ act as sinks for the import of 
gene pool from outside sites as pollen. A similar 
strategy termed quasi in situ approach has been 
suggested by Volis et al. (2009). Such strategies 
have been successfully tested for the conserva- 
tion of endemic and threatened flora in different 
parts of the world (Volis et al. 2009; Cochrane 
et al. 2010) and can be adapted to the Indian 
scenario. 


24.2.4 Geospatial Technology 


Geospatial techniques available today are key 
and powerful tools for assessing geographic 
information for monitoring land use and land 
cover changes, changes in landscape, mapping 
potential, species distributions and for monitor- 
ing biodiversity losses. Such techniques have fur- 
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ther proven useful for conservation and 
management of natural resources worldwide. The 
geospatial techniques are a combination of sev- 
eral integrating technologies such as geographic 
information system (GIS), remote sensing (RS), 
global positioning system (GPS) and modern 
information and communication technologies 
(ICT). 

With respect to conservation biology, geospa- 
tial techniques have fuelled studies on biodiver- 
sity fragmentations, species distributions and 
levels of species richness and how they operate. 
With the ongoing simplification of native ecosys- 
tems, declining population and rising loss of bio- 
diversity, such techniques take on an even greater 
role in_ biodiversity conservation — efforts. 
Increasing number of earth observation satellites 
with better repetitive information, improvement 
in spectral bands, spatial resolution from 50 cm 
to | km and also unprecedented number of remote 
sensing tools have facilitated addressing various 
biodiversity conservation challenges. The avail- 
ability of advanced sensors with increased spatial 
and spectral resolution has enabled conservation 
biologists for making the direct use of remote 
sensing for distinguishing species assemblages or 
even identifying species of individual trees. In 
India, remote sensing techniques have been used 
for land use classification mapping particularly 
for improved management of biodiversity-rich 
areas and wildlife through precise biodiversity 
assessment on a larger spatial scale. These tech- 
niques can be further exploited to conserve and 
maintain the forest tree species, wild relatives 
and medicinal plants particularly for the endemic 
and threatened species to ensure their efficient 
and effective conservation (Yadav et al. 2013). 


24.3 Future Perspectives 


The past 50 years have witnessed a major revolu- 
tion in the area of biodiversity conservation yet 
utilization of the same is a recent attempt. The 
development of new techniques for germplasm 
conservation and its implementation offers new 
opportunities for conservation of biodiversity in 
the form of seeds, pollen, vegetative propagules 
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and DNA. NBPGR, New Delhi, equipped and 
enriched with all the facilities such as seed gene- 
bank, in vitro repository, cryobank and _ field 
genebank widens the scope for the conservation 
and management of endemic and threatened plant 
species in India. 

The IUCN Red List categories have been 
widely used throughout the world which was in 
place for last 30 years. As knowledge about the 
existing species and threat classification pro- 
gressed, some countries have developed improved 
versions at national levels. The threat status kept 
changing with changing environmental condi- 
tions. The species which was at the verge of 
extinction might have been rescued, regenerated 
and recovered the status of which is not updated. 
The lack of available basic biological data led to 
the failure in species recovery programme. So 
sequential improvement in indentifying the spe- 
cies and its exact status needs to be identified 
regularly and updated in the database especially 
in the areas which are prone to various catastro- 
phes, such as cyclone, landslide, floods and 
drought. 

Limited availability and narrowness of species 
diversity are the indicators to find out the status 
of species extinction. Climate change is one of 
the major factors responsible for species extinc- 
tion. The conservation of threatened species is 
challenging such as their ability to move with 
changing climatic conditions, and often, it may 
not be feasible due to restricted habitats and 
interdependence on other species (Preston et al. 
2008). The endemic species have evolved in geo- 
graphic isolation which limits their dispersal to 
new areas, so the conservation of biodiversity 
under such conditions demands special chal- 
lenges for threatened and endemic species, which 
needs to be designed to fit into the present 
scenario. 

The selection procedure for conservation of 
threatened species depends on the biological 
nature of the targeted species, practicality and 
feasibility of particular method, its cost- 
effectiveness and security afforded by that appli- 
cation. The development of — suitable 
complementary technique for the species needs 
to be identified for successful conservation of 
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gene pool which requires multidisciplinary 
approach. The future challenges for the conserva- 
tion are scientific, technical, socio-economical, 
legal and political including public awareness. 
Although several strategies suggest creating 
space for the protection of the natural habitats, 
but in the developing countries, population explo- 
sion may be a major limiting factor for the avail- 
ability of space. During the last few years, 
progresses have been made with the development 
of various biotechnological tools and their imple- 
mentation in new conservation strategies which 
should be modified accordingly to accommodate 
in the changing trends. A coordinated effort of 
researchers, research institutions, NGOs and 
communities is the need of the hour that can 
jointly make an effort for the best suitable con- 
servation depending on the available resources. 
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Abstract 

Biotechnological approaches like micropropagation, somaclonal variation, 
in vitro conservation, synseed technology, protoplast fusion, production of 
flavour and colouring components and development of novel transgenics 
have great potential in conservation, utilization and increasing the production 
of spices. Efficient micropropagation systems are available for many spices 
which are being used for propagation, conservation, safe movement and 
exchange of germplasm, crop improvement through somaclonal variation 
and transgenic pathways. Studies on the production of metabolites, flavour 
and colouring compounds using immobilized and transformed cell cultures 
are being attempted. Molecular markers and maps are being generated for 
crop profiling, fingerprinting, identification of duplicates, and marker- 
assisted breeding. Transcriptome sequencing is becoming an important tool 
for identification, isolation and cloning of useful genes. Bio-technological 
approaches involving microbials (antogonists/hyper parasites and PGPRs) 
with broad spectrum of disease suppression and growth promotion have 
been found effective in crop health management in spice crops. 
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25.1 Introduction 

Spices and herbs are aromatic plants — fresh or dried 
plant parts of which are mainly used to flavour our 
food and confectionery and also in medicine and 
perfumery industry. Spices and herbs are grown 
throughout the world, and it has been estimated that 
these crops are grown on an area of 8 million ha 
globally contributing to 31.6 million tons of spices 
annually. India’s share to world spices production is 
6 million tons. The global spice industry amounts to 
1.1 million metric tons, accounting to US$3.475 
billion in value. India’s share at the global level is 
0.575 million metric tons, accounting to US$2.037 
billion, 1.e., 52 % in volume and 58.6 % in value 
(Source: Spices Board of India 2014). 

Black pepper, cardamom, ginger, turmeric, 
vanilla, cinnamon, clove, nutmeg, tamarind, etc. 
constitute the major spices, while coriander, 
cumin, fennel and fenugreek are important seed 
spices followed by saffron, lavender, thyme, 
oregano, celery, anise and sage are important 
herbal spices. The productivity of many of these 
crops 1s low due to the lack of high-yielding, pest 
and disease-resistant varieties. The past few years 
have witnessed a quantum jump in utilization of 
biotechnological tools to achieve the above 
through commercial propagation, development 
of novel varieties and marker-assisted breeding 
(Nirmal Babu et al. 201 1d). 


25.2 Black Pepper 


25.2.1 Micropropagation and Plant 
Regeneration 


Micropropagation has been employed for large- 
scale production of disease-free planting materials 
and germplasm conservation. High rate of multipli- 
cation coupled with the additional advantage of 
obtaining disease-free planting material makes 
micropropagation a viable alternative to conven- 
tional propagation (Nirmal Babu and Minoo 2003). 
Black pepper, Piper nigrum L., is native to India 
and is the most important spice in the world. 
Conserving the genetic diversity and development 
of Phytophthora foot rot resistance are the immedi- 
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ate priorities for all breeding programmes. 
Technologies for micropropagation of black pep- 
per using various explants were reported (Nirmal 
Babu 1997; Nirmal Babu et al. 2012b). Multiple 
shoots can be induced using BA in the culture 
medium (MS or SH Medium) either alone or in 
combination with auxins (Fig. 25.1a). Endogenous 
contamination severely hampers establishment of 
black pepper cultures to over come this constrains. 
A commercially viable protocol for large-scale in 
vitro multiplication of black pepper these problems 
was reported by Nazeem et al. (2004). Protocols 
were standardized for micropropagation of other 
endangered and medicinally important species of 
Piper like P. longum and P. chaba, P. betle, P. bar- 
beri and P. colubrinum (Nirmal Babu 1997; Nirmal 
Babu et al. 2012b). Joseph et al. (1996) and Yamuna 
(2007) reported the somatic embryogenesis from 
zygotic embryos, while Nair and Gupta (2003, 
2006) reported the cyclic somatic embryogenesis 
from the maternal tissues, which have tremendous 
potential for automated micropropagation. Nirmal 
Babu et al. (2005a) reported the somatic embryo- 
genesis from mature leaf tissues (Fig. 25.1b, c). 
These protocols are useful in transgenic experi- 
ments. Plant regeneration was reported in other 
Piper species like P. longum, P. betle, P. chaba, P. 
attenuatum and P. colubrinum through direct and 
indirect organogenesis (Nirmal Babu et al. 2012b). 
Attempts on induction of variability on somaclones 
for tolerance to Phytophthora foot rot resistance by 
Shylala et al. (1996) resulted in identification of 
tolerant somaclones through in vitro selection of 
calli as well as somaclones using crude culture 
filtrate and toxic metabolite isolated from 
Phytophthora capsici. 


25.2.2 Molecular Characterization 


Recent advances in molecular biology led to more 
emphasis on molecular markers for characteriza- 
tion of the genotypes, genetic fingerprinting, 
identification and cloning of important genes and 
marker-assisted selection and in understanding 
interrelationships at the molecular level. Menezes 
et al. (2009) and Joy et al. (2007, 2011) developed 
and characterized a total of 16 microsatellite 
markers from black pepper and used them to 
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Fig. 25.1 (a) Biotechnological approaches for spices. 
Micropropagation in black pepper; (b) plant regeneration 
in black pepper; (c) somatic embryogenesis in black pep- 
per; (d) regeneration of transformants in black pepper; (e) 
micropropagation in cardamom; (f) regeneration of 
embryoids from cardamom anthers; (g) plant regeneration 
in ginger; (h) somatic embryogenesis in cinnamon; (i) 
micropropagation in thyme; (j) micropropagation in mint; 


study the genetic diversity of 20 varieties from 
Brazilian collections and 40 popular genotypes 
and 4 different species of black pepper from India, 
respectively. Liao et al. (2009) reported the isola- 
tion and characterization of 11 and 9 polymorphic 
microsatellites loci from a Piper polysyphonum 
and Piper solmsianum, respectively. These micro- 
satellite markers provide a reliable means to 
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(k) plant regeneration in coriander; (I) synthetic seeds in 
cardamom; (m) microrhizomes in turmeric; (n) microrhi- 
zomes in ginger; (0) in vitro gene bank of spices; (p) ger- 
minating cryopreserved encapsulated buds of vanilla; (q) 
germinating cryopreserved pollen of vanilla; (r) in vitro 
multiplication of nutmeg mace; (s) ISSR profiles of black 
pepper varieties 


understand the population structure and interrela- 
tionships in the genus Piper. Jaramillo and Manos 
(2001) used phylogenetic analysis of sequences 
of the internal transcribed spacers (ITS) of nuclear 
ribosomal DNA based on a worldwide sample of 
the genus Piper. Nirmal Babu et al. (201 1a) 
reported the molecular interrelationships between 
24 Piper species using RAPD profiles. Chaveerach 
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et al. (2002) studied interrelationships between 
three Piper species, viz., P. kadsura, Piper retro- 
fractum and P. chaba using morphological 
characters and RAPD profiles to demonstrate a 
closer relation between P. retrofractum and 
P kadsura than between P. chaba_ and 
P. retrofractum. Genetic diversity of Piper species 
using RAPD and ISSR (Fig. 25.1s) was also 
reported recently by many workers (Sen et al. 
2010; Nirmal Babu et al. 2012b). Jiang and Liu 
(2011) applied RAPD and SRAP markers to ana- 
lyze genetic diversity in 74 individual plants of 
Piper spp. in Hainan Island. The SRAP technique 
clearly distinguished all Piper spp. from each 
other. 

Molecular markers like RAPD, AFLP, ISSR, 
SSR and ITS polymorphism were used for assess- 
ment of genetic variability in black pepper and to 
characterize important cultivars, varieties and 
related species of black pepper to develop finger- 
prints and to study the interrelationships (Pradeep 
Kumar et al. 2003; Nazeem et al. 2005; Nirmal 
Babu et al. 2011la). A mapping population was 
developed for the preparation of genetic map of 
black pepper (Nirmal Babu et al. 2011a), and 
DNA markers were also used to study the genetic 
fidelity among micropropagated black pepper and 
long pepper plants (Nirmal Babu et al. 201 1a). 

Johnson et al. (2005) used male parent spe- 
cific RAPD markers for the identification of 
hybrids in black pepper (Piper nigrum L.). Sex- 
specific markers were developed in Piper 
longum L. using RAPD (Banerjee et al. 1999) 
and differential display (Manoj et al. 2008) 
Piper betle L. using ISSR (Khadke et al. 2012) 
and RAPD markers (Samantaray et al. 2012b). 
Sheji et al. (2006) developed a SCAR marker 
for identifying Phytophthora-resistant lines of 
black pepper. RAPD technique was also 
employed for the authentication of dried black 
pepper from its adulterant Carica papaya L 
(Dhanya et al. 2009). 


25.2.3 Protoplast Culture 


The ‘protoplast’ is devoid of cell wall and this 
makes the protoplast technology suitable for 
genetic transformation by introduction of trans- 
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gene DNA and somatic hybridization by proto- 
plast fusion of species or subspecies resistant to 
traditional cross-breeding, or isolation of sub- 
cellular organelles etc. Reliable procedures are 
available for isolation, culture and fusion of pro- 
toplasts from a range of spices. Successful 
isolation and culture of protoplasts were reported 
in leaf tissues in black pepper (Shaji et al. 1998), 
and these protoplasts could be successfully devel- 
oped up to microcalli stage. 


25.2.4 Genetic Transformation 


Reports are available on Agrobacterium- 
mediated gene transfer system (Fig. 25.1d) in P 
nigrum (Nirmal Babu et al. 2005a, 201 1d, 2013). 
Maju and Soniya (2012) reported an efficient 
protocol for genetic transformation using 
Agrobacterium tumefaciens strain EHA 105 and 
multiple shoot production in Piper nigrum var. 
Panniyur-1 to understand the mechanisms that 
control the regeneration process in the species. 
Mani and Manjula (2011) reported the 
Agrobacterium-mediated transformation and 
endogenous silencing in Piper colubrinum by 
vacuum infiltration. The in planta transforma- 
tion via pollen tube pathway was done by Asha 
and Rajendran (2010) in black pepper variety 
Panniyur-2, using the total exogenous DNA of 
Piper colubrinum, a_ species resistant to 
Phytophthora capsict. 


25.2.5 Cloning and Isolation 
of Candidate Genes 


Spices are sources of many important genes with 
antibiotic and pharmaceutical properties. Efforts 
are being made to identify and isolate genes of 
interest both for crop improvement and of indus- 
trial value. Molecular cloning of a cDNA frag- 
ment encoding the defence-related protein 
B-1,3-glucanase in black pepper (P. nigrum L.) 
and methyl-glutaryl-CoA reductase in Piper col- 
ubrinum was reported (Girija et al. 2005a, b). 
PCR-based SSH technique was used to generate 
a leaf-specific subtracted cDNA library for Piper 
nigrum L. for the identification of more number 
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of tissue-specific genes (Alex et al. 2008). Bhat 
et al. (2005) reported the isolation and sequencing 
of CMV coat protein gene infecting black pepper 
and its possible deployment in black pepper to 
induce virus resistance. Dicto and Manjusha 
(2005) used PCR-based SSH to identify P. colu- 
brinum resistance genes which are differentially 
expressed in response to salicylic acid. Mani and 
Manjula (2011) reported the cloning and charac- 
terization of two isoforms of osmotin, an antifun- 
gal PR-5 gene homologue, from a salicylic 
acid-induced subtracted cDNA _ library in 
P. colubrinum. Cloned isoforms of osmotin from 
resistant species could be used for molecular 
breeding of black pepper. Bioprospecting of 
novel genes from black pepper was attempted by 
Sujatha et al. (2005). Heterologous probes were 
used to identify the presence of pea lectin genes 
and tomato protease inhibitor genes in black pep- 
per. Resistance gene analogues (RGAs) have 
been isolated in Piper nigrum and Piper colubri- 
num by Tiing et al. (2012). 


25.3 Cardamom 


25.3.1 Micropropagation and Plant 
Regeneration 


Cardamom, considered the ‘Queen of Spices’, is 
also native to India. The productivity of carda- 
mom is hampered by various diseases of viral 
aetiology. Utilization of virus-free planting mate- 
rial is an important input into disease manage- 
ment strategy. Cardamom is one of the first crops 
where commercialization of micropropagation 
has been achieved. Efficient in vitro methods for 
rapid clonal propagation of cardamom (Fig. 25.1le) 
are available (Nadgauda et al. 1983; Nirmal Babu 
et al. 1997; Nirmal Babu and Minoo 2003, 2011b, 
201 1e). Successful regeneration of plantlets from 
callus of seedling explants and anthers (Fig. 25.1f) 
of cardamom was reported (Nirmal Babu 1997; 
Nirmal Babu et al. 2011b, Rao et al. 1982). 
Manohari et al. (2008) also reported an efficient 
protocol for the induction of somatic embryogen- 
esis and plant regeneration in small cardamom. 
Identification of a few Katte tolerant somaclones 
was reported by Peter et al. (2001). 
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25.3.2 Molecular Characterization 


Molecular profiling of 11 species representing 5 
major tribes, viz., Amomum, Aframomum, 
Alpinia, Hedychium and Elettaria, and 96 collec- 
tions of cardamom germplasm using RAPD, 
PCR, RFLP and ISSR markers to elucidate their 
interrelationships, identification of duplicates 
and geographical origins of Indian cardamom 
was reported by Nirmal Babu et al. (201 1a). 
Molecular characterization of selected cardamom 
genotypes using AFLP markers was also done in 
Columbia by Tamayo 2007. Molecular profiling 
of Indian, Sri Lankan and Guatemalan exported 
cardamoms using RAPD/ISSR primers indicated 
that though there is a lack of genetic polymor- 
phism among them, they show the variation in 
quality parameters (Thomas et al. 2006). 


25.3.3 Protoplast Culture 


Successful isolation and culture of protoplasts 
were reported from cell suspensions and leaf tis- 
sues in cardamom (Geetha et al. 2000), and these 
protoplasts could be successfully developed up to 
microcalli stage. 


25.3.4 Genetic Transformation 


A preliminary study on the transformation of car- 
damom was attempted using the biolistic process 
to study the optimum conditions for gene deliv- 
ery and the efficiency of the plasmid vector 
pAHC 25 and promoter Ubi-1 (maize ubiquitin) 
for transformation and gene expression in carda- 
mom embryogenic callus. Transient expression 
of GUS gene was noticed in the bombarded cal- 
lus tissue (Nirmal Babu et al. 1998). 


25.4 Ginger 


25.4.1 Micropropagation and Plant 
Regeneration 


Ginger is the third most important spice that origi- 
nated in South Asia. There is no seed set in ginger 
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leading to limited variability, and this hampers crop 
improvement programme. Rhizome rot caused by 
Pythium aphanidermatum and bacterial wilt caused 
by Ralstonia solanacearum are the major diseases 
affecting ginger. Diseases of ginger are often 
spread through infected seed rhizomes. Tissue cul- 
ture will help in the production of pathogen-free 
planting material. Clonal multiplication of ginger 
from vegetative buds (Nadgauda et al. 1980; 
Nirmal Babu et al. 1997, 1998; Sharma and Singh 
1997); optimization of media composition for 
micropropagation of ginger (Nirmal Babu et al. 
201 1c); regeneration of plantlets via callus phase 
from leaf (Nirmal Babu et al. 1992a), vegetative 
bud (Fig. 25.1g), ovary and anther explants (Kacker 
et al. 1993; Nirmal Babu 1997; Nirmal Babu et al. 
2005a, b; Lincy et al. 2009; Ramachandran and 
Chandrashekaran 1992) and plant regeneration 
from ginger immature anthers (Nirmal Babu et al. 
1992b) and anther have been reported (Samsudeen 
et al. 2000). This system was used for inducing 
somaclonal variability in ginger where lack of seed 
set hampers conventional breeding. A few promis- 
ing high-yielding rhizome rot-tolerant somaclones 
also have been identified (Nirmal Babu et al. 1996; 
Sumathi 2007). 

In nature, ginger fails to set fruit. However, by 
supplying required nutrients to young flowers and 
by in vitro pollination, ‘fruit’ development and 
subsequently plants could be recovered. In vitro 
pollination attempts were successfully made by 
Nazeem et al. (1996) to overcome the pre-fertil- 
ization barriers like spiny stigma, long style and 
coiling of pollen tube that interfered with natural 
seed set in ginger successful seed set was obtained. 
Induction of tetraploid ginger through in vitro col- 
chicine treatment and tetraploid somaclone with 
extra bold rhizomes was also reported (Adaniya 
and Shirai 2001; Smith et al. 2004; Nirmal Babu 
et al. 1996, 2005b; Wang et al. 2010). 


25.4.2 Molecular Characterization 


RAPD profiling of 90 accessions of ginger indi- 
cated moderate to low level of polymorphism 
(Sasikumar and Zachariah 2003; Nirmal Babu 
et al. 2005a, b; Parthasarathy and Nirmal Babu 
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et al. 2011c). Various markers like AFLP 
(Wahyuni et al. 2003; Sajeev et al. 2011) and 
ISSR (Kizhakkayil and Sasikumar 2010) were 
used to study the genetic diversity in ginger. Jiang 
et al. (2006) used metabolic profiling and phylo- 
genetic analysis to investigate the diversity of 
plant material within the ginger species and 
between ginger and closely related species in the 
genus Zingiber and also for authentication of gin- 
ger. Chavan et al. (2008) developed SCAR mark- 
ers as complementary tools for distinguishing Z. 
officinale from the other Zingiber species. 


25.4.3 Protoplast Culture 


Successful isolation and culture of protoplasts 
were reported from cell suspensions and leaf tis- 
sues in ginger (Nirmal Babu 1997; Geetha et al. 
2000), and these protoplasts could be success- 
fully developed up to microcalli stage. Somatic 
hybridization of ginger through chemical fusion 
(PEG mediated) and its regeneration was reported 
by Guan et al. (2010). RAPD technique was used 
for the identification of hybrids, and flow cytom- 
etry analysis revealed the diploid nature of all 
regenerated progenies. 


25.4.4 Genetic Transformation 


Transient expression of GUS was successfully 
induced in ginger embryogenic callus bombarded 
with plasmid vector pAHC 25 and promoter 
Ubi-1 (maize ubiquitin) callus tissue (Nirmal 
Babu 1997). Agrobacterium tumefaciens strain 
EHA105/p35SGUSInt, effective in expressing 
B-glucuronidase activity, was used to transform 
ginger by Suma et al. (2008). 


25.4.5 Cloning and Isolation 
of Candidate Genes 


Nair and Thomas (2007) identified the three 
primer pairs designed from the conserved motifs 
of NBS domain of NBS-LRR gene class as most 
successful in isolating RGCs in ginger, and these 
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provide a base for isolation of RGC mining in 
ginger. They also reported the efficiency and 
sensitivity of SSCP (single-strand conformation 
polymorphism) analysis in discriminating 
Pythium susceptible and resistant Zingiber acces- 
sions (Nair and Thomas 2012; Nair et al. 2010) 
and the isolation of resistant gene designated 
ZzR1 from Z. zerumbet and its correlation 
between ZzR1 expression and resistance of the 
wild taxa to Pythium aphanidermatum infection 
(Nair and Thomas 2013). Swetha Priya and 
Subramanian (2008) reported the presence of R 
gene of CC-NBS-LRR class of plant resistant 
gene in ginger varieties against Fusarium oxyspo- 
rum f. sp. zingiberi. Fujisawa et al. (2010) cloned 
a novel gene that could encode s (beta) bisabolene 
synthetase from ginger. Prasath et al. (2011, 2013) 
reported the isolation of PR5 protein genes CaPR5 
and ZoPR5 which code for the precursor proteins 
of 227 and 224 amino acids. They used a PCR- 
based suppression subtractive hybridization 
(SSH) method to identify C. amada (a potential 
donor for bacterial wilt resistance to Zingiber offi- 
cinale) genes that are differentially and early 
expressed in response to the R. solanacearum 
infection compared to Z. officinale. The study 
highlighted the expression of LRR, GST and 
XTG much higher in resistant species (C. amada) 
than in susceptible species (Z. officinale). 


25.5 Turmeric 


25.5.1 Micropropagation and Plant 
Regeneration 


Turmeric of commerce is the dried rhizomes of 
Curcuma longa L. which belongs to the family 
Zingiberaceae. India is the major producer and 
exporter of this spice. Curcumin is the important 
colouring material from turmeric, and development 
of varieties with high recovery of curcumin is the 
need of the hour. Successful micropropagation of 
turmeric has been reported (Nadgauda et al. 1978; 
Nirmal Babu et al. 1997; Sunitibala et al. 2001; 
Salvi et al. 2002; Panda et al. 2007; Ghosh et al. 
2013). This technique is used for the production of 
disease-free planting material. Organogenesis and 
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plantlet formation were achieved via callus cultures 
of turmeric (Nirmal Babu et al. 1997; Sunitibala 
et al. 2001; Salvi et al. 2000). Variants with high 
curcumin content were isolated from tissue-cul- 
tured plantlets (Nadgauda et al. 1982). Root rot 
disease-tolerant clones of turmeric cv. Suguna 
were isolated using continuous in vitro selection 
technique against pure culture filtrate of Pythium 
graminicolum (Gayatri et al. 2005). 

Renjith et al. (2001) reported the in vitro 
pollination and hybridization between two short 
duration types VK-70 and VK-76 and reported 
the seed set and seed development. This reduces 
the breeding time and helps in recombination 
breeding which was so far not attempted in tur- 
meric. Protocols for micropropagation of many 
economically and medicinally important zingib- 
eraceous species like Amomum subulatum (large 
cardamom), Curcuma aromatica (kasturi tur- 
meric), C. amada (mango ginger), C. zedoaria, 
Kaempferia galanga, K. rotunda and Alpinia spp. 
were developed (Vincent et al. 1992; Chang and 
Criley 1993; Ravindran et al. 1996; Geetha et al. 
1997; Chan and Thong 2004; Chithra et al. 2005; 
Raju and Anita-D 2005. Islam et al. (2004) 
described an efficient protocol for microrhizome 
production, in vitro, in turmeric. Rahman et al. 
(2004) reported the efficient plant regeneration 
through somatic embryogenesis from leaf base- 
derived callus of Kaempferia galanga L. 


25.5.2 Molecular Characterization 


Kress et al. (2001) proposed a new phylogenetic 
analysis of Zingiberaceae based on DNA 
sequences of the nuclear internal transcribed 
spacer (ITS) and plastid mat K regions. The 
results suggest that Curcuma 1s paraphyletic with 
Hitchenia, Stahlianthus and Smithatris. Sasaki 
et al. (2004) applied single-nucleotide polymor- 
phism analysis of the trnK gene for the identifica- 
tion of Curcuma plants. Genetic diversity 
evaluation among C. longa and other 14 Curcuma 
species was done using ISSR and RAPD markers 
(Syamkumar and Sasikumar 2007) placing them 
into seven groups which is somewhat congruent 
with classification based on morphological 
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characters proposed by the earlier workers. 
Genetic diversity in turmeric was assessed by 
using molecular markers like RAPD (Jan et al. 
2011), both RAPD and ISSR (Singh et al. 2012) 
and PCR-based markers, viz., RAPD, ISSR and 
AFLP (Das et al. 2011; Nirmal Babu et al. 2007, 
2011d). Development and characterization of 
EST-derived and genomic microsatellites in 
Curcuma longa were reported by Joshi et al. 
(2010), Siju et al. (2010) and Senan et al. (2013) 
which could be used for diversity analysis. Nayak 
and Naik (2006) carried out four C nuclear DNA 
content and RAPD analysis of 17 promising cul- 
tivars of turmeric from India. 

Salvi et al. (2003) and Praveen (2005) using 
RAPD analyzed turmeric somaclones and con- 
cluded that plants regenerated using shoot tips 
showed genetic stability, while the callus-derived 
and inflorescence-derived plants showed varia- 
tions. Tyagi et al. (2007) confirmed the genetic 
stability of 12-month-old in vitro conserved tur- 
meric by RAPD profiling. Sinus et al. (2010) 
developed and amplified microsatellite markers 
from ESTs of turmeric. Sasaki et al. (2002) used 
sequence analysis of Chinese and Japanese 
Curcuma drugs on the 18S rRNA gene and trnK 
gene and its application based on amplification- 
refractory mutation system analysis for their 
identification and authentication. Sasaki et al. 
(2004) used SNP analysis based on the differ- 
ences in the nucleotide positions in the 177, 645, 
724 and a 4 base indel on the trnK gene obtained 
using three different lengths of 26 mer, 30 mer 
and 34 mer reverse primers for the identification 
of four Curcuma sp. studied by Xia et al. (2005) 
used 5S rRNA spacer and chemical fingerprints 
for quality control and authentication of Rhizoma 
Curcumae, a Chinese medicine used for the 
removal of blood stasis and alleviating pain. 


25.5.3 Protoplast Culture 


Successful isolation and culture of protoplasts 
were reported from cell suspensions and leaf tis- 
sues in turmeric (Geetha et al. 2000), and these 
protoplasts could be successfully developed up to 
microcalli stage. 
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25.5.4 Genetic Transformation 


An efficient method for stable transformation 
was developed in turmeric using particle bom- 
bardment on callus cultures (Shirgurkar et al. 
2006). Transgenic shoots regenerated were mul- 
tiplied, and stably transformed plantlets were 
produced. Polymerase chain reaction (PCR) and 
histochemical GUS assay confirmed the stable 
transformation. Transformed plantlets were resis- 
tant to glufosinate. A protocol for genetic trans- 
formation and regeneration was established in 
C. alismatifolia using shoot explants through 
Agrobacterium strain AGLO _ harbouring 
binary vector pBII121 or ~ pBI121-Ca- 
ACSI. Transformation was confirmed by PCR, 
GUS assay and Southern blotting of regenerated 
plants (Mahadtanapuk et al. 2006). 


25.5.5 Cloning and Isolation 
of Candidate Genes 


Isolation, cloning and characterization of a 
mannose-binding lectin from cDNA (Chen et al. 
2005), a stress-responsive CDPK — gene 
(ZoCDPK1) from ginger rhizome, using rapid 
amplification of cDNA end (RLM-RACE) tech- 
nique (Vivek et al. 2013) and violaxanthin deep- 
oxidase (GVDE) (Huang et al. 2007) have been 
reported. Over-expression of ginger CDPK1 
gene in tobacco conferred tolerance to salinity 
and drought stress. ZoCDPK1 functions in the 
positive regulation of the signalling pathways 
that are involved in the response to salinity and 
drought stress in ginger, and it is likely operating 
in a DRE/CRT independent manner. Joshi et al. 
(2010) used degenerate primers designed based 
on known R gene in combination to elucidate 
resistance gene analogues from Curcuma longa 
cultivar Suvarna. Kar et al. (2013) used a previ- 
ously isolated resistance gene candidate Czp11 
from C. zedoaria resistant to Pythium aphanider- 
matum as a template to characterize a major 
resistance gene CzRI1 through candidate gene 
approach in combination with RACE-PCR 
Strategy. 
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25.6 Vanilla 


25.6.1 Micropropagation and Plant 
Regeneration 


Vanilla planifolia, native to Mexico and Central 
America, now cultivated in other parts of the 
tropics, is the source of natural vanillin. 
Micropropagation of vanilla using apical meri- 
stem was standardized for large-scale multiplica- 
tion of disease-free and genetically stable plants 
(Kononowicz and Janick 1984; Minoo 2002; 
Minoo et al. 2006; Minoo and Nirmal Babu 
2009). Successful plant regeneration from shoot- 
and seed-derived callus was reported in vanilla 
(Nirmal Babu et al. 1997; Minoo 2002). 


25.6.2 Molecular Characterization 


Markers like RAPD and AFLP coupled with mor- 
phological characters were utilized to assess the 
variability and hybrid nature of genotypes and of 
successful interspecific hybridization and produc- 
tion of hybrids between V. planifolia and V. aphylla 
(Minoo et al. 2006). RAPD marker was used to 
estimate the level of genetic diversity and interrela- 
tionships among different clones of V. planifolia 
and related species (Minoo et al. 2010). The data 
showed very limited variation within accessions of 
V. planifolia indicative of its narrow genetic base 
and its close relationship with V. tahitensis (Besse 
et al. 2004; Schluter et al. 2007; Minoo et al. 2008b; 
Minoo and Nirmal Babu 2009, Minoo et al. 2010). 
A comparative study of RAPD and ISSR was 
reported to analyze the interrelationships among 
nine cultivated, wild and hybrid Vanilla species 
(Verma et al. 2009). Fourteen microsatellite loci 
developed from V. planifolia shown to be mono- 
morphic within the cultivated accessions and 11 
markers out of 14 were polymorphic when trans- 
ferable to V. tahitensis (Bory et al. 2008). 


25.6.3 Protoplast Culture 


Minoo et al (2008a) reported the isolation of 
viable protoplasts in Vanilla species, i.e. in 
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V. andamanica, and that of PEG-mediated 
protoplast fusion between V. planifolia and V. 
andamanica. The protoplast fusion technology 
can be useful in gene transfer of useful traits to V. 
planifolia especially the natural seed set and dis- 
ease tolerance observed in V. andamanica. 


25.6.4 Genetic Transformation 


Protocols are available for genetic transformation 
of Vanilla planifolia using indirect procedure, 
viz., Agrobacterium tumefaciens using shoot tip 
sections (Malabadi and Natarajy 2007) and 
protocorm-like bodies from shoot tips as explants 
(Ratheesh and Ishwara Bhat 2011), providing a 
very useful basis for further genetic improvement 
of the orchid. 


25.6.5 Cloning and Isolation 
of Candidate Genes 


Sequencing of neutral genes has been used for 
reconstructing the evolutionary history of vanil- 
loid orchids, including a few vanilla species 
(Cameron et al. 1999; Cameron and Chase 2000; 
Cameron 2004, 2009; Cameron and Molina 
2006). Nuclear and plastid sequences were also 
used for unravelling the origin of the Tahitian 
vanilla (Lubinsky et al. 2008). Recently, the 
length polymorphism of the nonneutral caffeic 
acid O-methyl transferase gene was also used to 
analyze 20 vanilla species and confirmed the 
strong differentiation of Old World versus New 
World species in the genus (Besse et al. 2009). 


25.7 Tree Spices 


25.7.1 Micropropagation and Plant 
Regeneration 


Cinnamon, clove, nutmeg, curry leaf, pomegran- 
ate, tamarind, allspice and garcinia are some of 
the important tree spices. In these perennial tree 
crops, identification and clonal multiplication of 
high-yielding ‘elite’ genotypes become a priority 
due to long pre-bearing period. Micropropagation 
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of cinnamon, Chinese cassia and camphor was 
reported from seedlings and mature tree explants 
(Mini et al. 1997; Nirmal Babu et al. 1997; Huang 
et al. 1998). Multiple shoots were induced from 
shoot tips and nodal segments of Cinnamomum 
camphora on Woody Plant Medium (WPM) 
(Huang et al. 1998 and from a cotyledonary node 
on MS medium (Azad et al. 2005). Successful 
micropropagation of Chinese cassia was reported 
by Inomoto and Kitani (1989) using nodal 
explants from seedlings on MS_ medium. 
Micropropagation protocols for C. camphora 
were developed by Nirmal Babu et al. 2003. 

In vitro multiple shoot induction was worked 
out in G. indica (Kulkarni and Deodhar 2002). 
Murashige and Skoog’s medium supplemented 
with BAP gave optimal response in different geno- 
types investigated. Micropropagation of three spe- 
cies of garcinia was reported by Huang et al. 
(2000), Malik et al. (2005) and Mohan et al. 
(2012). In vitro shoot initiation from explants of 
field-grown trees of nutmeg was reported by 
Mallika et al. (1997). Micropropagation of clove 
from seedling explants have been reported 
(Mathew and Hariharan 1990; Suparman and 
Blake 1990). MS medium supplemented with IBA 
or activated charcoal induced root formation. 
However, there are no reports on successful micro- 
propagation of clove from mature shoot explants. 

Reports on micropropagation of curry leaf, 
pomegranate, camboge and tamarind are also 
available (Mascarenhas et al. 1987; Hazarika et al. 
1995; Rao et al. 1997; Bhuyan et al. 1997; Mathew 
et al. 1999; Nirmal Babu et al. 2000; Mehta et al. 
2000). High-frequency direct shoot proliferation 
was induced in intact seedlings of M. koenigii 
(Bhuyan et al. 1997). Shoot proliferation is also 
reported from different explants like nodal cut- 
tings (Nirmal Babu et al. 2000), leaves (Mathew 
and Prasad 2007) and immature seeds (Rani et al. 
2012). Efficient micropropagation protocols for 
Pomegranate were reported (Bin and Jiang 2003; 
El-Agamy et al. 2009; Patil et al. 2011). The plant- 
lets grown on WPM were found to be significantly 
better in average survival, plantlet height and aver- 
age leaf number per shoot when compared to MS 
and NN media (El-Agamy et al. 2009; Kaji et al. 
2013). In vitro regeneration and high-frequency 


K. Nirmal Babu et al. 


regeneration of tamarind were achieved in different 
media compositions (Hussain et al. 2004; Pattepur 
et al. 2010). Thidiazuron can play a major role to 
induce germination in tamarind seedlings Mehta 
et al. (2004). Reports on successful callus induc- 
tion and plant regeneration in nutmeg, cinna- 
mon (Fig. 25.1h), camphor, pomegranate and 
curry leaf are available (Bhansali 1990; Iyer et al. 
2000; Kong et al. 2009; Shi et.al. 2009, 2010; Paul 
et al. 2011). 


25.7.2 Molecular Characterization 


Genetic identification among four cinnamon species 
(Cinnamomum cassia, C. zeylanicum, C. burmannii 
and C. sieboldii) using nucleotide sequences of 
chloroplast DNA was studied by Kojoma et al. 
(2002). The nucleotide variation at one site in the 
trnL-trnF IGS and at three sites in the trnL intron 
was used for the correct identification of 
Cinnamomum species. Furthermore, single-strand 
conformation polymorphism (SSCP) analysis of 
PCR products from the trnL-trnF IGS and the trnL 
intron resulted in different SSCP band patterns 
among C. cassia, C. zeylanicum and C. burmannii. 
RAPD technology was applied for the analysis of 
normal and excellent types of Cinnamomum cam- 
phora L. and genetic interrelationship of nine 
Cinnamomum species (Hui and Linshui 2003 and 
Joy and Maridass 2008), respectively. Sheeja et al. 
(2013) reported the diversity analysis of nutmeg 
(Myristica) and related genera using both RAPD 
and ISSR markers. Species-specific bands could be 
identified from all the accessions, which can be con- 
verted into SCAR markers for genotype identifica- 
tion and authentication. 

Yapwattanaphun et al. (2004) used ITS 
sequence data to elucidate phylogenetic relation- 
ship of mangosteen (Garcinia mangostana) and 
several wild relatives (Garcinia spp.). The ITS 
sequence analysis showed that G. atroviridis, G. 
cowa, G. dulcis, G. malaccensis, G. mangostana, 
G. rostrata and G. vilersiana have nucleotide 
additivity (two different nucleotides at the same 
nucleotide position) at several sites in the ITS 
region. The occurrence of these species might be 
related to hybridization with ancestors, but the 
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genomic compositions, even chromosome num- 
bers, of these species are still unknown. Sulassih 
and Santosa (2013) worked on the diversity analy- 
sis of mangosteen and its relatives based on mor- 
phological and ISSR markers, revealing a close 
relationship between G. celebica, G. malaccensis 
and G. mangostana. It was determined that G. 
malaccensis and G. celebica were ancestors based 
on morphological and ISSR markers. Thatte and 
Deodhar (2012) identified male- and female-spe- 
cific molecular markers in Garcinia indica. 

Genetic variability and relationship studies of 
curry leaf were done using RAPD, DAMD and 
ISSR by Verma and Rana (2013). Genetic diversity 
studies of pomegranate using RAPD, SSR, AFLP, 
RAMP, nuclear rRNA and internal transcribed 
spacer were reported by many workers (Durga¢ 
et al. 2008; Jbir et al. 2008; Awamleh et al. 2009; 
Zamani et al. 2010; Hasnaoui et al. 2010; Pirseyedi 
et al. 2010; Ebrahimi et al. 2010; Soriano et al. 
2011; Zhao et al. 2013; Singh et al. 2013). Genetic 
diversity studies of tamarind using RAPD and 
AFLP markers were done by Diallo et al. (2007) 
and Algabal et al. (2011) and in allspice by Wadt et 
al. (2004). 


25.7.3 Cloning and Isolation 
of Candidate Genes 


Genes encoding cinnamomin (a type II RIP), 
which has three isoforms, were isolated from 
camphor seeds by Yang et al. (2002). A geraniol- 
synthase gene 1s also isolated from Cinnamomum 
tenuipilum by Yang et al. 2005. Expression of 
CtGES was exclusively observed in the geraniol 
chemotype of C. tenuipilum. Furthermore, in situ 
hybridization analysis demonstrated that CtGES 
mRNA was localized in the oil cells of the leaves. 


25.8 Seed and Herbal Spices 


25.8.1 Micropropagation and Plant 
Regeneration 


Seed spices and herbs constitute a large group of 
widely different aromatic plants which are used as 
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spices, culinary herbs and medicinal herbs and 
those which are used in aroma _ therapy. 
Micropropagation protocols for many seed and 
herbal spices are available. They include corian- 
der, anise, thyme (Fig. 25.1i), peppermint 
(Fig. 25.1j}), spearmint, celery, lavender, savory, 
Ocimum, oregano, basil, sage, fennel, parsley, dill 
and garlic, saffron and Eryngium foetidum capsi- 
cum (Bhojywani 1980; Venkataraman and 
Ravishankar 1987; Cellarova 1992; Furmanowa 
and Olszowska 1992; Panizza and Tognoni 1992; 
Patnaik and Chand 1996; Vandemoortele et al. 
1996; Sajina et al. 1997a; Ochoa-Alejo and 
Ramirez-Malagon 2001; Gupta and Bhargava 
2001; Sharma et al. 2004; Aflatuni et al. 2005; 
Karaoglu et al. 2006; Majourhat et al. 2007; 
Minas 2009; Song et al. 2009; Ascough et al. 
2009; Falk et al. 2009; Kothari et al. 2010; Fadel 
et al. 2010; Irikova et al. 2011; Samantaray et al. 
2012a; Nhung and Quynh 2012; Kara and Baydar 
2012; Navroski et al. 2012; Zeybek et al. 2012; 
Rodeva et al. 2013; Santoro et al. 2013; Dixit and 
Chaudhary 2013; Keller and Senula 2013). Clonal 
propagation of chemically uniform fennel plants 
through somatic embryoids was reported by 
Miura et al. (1987). Shoot regeneration protocols 
for fenugreek, cumin and coriander (Fig. 25.1k) 
were reported (Nirmal Babu et al. 1997; Tawfik 
and Noga 2001; Ebrahimie et al. 2003; Aasim 
et al. 2009). Jakhar et al. (2003) reported the in 
vitro flowering and seed formation in cumin. 
Somatic embryogenesis has been established 
in saffron, garlic (Keles et al. 2010) and chilli 
(Blazquez et al. 2003; Sheibani et al. 2006; 
Munyon et al. 1989). An efficient protocol for 
organogenesis from root protoplasts (Xu et al. 
1982) and adventitious shoot formation in fennel 
was developed by investigating the effect of plant 
growth regulators by Jakhar and Choudhary 
(2012). Profuse callus differentiation was 
observed when medium was supplemented with 
1.0 mg/l BAP followed by 1.0 mg/l BAP + 
0.5 mg/l IBA. The shoot morphogenesis was 
observed in callus proliferated from the shoot 
apex explants incubated at 1.0 mg/l BAP + 
0.5 mg/l IBA, upon subculture on the same levels 
of plant growth regulator. In order to create vari- 
ability, organogenesis followed by mutagenesis 
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has also been identified as a potential in vitro 
technique. In this process, stock organogenetic 
callus is treated with physical or chemical muta- 
gens. The studies have shown positive indications 
to isolate promising mutants in cumin (Raje et al. 
2004). Similar type of efforts can be made for the 
creation of variability in cumin for resistance to 
Alternaria blight, root rot in fenugreek, and for 
many other stresses. 

In vitro methods of screening could prove 
highly useful in screening a large germplasm 
collections or cell lines for resistance to prevalent 
fungal diseases and tolerance to drought and salt 
stress. The reports are available on in vitro selection 
for salt tolerance in fenugreek on media contain- 
ing 0.025-1.5 % NaCl (Settu et al. 1997) and 
drought-tolerant cell lines cultured on media con- 
taining 0.25—1.50 % PEG in coriander (Stephen 
and Jayabalan 2000) through tissue culture. 
Selection of somaclonal variants resistant to 
Septoria apiicola by callus culturing in the pres- 
ence of fungal culture filtrate in celery (Evenor 
et al. 1994), Fusarium yellow-resistant celery line, 
a somaclonal variant (Lacy et al. 1996), and resis- 
tant to Alternaria blight in cumin (Shukla et al. 
1997b) has been reported by different workers. 


25.8.2 Molecular Characterization 


Genetic diversity study of fennel, fenugreek, 
cumin, coriander, thyme, dill, garlic and saffron 
using RAPD, ISSR, AFLP and SSR was reported 
by many workers (Paran et al. 1998; Lopez et al. 
2008; Rubio-Moraga et al. 2009; Imran et al. 
2010; Zahid et al. 2009; Abdoli et al. 2009; 
Singh et al. 2011; Jana and Shekhawat 2012; Jo 
et al. 2012; Keify and Beiki 2012; Khalil et al. 
2012; Suresh et al. 2013; Siracusa et al. 2013; 
Singh et al. 2013). Molecular marker systems 
based on SCAR, CAPS, RAPD, STS, SSR. TRAP 
and SRAP are available for identification and 
fingerprinting of fennel varieties by Agbiotech 
(2009). Genetic diversity and identification of 
variety-specific AFLP markers were developed 
in fenugreek by Vinay Kumar et al. (2012). 
Twelve novel polymorphic microsatellite loci 
were developed and characterized from a 
repeat-enriched genomic library of Crocus 
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sativus to study population and conservation 
genetics of this economically and medicinally 
important species Nemati et al. (2012). Fenwick 
and Ward (2001) studied the RAPD-based 
cultivar identification in mint. RAPD- and 
AFLP-based hybrid identification in Mentha was 
reported by Shasany et al. (2005). 


25.8.3 Protoplast Culture 


Shekhawat and Galston (1983) reported the 
isolation, culture and shoot regeneration from 
mesophyll protoplasts of fenugreek. Successful 
regeneration of whole plants from tissue-cultured 
shoot primordial of garlic was reported by Ayabe 
et al. (1995). 


25.8.4 Genetic Transformation 


Wang and Kumar (2004) reported the heterolo- 
gous expression of Arabidopsis ERS1 which 
causes delayed senescence in_ coriander. 
Agrobacterium rhizogenes which mediate trans- 
formed root cultures were analyzed for the pro- 
duction of essential oils in dill (Santos et al. 
2002). The development of efficient genetic 
transformation protocol using glucuronidase 
gene in caraway was studied, and gene transfer 
was more efficient when cotyledonary node 
explants were used (Krens et al. 1997). 
Preliminary experiments on Agrobacterium- 
mediated transformation of uidA (GUS) gene 
into cotyledon and hypocotyls explants in fenu- 
greek were reported by Khawar et al. (2004). 
Stable transformation in cumin through particle 
bombardment was reported by Singh et al. (2010) 
and through Agrobacterium-mediated transfor- 
mation by Pandey et al. (2013). 


25.8.5 Cloning and Isolation 
of Candidate Genes 


Molecular cloning of mannose-6-phosphate 
reductase and its developmental expression in 
celery was studied by Everard et al. (1997). Wang 
and Kumar (2004) reported the heterologous 
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expression of Arabidopsis ERS1 that causes 
delayed senescence in coriander. 


25.9 Development 
of Synthetic Seeds 


Synthetic seeds or artificial seeds are defined as 
artificially encapsulated somatic embryos, shoot 
buds, cell aggregates or any other tissue which 
can be used for sowing as a seed and those that 
posses the ability to convert into a plant under in 
vitro or ex vitro conditions and that retain its 
potential after storage also. Artificial or synthetic 
seeds can be an ideal system for low-cost plant 
movement, propagation, conservation and 
exchange of germplasm. Synthetic seeds were 
developed by encapsulation of in vitro developed 
small shoot buds in 3-5 % calcium alginate in 
black pepper, shoot buds in cardamom 
(Fig. 25.11), somatic embryos and in vitro regen- 
erated shoot buds in ginger and turmeric, in vitro 
regenerated shoot buds, protocorms in vanilla 
(Sharma et al. 1994; Sajina et al. 1997b), somatic 
embryos in cinnamon and curry leaf and nodal 
segments in pomegranate leaf (Sundararaj et al. 
2010; Minoo 2002; Gayatri et al. 2005; Naik and 
Chand 2006; Nikhil and Shukla 2013). Synseeds 
have been reported in cumin (Tawfik and Noga 
2002a, b), coriander (Kim et al. 1996; Stephen 
and Jayabalan 2000), fennel (Sajina et al. 1997b), 
celery (Pratap 1992), dill (Ratnamba and Chopra 
1974; Sehgal 1978) and nigella (Hamid Elhag 
and Olemy 2004), and regeneration of these 
plants has been successfully obtained. 


25.10 Microrhizome 


Microrhizome technology is useful for develop- 
ing disease-free planting material and hence is an 
ideal source of planting material suitable for 
germplasm exchange, transportation and conser- 
vation. /n vitro induction of microrhizomes in 
ginger (Fig. 25.lm) was reported by many 
workers (Bhat et al. 1994; Sharma and Singh 
1995; Nirmal Babu 1997; Nirmal Babu et al. 
2003, 2005b; Tyagi et al. 2007; Sumathi 2007; 
Zheng et al. 2008). The microrhizome-derived 
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plants have more tillers but were shorter. They 
gave fresh rhizome yield ranging from 100 to 
800 g per plant with an estimated yield of 10 kg 
per 3 m? bed. Many reports are available on in 
vitro. microrhizome formation in turmeric 
(Fig. 25.1n) (Nirmal Babu et al. 2003; Cousins 
and Alderberg 2008). Low sucrose is reported to 
decrease the size of microrhizome, but optimum 
microrhizome production at 6—9 % sucrose was 
also reported. Sucrose (6-9 %) was most effec- 
tive in rhizome formation. 


25.11 Conservation of Genetic 
Resources 


In Vitro Conservation 
of Germplasm 


25.11.1 


The genetic resources of spices are conserved 
either in seed gene banks and/or in field reposi- 
tories. Conservation of the germplasm in in 
vitro (Fig. 25.10) and cryobank is a viable and 
a safe augment to conventional conservation 
strategies (Nirmal Babu et al. 2012c). 
Conservation of pepper, cardamom, herbal 
spices, vanilla and ginger germplasm in in vitro 
gene bank by slow growth was reported (Nirmal 
Babu et al. 1999; Tyagi et al. 2007). Protocols 
for in vitro conservation by slow growth of 
black pepper and its related species, viz., P. 
barberi, P. colubrinum, P. betle and P. longum, 
were standardized by maintaining cultures at 
reduced temperatures, in the presence of 
osmotic inhibitors and at reduced nutrient lev- 
els or by minimizing evaporation loss by using 
closed containers. The technology for in vitro 
conservation of zingiberaceous crops like gin- 
ger, turmeric, Kaempferia, cardamom and their 
related species was standardized by Geetha 
(2002) and in vanilla by Minoo (2002), Minoo 
and Nirmal Babu (2009). Slow growth tech- 
niques are being used for medium-term conser- 
vation of spices in in vitro repository at NBPGR 
(Mandal et al. 2000). Suspensions of embryo- 
genic cell lines of fennel, conserved at 4 °C for 
up to 12 weeks, produced normal plants upon 
transfer to laboratory conditions (Umetsu et al. 
1995). 
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25.11.2 Cryopreservation 


Cryopreservation of black pepper and cardamom 
seeds in liquid nitrogen (LN,) was reported by 
Choudhary and Chandel 1994, 1995. The tech- 
nology for cryopreservation of black pepper, car- 
damom, ginger, turmeric and vanilla germplasm 
(Fig. 25.1p) using vitrification, encapsulation and 
encapsulation-vitrification methods is available 
(Minoo 2002; Yamuna 2007; Minoo et al. 2012; 
Nirmal Babu et al. 2012). Cryopreservation of 
encapsulated shoot buds of endangered Piper 
barberi has achieved (Nirmal Babu et al. 2012c). 
Efficient cryopreservation technique for in vitro 
grown shoots of ginger based on encapsulation- 
dehydration, encapsulation-vitrification and vit- 
rification procedures was reported by Yamuna 
(2007). Minoo (2002) reported the cryopreserva- 
tion of vanilla pollen (Fig. 25.1q) for the conser- 
vation of haploid genome as well as assisted 
pollination between species that flower at differ- 
ent seasons and successful fertilization using 
cryopreserved pollen. Cryopreservation of cori- 
ander (Coriandrum sativum L.) somatic embryos 
using sucrose preculture and air desiccation was 
reported by Popova et al. 2010. Cryopreservation 
of celery using LN, was reported by Gonzalez- 
Benito and Iriondo 2002. 


25.12 Production of Secondary 
Metabolites 


Biotechnology can be utilized to exploit the 
potential of spices for bio-production of useful 
plant metabolites. The use of tissue culture for 
the biosynthesis of secondary metabolites partic- 
ularly in plants of pharmaceutical significance 
holds an interesting alternative to control produc- 
tion of plant constituents. This technique is all 
the more relevant in recent years due to the ruth- 
less exploitation of plants in the field leading to 
reduced availability. /n vitro proliferation of nut- 
meg mace (Fig. 25.1r) and synthesis of flavour 
components in culture were reported by Nirmal 
Babu et al. (1992a). Since mace is the source of 
anticarcinogenic compound myristicin, this tech- 
nique with improvement can be used for the pro- 
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duction of myristicin. Most of the reports in 
saffron were on the in vitro proliferation of stigma 
and in vitro synthesis of colour components and 
metabolites. The proliferation of stigma of saf- 
fron in vitro and chemical analysis of metabolites 
produced through tissue cultures of Crocus sati- 
vus were reported (Sano and Himeno 1987; 
Himeno et al. 1988; Sarma et al. 1991). 

Plant cells cultured in vitro produce wide 
range of primary and secondary metabolites of 
economic value. Production of phytochemicals 
from plant cell cultures has been presently used 
for pharmaceutical products. Production of fla- 
vour components and secondary metabolites in 
vitro using immobilized cells is an ideal system 
for spices crops. Ahmad et al. (2013) concluded 
that regenerated tissues of P. nigrum are a good 
source of biologically active metabolites for 
antimicrobial activities, and callus culture pre- 
sented itself as a good source for such activities. 
Production of saffron and capsaicin was reported 
using such system (Ravishankar et al. 1993, 
1995; Johnson et al. 1996; Venkataraman and 
Ravishankar 1997). Johnson et al. (1996) 
reported the biotransformation of ferulic acid 
vanillylamide to capsaicin and vanillin in immo- 
bilized cell cultures of Capsicum frutescens. 
Reports on the in vitro synthesis of crocin, picro- 
crocin and safranal from saffron stigma (Himeno 
and Sano 1995) and colour components from 
cells derived from pistils (Hori et al. 1988) are 
available for further scaling up. Callus and cell 
cultures were established in nutmeg, clove, cam- 
phor, ginger, lavender, mint, thyme (Furmanowa 
and Olszowska 1992), celery, etc. Cell immobili- 
zation techniques have been standardized in gin- 
ger, Sage, anise and lavender (Ilahi and Jabeen 
1992). The production of essential oils from cell 
cultures (Ernst 1989) and accumulation of essen- 
tial oils by Agrobacterium —tumefaciens- 
transformed shoot cultures of Pimpinella anisum 
were reported (Salem and Charlwood 1995). The 
regulation of the shikimate pathway in suspen- 
sion culture cells of parsley (Conn and McCue 
1994) and production of anethole from cell cul- 
tures of Foeniculum vulgare (Hunault and Du 
Manoir 1992) were reported. Growth and pro- 
duction of monoterpene by transformed shoot 
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cultures of Mentha citrata and Mentha piperita 
in flasks and fermenters was reported by Hilton 
et al. 1995. The production of rosmarinic acid in 
suspension cultures of Salvia officinalis has been 
discussed by Hippolyte et al. (1992). Reports on 
the production of phenolic flavour compounds 
using cultured cells and tissues of vanilla are 
also available (Dorenburg and Knorr 1996). Jn 
vitro production of petroselinic acid was reported 
from cell suspension cultures of coriander (Kim 
et al. 1996a). Kintzios et al. (2004) reported the 
scaling up of micropropagation of Ocimum basi- 
licum L. in an airlift bioreactor and accumulation 
of rosmarinic acid. Though the feasibility of in 
vitro production of spice principles has been 
demonstrated, methodology for scaling up and 
reproducibility need to be developed before it 
can reach commercial levels. Once standardized, 
this technology has tremendous potential in 
industrial production of important compounds 
like capsaicin, vanillin, crocin, picrocrocin, saf- 
ranal, myristicin, anethole, menthol and 
curcumin. 


25.13 Biotechnological 
Approaches on Disease 
Management in Spice Crops 


Crop losses in spice crops due to biotic stress are 
serious. Particularly, soilborne nature of the plant 
pathogens, poor amenability for effective disease 
management and lack of durable host resistance 
are the real constraints of production (Sarma 
et al. 2014). Foot rot and slow decline of black 
pepper (Sarma 2010), clump and capsule rot of 
small cardamom rhizome rot, Fusarium yellows 
and bacterial wilt in ginger and turmeric and root 
rots and wilts in cumin, coriander and fenugreek 
still remained as major threats in these crops. 
Though high degree of host resistance was identi- 
fied for Phytophthora capsici, Radopholus similis 
and Meloidogyne incognita (root/foot rot and 
slow decline diseases), in Piper colubrinum, a 
native of Brazil, efforts to develop resistant trans- 
genics are yet unsuccessful. Pending successful 
incorporation of R genes in susceptible cultivars, 
it 1s imperative to explore the available natural 
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resources to combat these biotic stresses in these 
crops. Microbial biocontrol technology as a part 
of integrated disease management (IPM) is an 
ideal option. This technology is ecofriendly and 
would reduce pesticide load into the ecosystem. 
Hence, it received considerable attention during 
the last four decades. In this well-proven micro- 
bial biotechnology, biological control (Cook 
1985) was found effective in reducing the crop 
losses in soilborne diseases of several crops, 
including spice crops (Sarma et al. 2014). 
Antagonism/antibiosis, hyperparasitism, growth 
promotion and induced systemic resistance (ISR) 
are the basic principles that govern this technol- 
ogy. Hyperparasitic fungi like Trichoderma har- 
zianum and T. viride, Pochonia chlamydosporia 
and Paecilomyces lilacinus and PGPRs (plant 
growth-promoting rhizobacteria) like 
Pseudomonas fluorescens, Pseudomonas cepacia 
(Fridlender et al. 1999), Bacillus subtilis and 
Arbuscular mycorrhiza (AM) were found effec- 
tive in reducing crop losses caused by soilborne 
plant pathogens in the spice crops. The large- 
scale production of these microbes adopting both 
solid and liquid fermentation technologies is in 
place. The delivery systems/formulations of these 
microbials both in solid and liquid (Chet 1987; 
Batta 2004) are now available in the market for 
the farming community to utilize them for the 
seed treatment and soil application. Application of 
these along with organic inputs that would serve 
as food base for their multiplication is suggested. 
The seed treatment is particularly important for 
seed spices, ginger and turmeric, where contami- 
nated seed is the primary source of infection 
(Singh et al. 1972; Sarma and Anandaraj 1998). 

Either fungus or bacterium to be successful 
biocontrol agents in a given ecosystem, rhizo- 
sphere competence (Ahmed and Baker 1987; 
Harman 1992) and competitive saprophytic abil- 
ity (CSA) are the important traits that are of para- 
mount importance (Sarma 2006). 


25.13.1 Trichoderma 


Intensive investigations carried out during the 
last four decades clearly established the potential 
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of Trichoderma as an effective biocontrol agent 
in suppression of several soilborne plant patho- 
gens (Papvizas 1985; Papvizas and Chet 1987). 
These biocontrol agents received considerable 
attention in the management of spice crop dis- 
eases, viz., black pepper (Rajan et al. 2002; Saju 
2004; Sarma and Saju 2004; Sarma 2010; Sarma 
et al. 2013), small cardamom (Suseela Bhai 
1998; Suseela Bhai et al. 1993), ginger (Sharma 
and Jain 1979; Usman et al. 1996; Usman 1997), 
turmeric, cumin (Vyas and Mathur 2002; Hagag 
and Abosedera 2005), coriander (Gopal Lal et al. 
2010) and fenugreek (Kakani et al. 2009). In 
vitro selection based on the inhibition of the tar- 
get pathogen, the inhibitory effects of both vola- 
tile and nonvolatile antibiotics on suppression of 
the target pathogen, disease suppression and 
growth promotion of the target crop both in pot 
culture and field evaluation are the criteria 
adopted to evaluate their bio-efficacy. The mode 
of action of this hyperparasite includes produc- 
tion of the lytic enzyme like cellulases, gluca- 
nases and chitinase, and antibiotics that are 
necessary for the degradation of the cell wall of 
pathogen were studied in these pathogens of 
spice crops. The disease suppressive effect of 7! 
harzianum was established through soil applica- 
tion that reduced the capsule rot of cardamom 
and was on par with treatment with copper fun- 
gicides and potassium phosphonate (Suseela 
Bhai 1998). Suppression of root rot in cardamom 
nurseries was demonstrated with soil application 
of T. harzianum (Sarma 2006). Similar results 
were obtained in rhizome rot of ginger caused by 
Pythium aphanidermatum both as seed treatment 
and soil application along with neem cake 
(Usman et al. 1996). In cumin, seed contamina- 
tion with F. oxysporum f. sp. cumini and also wilt 
suppression were accomplished with seed treat- 
ment and soil application (Vyas and Mathur 
2002). Root rot caused by Rhizoctonia solani in 
fenugreek was successfully controlled with T. 
viride application both as soil treatment and seed 
coating (Kakani et al. 2009). Economic viability 
of Trichoderma technology in black pepper was 
reported by Madan et al. (2006). 
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25.13.2 Plant Growth-Promoting 
Rhizobacteria (PGPRs) 


Rhizosphere of any crop plant is associated with 
abundant microbial load that would impact the 
crop health and productivity. This is an estab- 
lished fact and has been researched extensively. 
Microbes associated with rhizoplane, rhizosphere 
and endophytes (Hallmann 2001) received 
greater attention in plant microbe interaction to 
focus on growth promotion and disease suppres- 
sion in crop plants (Kloepper et al. 1980, 1993). 
In general, the parameters for selection and stud- 
ies on mode of action remained the same as that 
were adopted in fungal biocontrol agents. There 
is a spurt of research activity on this group of 
organisms taking advantage of their potential 
both for growth promotion and disease suppres- 
sion in several plant pathosystem (Lopper 1988). 
Kumar et al. (2013) have _ characterized 
Pseudomonas aeruginosa for their genetic and 
functional properties. Extensive studies carried 
out on black pepper — P. capsici — pathosystem 
revealed that Pseudomonas fluorescens increased 
growth in black pepper through production of 
growth promoters (Diby et al. 2001). Similarly, 
the induced systemic resistance (ISR) was 
achieved through the enhancement of defence- 
related enzymes like PAL, peroxidase, polyphe- 
nol oxidase (Diby and Sarma 2005; Diby et al. 
2001) and mycolytic enzymes in black pepper 
(Diby et al. 2005). The enhancement of nutrient 
uptake in black pepper through this microbial 
association was found promising (Diby et al. 
2001). In the case of ginger, the bio-efficacy of a 
bacterial strain mixture (Bacillus subtilis -S2BC1 
and Burkholderia cepacia) in suppressing rhi- 
zome rot and Fusarium yellows (Shanmugam 
et al. 2012, 2013) has been reported. The efficacy 
of microbial mixtures in root disease suppression 
has been reported (Raupach and Kloepper 1978). 
Similar results were reported in P. capsici-Black 
pepper pathosystem (Jisha et al. 2002). There is a 
need for field evaluation of these microbial 
mixtures to exploit their commercial potential. 
Harman and Stasz (1991) have utilized protoplast 
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fusion technique, for the production of superior 
biocontrol fungi. 


25.13.3 Arbuscular/Vesicular 
Arbuscular Mycorrhiza (AM/ 
VAM) 


The importance of AM/VAM was extensively 
studied for their role in enhanced nutrient uptake 
and the protection of root system from soilborne 
plant pathogens (Atkinson et al. 1994; Schenk 
1981). Increased nutrient uptake, altered host 
metabolism through enhanced production of phe- 
nolics, growth regulators and defence mechanisms 
are the factors implicated in root protection from 
soilborne plant pathogens. Glomus fasciculatum 
inoculum incorporated in black pepper nursery 
mixture was found effective in reducing root infec- 
tion caused by P. capsici, R. similis and M. incog- 
nita in black pepper (Anandaraj et al. 1996). 


25.13.4 Production of Healthy 
Nursery Stock of Spice Crop 


The production of root rot-free rooted cuttings of 
black pepper received considerable attention. 
Root rot caused by Phytophthora and plant para- 
sitic nematodes at nursery stage go unnoticed 
without any visible foliar symptoms. Hence, for- 
tifying nursery mixture with biocontrol agents 
(AM, fungi and PGPRs) is an accepted technol- 
ogy which is becoming popular in all pepper- 
growing countries (Sarma 2010; Sarma et al. 
2013, 2014). Similar methodology is also prac- 
ticed in cardamom nurseries. 


25.13.5 Biosafety Regulations 


The formulations of these microbials both solid 
and liquid (Batta 2004) with ideal population of 
these biocontrol agents are now available in the 
market for seed treatment and soil application. In 
general, the use of these formulations along with 
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organic inputs 1s suggested to ensure their multi- 
plication and colonization. However, biosafety of 
microbials, to be used for crop protection, 1s an 
essential prerequisite. The regulatory norms for 
many of these are now in place (Kulakshetra 
2004) and are further being improved. Alarmingly 
Pseudomonas aeruginosa an endophyte in black 
pepper highly effective in suppressing all the 
three major pathogens of the crop could not be 
used in crop protection, since P. aeruginosa 1s a 
human and animal pathogen (Kumar et al. 2012). 
The use of P aeruginosa has been banned in 
Europe for crop protection as a policy. Similar 
biosafety regulatory mechanisms are needed in 
India and elsewhere to make this technology 
farmer friendly. However, large-scale field dem- 
onstrations and correct appraisal among the 
farming community is called for to popularize 
this technology. Critical ecological studies of 
these microbes, their quantification in relation to 
protection through molecular techniques 
(Aravind et al. 2011), identification of organisms 
with multiple mode of action to address the dis- 
ease complexes, strainal improvement and the 
field demonstrations are needed to popularize 
this microbial biotechnology for crop protection 
in spice crops (Sarma et al. 2014). 


25.14 Conclusions 


Biotechnology has the potential to be a key tool 
to achieve sustainable agriculture and agri-based 
industry, through improvement of food produc- 
tion in terms of quantity, quality and safety while 
preserving the environment. Significant progress 
has been made in the field of biotechnology for 
micropropagation, conservation and manage- 
ment of genetic resources, disease and pest man- 
agement and molecular characterization. 
Identifying markers linked to important agro- 
nomic characters will help in marker-assisted 
selection to shorten breeding time. Application of 
recombinant DNA technology for the production 
of resistant types to biotic and abiotic stress has 
to go a long way before they can be effectively 


504 


used in spices improvement. Though programmes 
have been initiated in many laboratories for in 
vitro secondary metabolite production, these 
techniques are to be refined and scaled up for 
possible industrial production of the products. 
Owing to their commercial potential, intensifi- 
cation and application of biotechnology in 
spices 1S important and indispensable in the 
coming decade. Microbial intervention through 
T:harzianum, T.viride, Pflorescens and AM fungi 
has been found effective in disease suppression 
and growth promotion. Induced systemic resis- 
tance (ISR) as reflected in defense response 
appears to be one of the modes of action in dis- 
ease suppression in black pepper and ginger. This 
microbial biocontrol technology at present is 
being practiced by the farming community for 
effective disease management in spice crops. 
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Introduction 


Abstract 

Metabolic engineering considers metabolic and cellular system as an 
entirety and accordingly allows manipulation of the system with con- 
sideration of the efficiency of overall bioprocess, which distinguishes 
itself from simple genetic engineering. Metabolic engineering in plants 
involves the modification of endogenous pathways to redirection of one 
or more enzymatic reactions to produce new compounds or improve/ 
retard production of known compounds. Metabolic engineering is the 
redirection of one of more enzymatic reaction to produce new com- 
pounds in an organism, improve production of existing compounds or 
mediate the degradation of compounds. It reviews the current status of 
metabolic engineering. 
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compartmentation, transport and accumulation. 
Recently, the focus has been on metabolic 


Significant advances have been made in meta- 
bolic engineering through the application of 
genomics and proteomics technologies to eluci- 
date and characterise metabolic pathways and 
their regulation at all levels like genes, enzymes, 
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pathways in the context of the entire cell rather 
than at the single pathway level. The various 
interactions between proteins, DNA and metabo- 
lites, called the interactome, are the key determi- 
nant of any cellular process. Proteins are the 
functional entities that carry out the actual meta- 
bolic process by interconverting metabolites 
(metabolome). It 1s necessary for the metabolic 
engineer to understand these relationships in 
order to accurately design and control biological 
systems (Vemuri and Aristidou 2005). 
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26.2 Aims of Metabolic 
Engineering 


The goal of metabolite engineering is to increase 
the production of certain compounds in the nor- 
mal producing plant species or to transfer a path- 
way to other plant species or production of novel 
compounds not yet produced in nature by plants 
by gene manipulation. To increase the production 
of compounds, two general approaches have been 
followed. (1) Methods have been employed to 
change the expression of one or a few genes, 
thereby overcoming specific rate-limiting steps 
in the pathway, or to shut down competitive path- 
way and to decrease catabolism of the product of 
interest. (2) Attempts have been made to change 
the expression of regulatory genes. 


26.3 Metabolic Engineering 


Neumann et al. (2009) and Kumar (2015) (this 
volume) reviewed in detail the production of 
secondary metabolites, pharmaceutical and other 
biologically important compounds in plant cell 
and cultures. Ramawat et al. (2009) and Kumar 
and Sopory (2010) reviewed metabolic engineering 
in plants with respect to primary and secondary 
metabolites including terpenoids, carotenoids, 
phenolics and flavonoids. Plant molecular breed- 
ing and use of plant biotechnology for production 
of novel compounds have been described by 
Kumar et al. (2014). Several antitumor com- 
pounds have been isolated from plants. Kumar 
(2010), Kumar and Roy (2011) and Dar et al. 
(2013) reviewed breeding methods for plant 
tissue culture and under field conditions for 
medicinally important plants. Sujatha and Kumar 
(2008) and Tang and Newton (2010) described 
different plant tissue culture systems with improved 
efficiency using Agrobacterium- mediated 
genetic transformation. Various natural products 
like tropane alkaloids, atropine, hyoscyamine, 
scopolamine and steroidal precursors are mainly 
produced by roots (Ramawat and Merillon 2000; 
Kumar and Roy 2006; Kumar and Sopory 2008, 
2010; Kumar and Shekhawat 2009). Eclipta alba 
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(L.) Hassk. was transformed by Agrobacterium 
rhizogenes strain MTCC 532 (Bhansali and 
Kumar 2014). Plants of family Valerianaceae, 
e.g. Nardostachys jatamansi, have been induced in 
tissue culture to produce higher levels of com- 
pounds than in intact plants. Studies on medicinal 
uses of ferns and their biotechnology have been 
extensively covered by Fernadez et al. (2011). 


26.3.1 Biofuels 


The world’s fossil fuel reserves are running out. 
Governments around the world are looking to 
diversify their energy sources with nuclear, 
hydroelectric, wind and biofuel energy. Recent 
advances in synthetic biology and metabolic 
engineering can engineer microorganisms to pro- 
duce new fossil fuel replacements. Such products 
might include short-chain, branched-chain and 
cyclic alcohols as well as alkanes, alkenes, esters 
and aromatics. 

Biofuels include bioethanol, biodiesel (methy] 
and ethyl esters), biohydrogen, alkanes and vari- 
ous other hydrocarbon mixtures (such as wood 
gas and syngas). Today, however, ethanol is by 
far the most widely available; in the United 
States, for example, it constitutes 99 % of all bio- 
fuel produced (a total of over four billion gallons 
in 2005 from the fermentation of sugars derived 
from corn). Ethanol is biodegradable and pro- 
duces slightly less greenhouse emissions than 
fossil fuel (carbon dioxide is recycled from the 
atmosphere to produce biomass); it can replace 
harmful fuel additives (e.g. methyl tertiary butyl 
ether, MTBE). 

Ethanol is derived from corn or wheat for 
bioethanol and soya bean for biodiesel. However, 
recently biotech innovations and metabolic engi- 
neering are making it possible to use cellulosic 
biomass (wood chips, corn stalks, willow trees 
and switchgrass) for biofuel production. 

Metabolic engineering for the production of 
biofuels has been reviewed by Kumar (2013). 
Bioethanol from bamboo is shown to be both 
technically and economically feasible, as well as 
competitive with petrol in China. Some measures 
may include improving sugar release with more 
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effective pretreatments and reduced enzyme usage, 
accessing low-cost bamboo feedstock or selecting 
feedstocks with higher/more accessible 
cellulose. 


26.3.2 Lignocellulosic Biomass 
to Ethanol 


Lignocellulosic biomass, such as agricultural 
residues and forestry wastes, has been widely 
recognised as a sustainable source for biofuel 
production (Gong et al. 2014). Roy and Kumar 
(2013) reviewed methods for lignocellulosic 
materials. However, cellulose and hemicellulose, 
two major sugar polymers of lignocelluloses, 
have to be depolymerised by hydrolysis to enable 
more efficient microbial utilisation (Acker et al. 
2013b). Biomass hydrolysates usually contain 
monosaccharides as well as various amounts of 
oligosaccharides. Oligosac-charides are more 
challenging substrates than monosaccharides for 
microorganisms, because assimilation of oligo- 
saccharides may require additional hydrolytic 
enzymes and transportation systems. 


26.4 Metabolic Engineering 
of Oilseed Crops 


Recently, the demand for bioenergy produced 
from plant materials has increased. Transcription 
factors that control the biosynthesis of lignocellulose 
and lipids should be particularly investigated. 
Increase of the lipid content and modification of 
fatty acids in oil seeds have been required by the 
oleo-chemical industry. 

The metabolic engineering of seed oils is attrac- 
tive as the modification of lipid metabolism to 
change the quantity and quality of fatty acids in 
plants has important applications in the food 
industry, as well as in the production of deter- 
gents, fuels, lubricants, paints and plastics (Drexler 
et al. 2003). 

It is possible to genetically manipulate the 
composition of long-chain and medium-chain 
fatty acid in some of common oil crops, such as 
Glycine max (soybean), Brassica napus (oilseed 
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rape), Helianthus annuus (sunflower) and Zea 
mays (maize) (Drexler et al. 2003). 

Recent notable studies include experiments 
in which oilseed rape (Brassica napus) was 
transformed with multiple cDNAs encoding 
enzymes for the desaturation and functional 
modification of long-chain fatty acids. Liu et al. 
(2003) expressed 18:1 D12 desaturase either 
alone or in combination with 18:2 D6 desaturase. 
With the 18:1 D12 desaturase alone, the seeds 
produced up to 46 % linoleic acid, whereas with 
both enzymes, the seeds produced up to 43 % 
linolenic acid. Recently, the a-linolenic acid con- 
tent of rice grains has been increased tenfold by 
expressing the soybean FAD3 gene encoding O3 
fatty acid desaturase (Anai et al. 2003). 


26.5 Metabolic Engineering 
of Secondary Metabolites 


26.5.1 Terpenoids 


Terpenoids, also known as isoprenoids, are the 
largest family of natural compounds, consisting 
of more than 40,000 different molecules. All 
terpenoids are synthesised from the common 
precursors, dimethylallyl pyrophosphate and 
isopentenyl pyrophosphate. This occurs through 
two distinct pathways, the mevalonate-independent 
pathway and the mevalonate pathway, both of 
which have been targets for metabolic engineer- 
ing (Broun 2004). Transgenic plants with modi- 
fied terpenoid production help in fundamental 
studies of the biosynthesis and regulation of 
these compounds. Monoterpenoids (C10), ses- 
quiterpenoids (C15), diterpenoids (C20) and 
tritrepenoids (30) are considered to be second- 
ary metabolite. Following the recent discovery of 
the role of the 2-C-methyl-D- erythritol-4-phos- 
phate (MEP) pathway in the biosynthesis of plas- 
tidial terpenoid, such as the carotenoids and 
monoterpenes and diterpenes, several genes of 
this pathway have been cloned (Broun 2004; 
Broun and Somerville 2001). Luecker et al. 
(2004) have reported the transformation of 
tobacco plants with three different monoterpene 
synthases from lemon. When the three transgenes 
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were stacked in one transgenic line, the tobacco 
plants produced more terpenoids and displayed a 
different terpenoid profile to wild-type plants. 


26.5.2 Ornamentals 


The global value of the ornamental sector of the 
horticulture industry is likely to be 250-400 billion 
USD (Chandler 2013). Value addition to ornamen- 
tals used in the floristry, gardening, landscaping 
and the environmental amenity industries can be 
done by metabolic engineering. The molecular 
breeding approaches represent vast opportunities 
as alternatives to chemical treatments. Genetic 
engineering has potential to create ornamental 
plants suitable for environmentally friendly pro- 
duction. Metabolic engineering approaches tar- 
geting elements of the ethylene signal transduction 
pathway and its perception (e.g. etrl-1 and IPT) 
serve as possible alternatives to avoid the use of 
chemical ethylene inhibitors. Over-expression of 
GA2ox genes and reduction of expression of 
GA20ox genes as well as modulation of regulatory 
genes involved in GA perception and signal trans- 
duction (GAI, SHI) and overall hormone homeo- 
stasis (KNOX) have successfully led to compact 
plants (see for details Liitken et al. 2012). 

Lilies, Lilium spp., are of importance in floricul- 
ture. New lily cultivars with improved qualities 
such as resistance to insects and diseases, longevity 
and novel flower colour are desirable for both 
producers and consumers. Genetic modification 
has been proven to be an alternative to conventional 
breeding for the introduction of resistance to 
insects and disease in ornamental crops (Chandler 
and Brugliera 2011). Wang et al. (2012) provided 
information on factors affecting lily regeneration 
in a broad range of cultivars and also on gene 
transfer for further studying the transformation in 
the commercially most interesting lily hybrid 
cultivars, i.e. Orientals and OTs. 


26.5.3 Beta-carotene 


‘Golden rice’ contains up to 200 mg B-carotene 
per 100 g grain. Engineering high levels of 
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B-carotene in the endosperm of ‘golden rice’ is a 
major breakthrough. Rice endosperm accumu- 
lates geranylgeranyl diphosphate. Therefore, four 
novel enzymes are required for B-carotene syn- 
thesis in this tissue: phytoene synthase, phytoene 
desaturase and z-carotene desaturase and lycopene 
b-cyclase. The researchers found that phytoene 
synthase and crtI alone were sufficient not only 
for the synthesis of lycopene but also for that of 
B-carotene and zeaxanthin. 

Brassica napus seeds and oil contain negligi- 
ble amounts of B-carotene. In this case, the sim- 
ple over-expression of a bacterial phytoene 
synthase, crtB, fused to a plastidic transit peptide 
and under the control of a seed-specific napin 
promoter, is sufficient to boost B-carotene 300- 
fold, to a level similar to that found in oil palm. 
The B-carotene (provitamin A) levels have been 
increased in rice and canola seeds and in tomato 
fruits (Roemer et al. 2002). Astaxanthin, a non- 
plant carotenoid, has been produced in tobacco 
flowers (Mann et al. 2000). For several reasons, 
the carotenoid biosynthetic pathway is a very 
interesting target for genetic engineering. The 
carotenoids are important colour compounds in 
flowers, fruits and food products, and they are 
also good antioxidants. Vitamin A is formed from 
B-carotene. Vitamin A deficiency is a widespread 
problem in many areas like Asian countries. The 
introduction of B-carotene biosynthesis into the 
major staple food rice, by over-expression of 
phytoene synthase, phytoene desaturase and 
lycopene-cyclase, is thus an _ important 
achievement. 


26.5.4 Insect Repellents 

In one study, petunia was transformed with the 
(S)-limonene synthase gene from Clarkia brew- 
eri resulting in the production of linalool, which 
repels aphids (Lucker et al. 2001). 


26.5.5 Flavonoids 


Flavonoids have a wide range of colours from 
pale yellow to red, purple and blue. The flavonoid 
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biosynthetic pathway has been the choice to 
genetically engineer flower colours to obtain 
novel colours. Because of their antioxidant activity, 
higher levels of anthocyanin and flavonoids in 
food are an interesting objective. Much work has 
focused on tomato. Chalcone isomerase (CHI), 
an early enzyme of the flavonoid pathway, was 
found to be the key to increase flavonol production 
(Muir et al. 2001). Over-expression of petunia 
CHI gene led to a 78-fold increase of flavonoid 
levels in the tomato peel. Upon processing such 
tomatoes, a 21-fold increase of flavonols in tomato 
paste was achieved, if compared with non-trans- 
genic controls. 


26.5.6 Alkaloids 


The terpenoid indole alkaloid pathway has been 
the target of numerous genetic engineering 
attempts, owing to the fact that about 15 terpe- 
noid indole alkaloids are industrially important, 
including the antitumour alkaloids vinblastine, 
vincristine and camptothecin. Most efforts have 
been concentrated on the early part of the pathway 
and on over-expression of early genes, aiming to 
increase the metabolite flux into the alkaloid 
pathway. In particular, the genes encoding tryp- 
tophan decarboxylase (TDC) and strictosidine 
synthase (STR) have been studied extensively in 
Catharanthus roseus cell cultures (Verpoorte 
et al. 1994; Verpoorte et al. 2000; Verpoorte and 
Memelink 2002). 


26.6 Alkaloid Production 
in Tryptophan 
Decarboxylase 
Silenced Lines 


Tryptophan decarboxylase, a pyridoxal phosphate— 
dependent enzyme that produces tryptamine 1 
from tryptophan 2 (Facchini et al. 2000), was tar- 
geted for gene silencing (Helliwell and Waterhouse 
2005). The plasmid designed to suppress tryp- 
tophan decarboxylase (pTDC1) was transformed 
into Agrobacterium rhizogenes, which was then 
used to infect C. roseus seedlings to generate 
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hairy root culture as described (Hughes et al. 
2002; Runguphan and O’Connor 2009). Hairy 
root lines harbouring the pTDCi silencing plas- 
mid were cultured in liquid media, and alkaloids 
were extracted from the plant tissue according to 
standard protocols. Gratifyingly, the production 
of all major tryptamine-derived alkaloids ajmali- 
cine 3, serpentine 4, catharanthine 5 and taberso- 
nine 6 was substantially decreased in the five 
silenced lines that were examined. Suppression 
of alkaloid production appears to be stable, with 
little change in alkaloid production levels after 
seven subcultures performed thus far. 


26.7 C3 and C4 Metabolism 
and Plant Tissue Culture 
Studies 


The majority of terrestrial plants, including many 
important crops such as rice, wheat, soybean and 
potato, are classified as C3 plants that assimilate 
atmospheric CO2 directly through the C3 photo- 
synthetic pathway. C4 plants such as maize and 
sugarcane evolved from C3 plants, acquiring the 
C4 photosynthetic pathway to achieve high pho- 
tosynthetic performance and high water- and 
nitrogen-use efficiencies. So, there is great scope 
to study the metabolic pathway and try to engi- 
neer this in another plant, which is having less 
photosynthetic performance and high water and 
nitrogen efficiencies. The good news 1s that tech- 
nology continues to hold great promise for the 
future of plant metabolite engineering. 


26.8 Conversion of Cellulose into 
Microbial Lipids by 
Cryptococcus curvatus 


The oleaginous yeast C. curvatus can produce 
microbial lipids using a mixture of glucose and 
xylose as well as cellulosic biomass as feedstocks. 
Gong et al. (2014) has developed the simultane- 
ous saccharification and lipid production (SSLP) 
process for direct conversion of cellulose into lip- 
ids by oleaginous species in the presence of cel- 
lulase and B-glucosidase. 
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26.9 Tocopherols 


Higher vitamin E doses of 100—1,000 IU have 
been associated with cancer reduction, improved 
immune response and cardiovascular benefits. 
Such high doses are currently realised only through 
supplemental tocopherol intake. Tocopherols are 
also used in the pharmaceutical and cosmetics 
industry and as animal feed additives to improve 
the quality and shelf life of meat (Bramley et al. 
2000). Natural vitamin E originates almost exclu- 
sively from soybean oil and is used predominantly 
for human applications due to its premium price 
and limited availability. 


26.10 Role of Plant Tissue Culture 
Studies on Secondary 
Metabolite Production 
and Metabolic Engineering 


Plant cell walls determine the shape and architec- 
ture of the plant body and play major roles during 
cell differentiation, defence against pathogens 
and cellular growth. Cellulose, the main polysac- 
charide of plant cell walls, is made of B-1,4- 
linked glucan chains, which form crystalline 
microfibrils and act as a framework for the depo- 
sition of other wall components. The matrix 
polymers, such as hemicelluloses and pectins, are 
structurally diverse polysaccharides, and most of 
them harbour heavily substituted side chains. 
Second-generation biofuels are generally produced 
from the polysaccharides in the lignocellulosic 
plant biomass, mainly cellulose. However, because 
cellulose is embedded in a matrix of other 
polysaccharides and lignin, its hydrolysis into 
the fermentable glucose is hampered. 


26.11 Cell Wall Polysaccharides Are 
Synthesised in Different 
Cellular Compartments 


Cell wall polysaccharides are synthesised in 
different cellular compartments. Cellulose is 
synthesised at the plasma membrane by hexa- 
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meric cellulose synthase (CESA) complexes, 
which are believed to hold 36 individual CESA 
proteins (Somerville et al. 2004). In contrast to cel- 
lulose, the synthesis of the matrix cell wall poly- 
mers, that is, hemicelluloses and pectins, is 
carried out in the Golgi apparatus (see Liepman 
et al. 2005; Sterling et al. 2006; Jensen et al. 
2008). These polymers are synthesised by a 
plethora of glycosyltransferases and are sub- 
sequently modified by different hydrolases, 
esterases and lyases (Henrissat et al. 2001; 
Somerville et al. 2004). Modification and secre- 
tion of cell wall components can depend on the 
demand for cell wall biosynthetic events that 
may be transduced by cell surface signalling 
devices needs to be transformed into cellular 
responses. These may include post-translational 
modifications of enzymes, modulation of the 
rate of secretion and transcriptional activations 
or repressions. 

Although the lignin content had the main 
effect on saccharification, also other cell wall fac- 
tors could be engineered to potentially increase 
the cell wall processability, such as the galactose 
content. A better understanding of the effect of 
lignin perturbations on plant cell wall composi- 
tion and its influence on saccharification yield 
provide new potential targets for genetic improve- 
ment (Acker et al. 2013a). 

In contrast to common perception, a reduction 
in lignin was not compensated for by an 
increase in cellulose, but rather by an increase in 
matrix polysaccharides. 

Cell wall components to the cell surface are 
dependent on the secretory shuttling system 
(Taylor 2008). The endomembrane system also 
contains V-ATPase complexes which acidify the 
vesicle lumen, a process required for certain 
aspects of membrane trafficking. The vesicles 
containing the CESAs are transported to the 
plasma membrane with the help of actin cables 
via interactions with actin-associated proteins, 
possibly KAM1/MUR3 (Tamura et al. 2005; 
Geisler et al. 2008). After the delivery of the 
CESA complexes to the plasma membrane, vesi- 
cle components are recycled back to the endo- 
membrane system. Microfibrils are produced by 
CESA complexes and are co-aligned with the 
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microtubules beneath the plasma membrane via 
microtubule-associated proteins (see review 
Geisler et al. 2008). 

This system encompasses protein synthesis 
and folding in the ER, glycan modification and 
synthesis in the Golgi apparatus and endosomal 
vesicle loading and shuttling to different cellular 
locations, including the plasma membrane and 
the vacuole. Brux et al. (2008) reported that the 
cellulose content is dependent on V-ATPase 
activity. The pH difference across the vesicle 
membrane is established by endosomal-localised 
vacuola ATPases (V-ATPases). There is also evi- 
dence that these differently located isoforms may 
carry out different biological functions in the 
cell. RNA interference of the TGN-localised, but 
not of the tonoplast-located, WHA-a isoform 
leads to reduced cell expansion (Brux et al. 
2008). The VHA-al protein is located at the 
trans-Golgi network (TGN), whereas two other 
isoforms, VHA-a2 and VHAa3, are specific for 
the tonoplast (Dettmer et al. 2006). This is con- 
sistent with phenotypes observed in other mutants 
for V-ATPase subunits, which also display altered 
morphology of Golgi stacks (Strompen et al. 
2005; Dettmer et al. 2010). 

It is therefore possible that the defects in TGN 
trafficking caused by blocking V-ATPase activity 
result in retention of the CESAs in the TGN. Thus, 
the activity of TGN-localised V-ATPases appears 
crucial for vesicle trafficking during cellulose 
synthesis and cell expansion in plant cells. As in 
other eukaryotic cells, small GTPases and gua- 
nine nucleotide exchange factors (GEFs) are 
involved in the regulation of vesicle formation 
and trafficking in plants and also in the definition 
of the membrane identity of endomembrane 
compartments (Strompen et al. 2005; Hurtado- 
Lorenzo et al. 2006). This suggests that the 
V-ATPase could act both as a pH modulator and 
sensor that governs the recruitment of proteins 
from the cytosol to the endosomal membrane 
(Paredez et al. 2006; Dettmer et al. 2005, 2006). 
At least one component that is directly involved 
in the synthesis of cell wall polymers also affects 
vesicle trafficking (Madson et al. 2003; Tamura 
et al. 2005). The importance of the actin fila- 
ments for cell wall deposition was further under- 
scored in a recent study performed in xylem cells 
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(Wightman and Turner 2008). They suggested 
that not only microtubules, which may guide the 
CESAs (Paredez et al. 2006), but also actin cables 
may be crucial for cellulose deposition 
(Wightman and Turner 2008). 


26.12 Compounds 
of Pharmaceutical 
Importance 


In the 1950s, Charles Pfizer Co. (USA) tried 
unsuccessfully to produce pharmaceutical 
compounds through the generation of large-scale 
cultures of plant cells. Such an attempt was how- 
ever well received abroad, i.e. Germany and 
Japan, and resulted in the development of feasi- 
ble industrial applications of cultured cells in the 
late 1970s (see Kumar and Roy 2006, 2011; 
Thorpe 2012; Neumann et al. 2009; Kumar and 
Sopory 2010). By using radioimmunoassay tech- 
niques and enzyme-linked immunosorbent 
assays, aS well as cell cloning and repeat selec- 
tion of high-yielding strains from heterologous 
cell populations, improvements in production 
have been attempted. 

Cell immobilisation techniques have also been 
used. The utilisation of bioreactors allowing the 
large-scale growth of cells has also contributed to 
the field. Stirred tank reactors and several types 
of air-driven reactors have been employed. Such 
methods involve a two-step process: biomass 
accumulation through active cell growth fol- 
lowed by product synthesis coupled with reduced 
cell proliferation (Sharma et al. 2011). 

Enhancement of secondary product formation 
has also been achieved by selecting mutant lines 
overproducing the desired product and by using 
abiotic factors, including ultraviolet treatments 
and exposure to heat or to cold and heavy metals 
(Neumann et al. 2009; Kumar and Sopory 2010; 
Kumar and Roy 2011). 


26.13 Conclusions 
Metabolite engineering is generally defined as 


the redirection of one or more enzymatic reaction 
to produce new compounds in an organism, 
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improve the production of existing compounds or 
mediate the degradation of compounds. As pre- 
sented above, several points in a given metabolic 
pathway can be controlled simultaneously either 
by over-expressing and/or suppressing several 
enzymes or through the use of transcriptional 
regulators to control several endogenous genes. 
Over the past few years, there has been a growing 
realisation that metabolic pathways must be 
studied in the context of the whole cell rather 
than at the single pathway level and that even the 
simplest modifications can send ripples through- 
out the entire system. Metabolic engineering in 
plants involves the modification of endogenous 
pathways to increase flux towards particular 
desirable molecules or redirection of one or more 
enzymatic reaction to produce new compounds 
in an organism or mediate the degradation of 
compounds. Recently, significant advances have 
been made in metabolic engineering through 
the application of genomics and proteomics 
technologies to elucidate and characterise meta- 
bolic pathways. The major challenge for the 
coming years is to obtain more information about 
regulation at all levels like genes, enzymes, com- 
partmentation, transport and accumulation. This 
will open the way for successful strategies for 
altering the accumulation of certain compounds. 
In this review, efforts are made to look at recent 
examples of single- and multiple-gene metabolic 
engineering in primary and secondary metabo- 
lism groups. 
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Abstract 


Genetic engineering is the alteration of the genome of an organism by 
introduction of one or a few specific foreign gene mainly with the help of 
Agrobacterium or particle gun. The modified organism is described as 
‘transgenic organism’. If gene is inserted in plants, they are called as trans- 
genic plants. In this highly sophisticated method of genetic modification 
of plants, the genetic material or DNA sequence coding for a desirable 
trait is located in the donor organism by a variety of molecular technique 
and then cut out from the parental DNA using the ‘molecular scissors’ 
known as restriction endonucleases. Genetic engineering makes it possible 
to exchange or introduce the gene of interest, using the modern tools of 
recombinant DNA (rDNA) techniques, within same or different species or 
also within kingdoms. Genetically altered foods are made from plants or 
animals that have had their deoxyribonucleic acid (DNA) changed in a 
way that does not happen naturally. Often, individual genes are transferred 
from one organism to another, even when the organisms are not related. 
As the population on earth is expected to rise from 6.7 billion to 9 bil- 
lion by 2050, this will increase the demand for food; hence, there is a need 
to increase food to 50 % by 2030. Because of many problems like avail- 
ability of arable land which has affected due to urbanisation, desertifica- 
tion and environmental degradation. The second big challenge is the 
availability of water good enough for drinking and farming. Water 
resources are under severe strain in many parts of the world. Hence, the 
challenge of food production by 50 % must be tackled on the available 
land with water scarcity using biotechnology tools. Modification of plants 


by rDNA technology would be most beneficial. 
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The need for better traits in the plant is the base of GM plants, and the 
skill and knowledge of transferring gene into plants indorsed the concep- 
tion that some of the primary transgenic crops like tobacco, tomato, corn, 
soybean and cotton developed mainly for insect and herbicide resistance 
with single or multiple gene are under cultivation in different parts of the 
world. Apart from these qualities, researchers have also proposed on traits 
like increased vitamin and mineral contents of grains by enhanced amino 
acid profiles, better vegetable oils by improving nutritional composition of 
oilseed and crops with increased tolerance to biotic and abiotic stresses. 
The production of active medical compounds and industrial enzymes and 
bioplastics is an important trait for manipulation. 


Keywords 
Genetically modified (GM) crops ¢ Gene transfer in plants * Agrobacterium 


e Particle gun * GM foods ¢ Biosafety 


27.1 Introduction 


Genetic engineering, the term we know today, 
started with the discovery of Agrobacterium 
tumefaciens and its ability to integrate its own 
genome into host plant, and thereafter, particle 
gun bombardment and electroporation technique 
came in the game which facilitates insertion of 
gene of interest into the plant genome (Barton 
et al. 1983; Herrera-Estrella et al. 1983; Horsch 
et al. 1985; Fromm et al. 1985; Klee et al. 1987; 
Sanford et al. 1987). Deliberate alteration of the 
genome of an organism by introduction of one or 
a few specific foreign gene is referred to as 
‘genetic engineering’ or “genetic transformation’, 
and the modified organism is described as a 
‘transformed’ or ‘transgenic organism’. In this 
highly sophisticated method of genetic modifica- 
tion of plants, the genetic material or DNA 
sequence coding for a desirable trait is identified 
in the donor organism by a variety of molecular 
technique and then cut out from the parental 
DNA using the ‘molecular scissors’, restriction 
endonucleases. The relatively small pieces of 
DNA are then introduced in the recipient plant 
cells by one of several possible methods, and the 
plant is regenerated from them. Such plants 


having transgene in their genome are called as 
genetically engineered plants, i.e. GE plants or 
genetically modified (GM) crops and food. 

Genetic engineering makes it possible to 
exchange or introduce the gene of interest, using 
the modern tools of recombinant DNA (rDNA) 
techniques, within same species or from species 
into an organism or within kingdoms. Genetically 
altered foods are made from plants or animals 
that have had their deoxyribonucleic acid (DNA) 
changed in a way that does not happen naturally. 
Often, individual genes are transferred from one 
organism to another, even when the organisms 
are not related. 

By manipulating the genes and genome of 
plants, scientists have created two types of genet- 
ically altered foods: first are those beneficial to 
the producer and second are those beneficial to 
the consumer. Manipulation in genome all 
depends upon the traits we want to introduce in 
the plant. The most important beneficial traits 
include insect resistance, viral resistance, herbi- 
cide resistance, increased yield or reduced time 
needed to ripen. Consumer-friendly traits include 
longer freshness, greater nutritional value or 
medicinal value. 
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27.2 Need for Genetically 
Modified Crops 


The population on earth is expected to rise from 
6.7 billion to 9 billion by 2050. This will increase 
the demand for food, and to accommodate this, 
agriculture production all over the world need to 
be increased by 50 %, by 2030 (Song et al. 2003). 
One of the biggest problems is availability of 
arable land. The percentage of agriculture land is 
limited, the remaining land lost to urbanisation, 
desertification and environmental degradation. 
Second big challenge is availability of water 
good enough for drinking and farming. Water 
resources are under severe strain in many parts of 
the world. In the last 6 years, there is a fourfold 
decrease in the availability of freshwater per per- 
son (Tzfira and Citovsky 2006). Seventy percent 
of the total available water is used for agriculture 
(The Royal Society 2009), and about 50 % of the 
world total wetland has already disappeared. 
Hence, the challenge of raising food production 
by 50 % must be tackled on the available land 
with water scarcity. 

Climate change is the next big challenge 
which will put pressure on agriculture produc- 
tion. Greenhouse effect, glacier melting on poles 
and rise in the sea level ultimately submerge low- 
lying croplands. Temperature will rise above 30°, 
there will be increased heat and drought spells 
which ultimately lead to limited crop production 
(Sanford et al. 1987). Despite of these abiotic 
stresses, biotic stresses are also going to be chal- 
lenging for food production. Pest and diseases 
are also expected to increase. The stress due to 
biotic and abiotic factors leads to 30-60 % yield 
losses each year globally. 

Current situation 1s at critical stage and will be 
more demanding for creating more land and 
water for cultivation. We cannot increase the land 
availability for agriculture but what we can do is 
improve our plants to withstand these stresses. 
Thus, the important goal would be to adopt our 
existing food crops to higher temperature, 
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decrease water availability, flooding, rising salin- 
ity and challenging pathogen and insect threat 
(Song et al. 2003). Such improvements will 
require diverse approach that will enhance the 
sustainability of our agriculture and farmers. 
These include more effective land and water 
management, integrated pest management of new 
generation of agriculture and crop tolerance to 
diverse stresses (Shah et al. 1986). 


27.3 Classical Breeding Versus 
Genetic Engineering 


Modifications made in plants by rDNA technol- 
ogy and classical breeding are having both simi- 
larities and differences. Similarities involve 
exchange of genome/genes, changes in sequence 
order, up- and downregulation of genes in an 
organism and utilisation of many of the enzymes. 
However, differences with the rDNA technique 
are, small amount of genetic material is modified 
(one or few genes). But in classical breeding 
technique, where whole genome of an organism 
is involved, which exchanges position of genes 
within organism (Fig. 27.1). The next important 
difference is that in rDNA technology we can 
precisely control expression and place of expres- 
sion of gene, where this precision is not possible 
with classical breeding (Cho et al. 2000). 

Third important difference is that classical 
breeding can be done in related species or gen- 
era. For example, a wild Lycopersicon variety 
can be crossed with cultivated tomato 
(Lycopersicon esculentum) varieties (Chetelat 
et al. 1995), and wheat (Triticum aestivum) can 
be crossed with rye (Secale cereale) to yield trit- 
icale (Stallknecht et al. 1996). But in many 
crosses, wild relatives and different genera or 
species are not compatible, or crosses can be 
made but the resulting embryos must be rescued 
by in vitro culture to obtain a plant (Shah et al. 
1986). Exactly apposite to this, in rDNA 
approach, we can use genetic material from any 
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Fig. 27.1 Classical 
breeding vs rDNA/GM 
food (Source: Biotech 


Mentor’s Kit 2003) Wild Relative 


living organism. This gives us freedom to intro- 
duce DNA from bacteria, fungus, virus and ani- 
mals to plants. Thus, we can come up with gene 
combinations never seen before. 


27.3.1 Marker-Assisted Selection 
in Aid to Classical Breeding 


We all know classical breeding is a cumbersome 
and tedious method which takes several years to 
develop a single variety with desirable character. 
To increase its speed and specificity, marker- 
assisted selection (MAS) comes in aid. When we 
know which specific gene or allele is responsible 
for certain characters in plants, then it can be 
used as a marker to assist selection of specific 
traits in breeding. This ability to mark or select 
the desirable character specifically and to avoid 
the deleterious ones in a fast and reliable manner 
is critical to the development of improved germ- 
plasm through breeding. 

MAS can help to quickly identify plants with 
desirable traits in large populations. These mark- 
ers are closely associated with the desirable trait 
and are indirectly used for trait selection in plants 
(Dubcovsky 2004). Nowadays with the knowl- 
edge of complete and partial genome of the 
organism, it is possible to identify marker within 
the causative gene itself rather than linked to it. 
For example, Xa5 gene, which provides resis- 
tance to bacterial blight in rice (Oryza sativa), 
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marker was found in gene itself, and its use 
prevented separation of the marker from the trait 
by recombination (lyer-Pascuzzi and McCouch 
2007). 

MAS particularly comes handy for selection 
of traits when (a) phenotypic screening in the 
field is difficult or costly, e.g. drought/frost toler- 
ance or resistance to exotic diseases or pests; (b) 
multiple alleles exist; and (c) recessive or low 
heritability traits exist that require progeny test- 
ing. MAS is used for some important traits, e.g. 
leaf rust resistance in wheat (Niederberger et al. 
2004), erect panicle in rice (Kong et al. 2007), 
soybean (Glycine max) rust resistance (Hyten 
et al. 2007), drought adaptation in maize (Ribaut 
and Ragot 2007) and root-knot nematode resis- 
tance in cotton (Gossypium hirsutum) (Ynturi 
et al. 2006). 

Organic farming is a new upcoming trend in 
international market. GM crop utilisation is 
banned in this practice. National Organic Standards 
specify that these varieties cannot be intentionally 
grown by organic farmers. But MAS can be used 
to introduce desired traits from wild or related spe- 
cies, and these species are acceptable to organic 
growers. Utilising MAS to create food crops has 
led to the term ‘super organics’ (Manning 2004, 
Super organics. Wired, May, Issue 1205 53), crops 
that are grown under certified organic conditions 
and were created through classical breeding using 
genomic information and MAS, not through rDNA 
methods. 
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27.4 Global Scenario 
of Genetically 
Modified Crops 


The success of these GM crops can be measured 
by increase in their global acreage or farmer 
acceptance. Biotech crop hectares increased by 
more than 100-fold from 1.7 million hectares in 
1996 to 175.2 million hectares in 2013. In 2013, 
175.2 million hectares of biotech crops were 
grown globally. Hectare of biotech crops 
increased every single year from 1996 to 2013 
with 12 years of double digit growth rates. In 
2013, hectarage of biotech crops grew at an 
annual rate of 3 %, up by 5 million from 170.3 
million hectares in 2012. The top biotech crops in 
order of hectarage are soybean, maize, cotton and 
canola (Fig. 27.2). Other biotech crops grown in 
2013 are alfalfa, sugar beet, papaya, squash, pop- 
lar, tomato and sweet pepper. Traits also played 
major role in their acceptance worldwide as can 
be seen in Fig. 27.3. For the second consecutive 
year, developing countries planted more biotech 
crops than industrial countries (Fig. 27.4). 
Acceptance of some GM crops in comparison 
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with non-GM counterparts can be seen in 
Fig. 27.5. Farmers from Latin America, Asia and 
Africa collectively grew 94.1 million hectares or 
54 % of the global 175.2 million hectares of bio- 
tech crops in 2013, compared with industrial 
countries at 81.1 million hectares, or 46 % of the 
global total. Twenty-seven developing and indus- 
trial countries planted biotech crops in 2013. Of 
the 27 countries which planted biotech crops in 
2013, 19 were developing and 8 were industrial 
countries (Figs. 27.6 and 27.7). The five lead 
developing countries in Latin America (Brazil, 
Argentina), Asia (India, China), and Africa 
(South Africa) grew 47 % of global biotech crops 
(James 2006 Global status of commercialised 
biotech/GM crops: 200642). Certainly these GE 
crops are successful. 


27.5 Genetically Modified Crops 
and Their Traits 


The need for better traits (characters) in the plant 
is the base of GM plant development, and the skill 
and knowledge of transferring gene into plants 





Fig.27.2 Global area of biotech crops by crops (Source: James 2013) 
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Fig. 27.3 Global area of biotech crops by traits (Source: James 2013) 
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Fig. 27.4 Global area of biotech crops in industrial and developing countries (Source: James 2013) 
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Global Adoption Rates (%) for Principal 
Biotech Crops (Million Hectares, Million Acres), 2013 
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Fig.27.5 Global adoption rate for main biotech crops (Source: (James 2013)) 
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Fig.27.6 Global map of biotech crop countries (Source: James 2013) 
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27 countries which have adopted 


Increase over 2012 


Source: Clive James, 2013. 


biotech crops 


hectares. 


Fig.27.7 Global area of biotech crop (Source: James 2013) 


indorsed the conception of some of the primary 
transgenic crops like tobacco, tomato, corn, soy- 
bean and cotton (Bevan et al. 1983; Martineau 
2001; Gordon-Kamm et al. 1990; McCabe et al. 
1988; Umbeck et al. 1987; Nandeshwar et al. 
2009) (Table 27.1). Bacterial genes were intro- 
duced in crops to provide defence against pests 
and diseases which ultimately condensed applica- 
tion of pesticides in the field (Gatehouse 2008). 
Herbicide-tolerant crops help in weed manage- 
ment practices a lot. This aided the acceptance of 
more environmentally friendly farming practices 
which lead to improved soil quality and dimin- 
ished greenhouse gas emissions and fertiliser use, 
on the other hand refining the quality of water 
run-off to streams, rivers, lakes and aquifers 
(Cerdeira and Duke 2006; Fernandez-Cornejo 
et al. 2012; Marra et al. 2004; Qaim 2005; Raven 
2010; Scheffe 2008; Shah et al. 1986). 

Apart from these qualities, researchers have 
also proposed on traits like increased vitamin and 
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mineral contents of grains by enhanced amino 
acid profiles (Lucca et al. 2006), better vegetable 
oils by improving nutritional composition of oil- 
seed (Knauf 1987) and crops with increased tol- 
erance to biotic and abiotic stresses (Wang et al. 
2003). The production of active medical com- 
pounds and industrial enzymes (Davies 2010) 
and bioplastics (Poirier et al. 1995) (Table 27.2). 

In the history of GE plants, GE tobacco was 
the first candidate to be reported in 1983 (Bevan 
et al. 1983) but not grown commercially. Flavr 
Savr'™ tomato was the first GM crop commer- 
cialised in 1994 with longer shelf life (Paoletti 
et al. 2008; Martineau 2001). This was followed 
by other commercial GM crop candidates which 
move in the market such as canola, corn, cotton, 
soybean, alfalfa and others (Table 27.1). 
Table 27.3 provides the list of countries involved 
in GM crop production and import, and Table 27.4 
shows the companies involved in research and 
development of GM crops. 
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Table 27.1 List of genetically modified crops 


1. Alfalfa (Medicago sativa) 

2; Argentine Canola (Brassica napus) 
3) Bean (Phaseolus vulgaris) 

4. Carnation (Dianthus caryophyllus) 
5. Chicory (Cichorium intybus) 

6. Cotton (Gossypium hirsutum L.) 

7. Creeping bentgrass (Agrostis stolonifera) 
8. Eggplant (Solanum melongena) 

9. Flax (Linum usitatissimum L.) 

10. Maize (Zea mays L.) 

11. Melon (Cucumis melo) 

12. Papaya (Carica papaya) 

13. Petunia (Petunia hybrida) 

14.  ~=Plum (Prunus domestica) 

15. Polish canola (Brassica rapa) 

16. — Poplar (Populus sp.) 

17. — Potato (Solanum tuberosum L.) 

18. — Rice (Oryza sativa L.) 

19. — Rose (Rosa hybrida) 

20. Soybean (Glycine max L.) 

21. Squash (Cucurbita pepo) 

22. Sugar beet (Beta vulgaris) 

23. Sugarcane (Saccharum sp.) 

24. Sweet pepper (Capsicum annuum) 
25. Tobacco (Nicotiana tabacum L.) 
26. Tomato (Lycopersicon esculentum) 


27. Wheat (Triticum aestivum) 


Source: ISSA, GM Approval Database (http://www.isaaa. 
org/) 


Table 27.2 List of traits/characters involved in GM crops 


1. 2,4-D herbicide tolerance 

3. Anti-allergy 

5. Coleopteran insect resistance 
7. Delayed ripening/senescence 
9. Drought stress tolerance 

11. Fertility restoration 

13. Glyphosate herbicide tolerance 
15. Lepidopteran insect resistance 
17. Mannose metabolism 

19. Modified alpha amylase 

21. Modified flower colour 

23. Modified starch/carbohydrate 
25. Nicotine reduction 

27. Oxynil herbicide tolerance 

29. Sulfonylurea herbicide tolerance 
31. Visual markers 
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Though GE crops have considerable large 
acreage, they lack the diversity of crop types and 
traits in commercial production. Some of the lim- 
ited minor acreage GE crops, which are commer- 
cially successful, are papaya (Carica papaya), 
certain types of squash (Cucurbita sp.) and 
sweetcorn (James 2006). 

Most of the major GE crops are based on traits 
like pest protection via genes from Bt or HT, 
largely showing tolerance on Monsanto’s 
RoundUp® herbicide and Bayer’s Liberty® her- 
bicide. Bt cotton resistance to bollworms is 
among the most accepted all over the world. 
Recently, maize engineered for rootworm and 
European corn borer resistance (both Bt based) 
and tolerance to RoundUp® herbicide. 

The only engineered whole fruit available 
commercially in the USA today is GE papaya by 
the name ‘Rainbow’ and ‘SunUp’, and _ this 
papaya contains a viral coat protein gene 
(Gonsalves et al. 2000; Gordon-Kamm et al. 
1990; Lius et al. 1997). GE squash (yellow crook- 
neck, straight neck and zucchini) is another 
whole GE food available in the USA, and it is 
resistant to watermelon mosaic virus 2 (WMV2), 
zucchini yellow mosaic virus (ZYMV) and 
cucumber mosaic virus (CMV) (Tricoli et al. 
1995; Gaba et al. 2004). 


Altered lignin production 


2 

4. Antibiotic resistance 

6. Delayed fruit softening 

8. Dicamba herbicide tolerance 
10. Enhanced photosynthesis/yield 
12. Glufosinate herbicide tolerance 
14. Isoxaflutole herbicide tolerance 
16. Male sterility 

18. Mesotrione herbicide tolerance 
Modified amino acid 

. Modified oil/fatty acid 
Multiple insect resistance 
Nopaline synthesis 

28. Phytase production 

Viral disease resistance 


Source: ISSA, GM Approval Database (http://www.isaaa.org/) 
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Table 27.3 List of countries with GM crop approvals 


1. Argentina 

2. Australia 

3: Bangladesh 
4. Bolivia 

oP Brazil 

6. Burkina Faso 
ye Canada 

8. Chile 

9. China 

10. Colombia 

1]. Costa Rica 
12. Egypt 

13. European Union 
14. Honduras 

15. India 

16. Indonesia 

Li; Iran 

18. Japan 

19. Malaysia 

20. Mexico 

vA Myanmar 

22; New Zealand 
23. Norway 

24. Pakistan 

25; Panama 

26. Paraguay 

21s Philippines 
28. Russian Federation 
29: Singapore 
30. South Africa 
31. South Korea 
= ue Switzerland 
33. Taiwan 

34. Thailand 

aD. Turkey 

36. United States of America 
37. Uruguay 


Source: ISSA, GM Approval Database (http://www.isaaa. 
org/) 


Another GE food in USA is GE sweetcorn, 
engineered with a Bt gene which provides protec- 
tion against earworms (Helicoverpa zea), very 
notorious in North America. Damage by earworm 
results in subsequent fungal and bacterial attack 
and quality loss (Horvath 2003). Bt expression 
reduces insect attack and as a result reduces dam- 
age of insects, and mycotoxigenic fungi numbers 
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are decreased, and this results in lower levels of 
mycotoxins, such as fumonisins, which have 
toxic effects on humans (Westhuizen et al. 2003). 

Pest resistance traits are very important in 
taming crop performance by protecting against 
pests. A gene from wild Mexican potato genome 
(Solanum tuberosum) put into cultivated potato, 
allowing the GM potato to survive exposure to 
the many races of Phytophthora infestans, caus- 
ative agent for the Irish potato famine (Song et al. 
2003). Mi gene native to tomato was upregulated, 
which provides resistance against the root-knot 
nematode (Rossi et al. 1998). Europe which is 
reluctant to GM crops also held first field trial of 
GE grapes (Vitis vinifera) which took place in the 
northern Alsace region of France in 2005. A coat 
protein gene from fan leaf virus was inserted into 
the grape rootstock (Bouquet et al. 2003). 

Crop yield is another trait in GM crops. In 
2001, transgenic rice plants exhibit higher photo- 
synthesis due to expression of maize proteins 
pyruvate orthophosphate dikinase (PPDK) and 
phosphoenolpyruvate carboxylase (PEPC) (Ku 
et al. 2001). 

Nitrogen use efficiency is also another impor- 
tant agronomic trait which aimed at reducing 
fertiliser usage and increasing sustainability. 
The plant-specific transcription factor Dof], 
when introduced into the model plant species 
Arabidopsis, increased nitrogen content by 
~30 %, improving growth under low-nitrogen 
conditions (Yanagisawa et al. 2004). 

High salt and high and low water availability 
are the important abiotic stress factors for which 
we need to improve crops. In this league, CBF 
gene from cold response pathway of Arabidopsis 
induces expression of COR (cold regulator) 
genes which enhances the tolerance to freeze in 
non-acclimated plants (Liu et al. 2004). Similarly, 
GM tomato which is engineered to overexpress 
vacuolar Na*/H* antiport is able to produce fruits 
in nearly 40 % of seawater concentration, 1.e. 
around 200 mM sodium chloride, and fruits 
showed very less amount of sodium chloride 
(Zhang et al. 2001). Similarly GM rice was pro- 
duced with incorporation of three genes, which 
enriched rice with vitamin A, well known as 
Golden Rice (Park et al. 2005). 
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Table 27.4 List of GM developers 


Sl eesd (eal (om 


boc Osea et (ec oak 


10. 


11. 
12. 
13. 
14. 
15. 
16. 


17. 
18. 
19. 
20. 
ZN, 
a 
2): 
24. 


2D: 
26. 


21, 
28. 


29: 


30. 


31. 
32. 
33. 


34. 
BD: 
36. 
37. 
38. 


Agricultural Biotech Research Institute (Iran) 
Agritope Inc. (USA) 
BASF 


Bayer CropScience (including fully and partly 
owned companies) 


Bayer CropScience and MS Technologies LLC 
Bayer CropScience and Syngenta 

Beijing University 

Bejo Zaden BV (Netherlands) 

Central Institute for Cotton Research and 
University of Agricultural Sciences Dharwad 
(India) 

Centre Bioengineering, Russian Academy of 
Sciences 

Chinese Academy of Agricultural Sciences 
Cornell University and University of Hawaii 
Cotton and Sericulture Department (Myanmar) 
DNA Plant Technology Corporation (USA) 
Dow AgroSciences LLC 


Dow AgroSciences LLC and DuPont (Pioneer 
Hi-Bred International Inc.) 


DuPont (Pioneer Hi-Bred International Inc.) 
EMBRAPA (Brazil) 

Florigene Pty Ltd. (Australia) 

Genective S.A. 

Huazhong Agricultural University (China) 
Institute of Microbiology, CAS (China) 

JK Agri Genetics Ltd (India) 


Maharashtra Hybrid Seed Company 
(MAHYCO) 


MetaHelix Life Sciences Pvt. Ltd (India) 


Monsanto Company (including fully and partly 
owned companies) 


Monsanto Company and Bayer CropScience 


Monsanto Company and Dow AgroSciences 
LLC 


Monsanto Company and DuPont (Pioneer 
Hi-Bred International Inc.) 


Monsanto Company and Forage Genetics 
International 


Monsanto Company and Scotts Seeds 
Nath Seeds/Global Transgenes Ltd (India) 
National Institute of Agrobiological Sciences 
(Japan) 
Novartis Seeds and Monsanto Company 
Origin Agritech (China) 
PT Perkebunan Nusantara XI (Persero) 
Renessen LLC (Netherlands) 
Renessen LLC (Netherlands) and Monsanto 
Company 

(continued) 
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Table 27.4 (continued) 


39. — Research Institute of Forestry (China) 
40. SEITAS.A. (France) 


41. Seminis Vegetable Seeds (Canada) and 
Monsanto Company (Asgrow) 


42. South China Agricultural University 
43. Stine Seed Farm, Inc. (USA) 

44. Suntory Limited (Japan) 

45. Syngenta 

46. Syngenta and Monsanto Company 


47. United States Department of Agriculture — 
Agricultural Research Service 


48. University of Florida 

49. University of Saskatchewan 

50. Vector Tobacco Inc. (USA) 

51. Zeneca Plant Science and Petoseed Company 


Source: ISSA, GM Approval Database (http://www. isaaa. 
org/) 


GM plants have also been engineered to focus 
problems related to human diseases. These 
include development of vaccine against pneu- 
monic and bubonic plague (Alvarez et al. 2006); 
a potato-based vaccine for hepatitis B, shown to 
raise immunological responses in humans 
(Thanavala et al. 2005); a GM pollen vaccine that 
reduces allergy symptoms (Niederberger et al. 
2004); and an edible rice-based vaccine targeted 
at alleviating allergic diseases such as asthma, 
seasonal allergies and atopic dermatitis (Takagi 
et al. 2006). 

Development of alternative energy source is 
also an important trait which scientist wants to 
achieve, which helps to tackle non-renewable 
energy crises and greenhouse gas emissions. This 
involves engineering Chlamydomonas 
hardtii to produce hydrogen gas, a renewable 
energy source (Melis and Happe 2001). 


rein- 


27.6 GM Food Products 


A number of processed foods are made by 
enzymes from the organisms or whole organism 
that change the nature of food, for example, yeast 
used for beer and wine, bacteria for yogurt and 
blue cheese made from fungi. These organisms 
and enzymes are modified to give better results. 
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Cheesemaking enzyme, i.e. rennin which is 
isolated from rennet of stomach of slaughtered 
calves. In cheesemaking, rennet is used to coagu- 
late milk to separate the curds (solids) and whey 
(liquid). Recombinant rennin or chymosin was 
the first protein produced through rDNA means 
to be used in food (Green et al. 1985). The gene 
responsible for chymosin production was cloned 
in yeast and Escherichia coli. And these GM 
organisms were then used for large-scale produc- 
tion of enzyme. Engineered chymosin is cur- 
rently used in approximately 60 % of US hard 
cheese products (Biotechnol Ind Organ (BIO) 
2007). Other products produced by rDNA meth- 
ods include food supplements, such as vitamin 
B2 (riboflavin) (Perkins et al. 1999), a-amylase 
(used to produce high-fructose corn syrup and 
dry beer) and lactase (added to milk to reduce the 
lactose content for persons with lactose 
intolerance). 


27.7 Development 
of Transgenic Crops 


For the development of transgenic crops, the bio- 
chemistry of the gene, its mode of action, expres- 
sion pathway, safety of the gene and gene product 
must be known in detail. And that is the first and 
most important step of the procedure. Each step 
advancement represents a financial commitment 
on behalf of technology developers of between 
USD10 million and USD46 million (McDougall 
2011). 

The process of genetic engineering can be 
divided into the following steps: 


27.7.1 Step 1: Gene Discovery 


An idea of commercial product, once formulated, 
the work of the gene discovery process begins. A 
number of candidate genes are considered from 
variety of sources like genomes of plants, micro- 
organisms, animals, study of naturally occurring 
biological processes and review of relevant scien- 
tific literature. A number of candidate genes are 
then tested in model plants like Arabidopsis 
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thaliana and tobacco and in plant cell culture to 
understand their nature. 

For example, Cry gene discovery from soil 
bacterium Bacillus thuringiensis (Bt) is a good 
case study for gene discovery. Bt have been used 
for many years for spraying to control insects on 
crops (Schlenkera and Robertsb 2009), and now 
Bt have been shown to have a number of genes 
which can be used for insect resistance trait 
(Scheffe 2008). The gene coding for the insecti- 
cidal toxin (Bt toxin) was then inserted into a 
crop plant, making it resistant to feeding damage 
by the target pest. Nowadays due to bioinformat- 
ics and proteomics studies, candidate genes can 
then be engineered to produce proteins with 
increased spectra of activity based on their struc- 
ture and function. 


27.7.2 Step 2: Nucleic Acid (DNA/ 
RNA) Extraction 


Nucleic acid, 1.e. either DNA or RNA extraction, 
is the first step in the genetic engineering process. 
Most reliable and trusted methods are used for 
isolating these components from the cell. The 
normal isolation procedure starts with the disrup- 
tion of the cell, then separation of cell debris 
from nucleic acids and precipitation of the DNA/ 
RNA which form threadlike pellets. 


27.7.3 Step 3: Gene Cloning 


There are basically four stages in any cloning 
experiment: generation of DNA fragments (by 
PCR or restriction digestion), joining to a cloning 
vector, propagation in a host cell and selection of 
the required sequence (Fig. 27.8). In DNA extrac- 
tion, all DNA from the desired organism is 
extracted. This genomic DNA is treated with spe- 
cific enzymes called restriction enzymes cutting 
it into smaller fragments with defined ends to 
allow it to be cloned into bacterial vectors. Copies 
of the vector will then harbour many different 
inserts of the genome. These vectors are trans- 
formed into bacterial cells and thousands of cop- 
les are produced. Using information relating to 
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Fig.27.8 Gene cloning 
procedure (Source: 
ISAAA, Agricultural 
Biotechnology Biotech 
Booklet-3 (http://www. 
isaaa.org/)) 


Fig.27.9 Gene stacking 
(Source: ISAAA, 
Agricultural Biotechnology 
Biotech Booklet-3 (http:// 
www.isaaa.org/)) 
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DNA cloning of specific fragments 
into a self-replicating genetic 
element so that the DNA molecule 
can be reproduced 


PARTS OF A GENE 


Promoter — Initiates transcription has the promoter 
sequences including the AUG bases 


Coding region — codes fora 


gene product 


specific molecular marker sequences and the 
desired phenotype, the vector harbouring the 
desired sequence is detected, selected and iso- 
lated and clones are produced. Restriction 
enzymes are again utilised to determine if the 
desired gene insert was cloned completely and 
correctly. Or one can design a gene specific 
primer or degenerated primer from the sequence 
data available on Internet and then can amplify 
the specific gene by PCR, and the PCR product 
can be cloned in cloning vector and then can be 
sequenced and characterised (Fig. 27.8). 





Termination sequence — marks 
where to stop transcription, UUU, 
UUA, etc 


27.7.4 Step 4: Gene Design 
and Packaging 


Once the gene of interest has been cloned, it has to 
be linked to pieces of DNA that will control its 
expression inside the plant cell (Fig. 27.9). These 
pieces of DNA will switch on (promoter) and off 
(terminator) the expression of the gene inserted. 
Gene designing/packaging can be done by replac- 
ing an existing promoter with a new one, incorpo- 
rating a selectable marker gene and reporter gene 
and adding gene enhancer fragments, introns and 
organelle-localising sequences, among others. 
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27.7.4.1 Promoters 

Promoters allow differential expression of genes. 
For instance, some promoters cause the inserted 
genes to be expressed all the time, in all parts of 
the plant (constitutive), whereas others allow 
expression only at certain stages of plant growth, 
in certain plant tissues, or in response to external 
environmental signals. The amount of the gene 
product to be expressed is also controlled by the 
promoter. Some promoters are weak, whereas 
others are strong. Controlling the gene expres- 
sion 1s an advantage in developing GM plants. 


27.7.4.2 Selectable Marker Genes 
Selectable marker genes are usually linked to the 
gene of interest to facilitate its detection once 
inside the plant tissues. This enables the selection 
of cells that have been successfully incorporated 
with the gene of interest, thus saving consider- 
able expense and effort. Genetic engineers used 
antibiotic resistance and herbicide resistance 
marker genes to detect cells that contain the 
inserted gene. Cells that survive the addition of 
marker agents to the growth medium indicate the 
presence of the inserted gene. Although increase 
in antibiotic resistance in humans and animals is 
unlikely to occur using antibiotic resistance 
marker, genes coding for resistance to non- 
medically important antibiotics are preferred. In 
addition, alternative types of marker genes have 
been developed which are related to plant metab- 
olism such as phosphomannose isomerase, 
xylose isomerase and others. 


27.7.4.3 Reporter Genes 

Reporter genes are cloned into the vector in close 
proximity to the gene of interest, to facilitate the 
identification of transformed cells as well as to 
determine the correct expression of the inserted 
gene. Reporter genes that have been used include 
the beta-glucuronidase gene (gusA gene) which 
acts on a particular substrate producing a blue 
product, hence making the transformed cells 
blue; the green fluorescent protein (gfp) which 
allows transformed cells to glow under a green 
light; and luciferase gene that allows cells to 
glow in the dark, among others. 
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27.7.4.4 Enhancers 

Several genetic sequences can also be cloned in 
front of the promoter sequences (enhancers) or 
within the genetic sequence itself (introns, or 
non-coding sequences) to promote gene expres- 
sion. An example is the cloning of the cauliflower 
mosaic virus promoter enhancers in front of the 
plant promoter. 

Once the gene of interest is packaged together 
with the promoter, reporter and the marker gene, it 
is then introduced into a bacterium to allow for the 
creation of many copies of the gene package. The 
DNA isolated from the bacterial clones can then 
be used for plant cell transformation using particle 
bombardment. If however the use of bacteria 
Agrobacterium tumefaciens is preferred in the 
plant transformation, the whole gene package 
should be cloned in between two border sequences 
(left and right border) of a binary vector plasmid. 
This will allow processing of the Agrobacterium 
so that only the transfer DNA (T-DNA) will be 
incorporated into the plant genome (Fig. 27.10). 


27.7.5 Step 5: Transformation 


The most common methods used to introduce the 
gene package into the binary vector into plant 
cells in a process called transformation or gene 
insertion include biolistic transformation using 
the gene gun and Agrobacterium-mediated trans- 
formation (Altpeter et al. 2005; Tzfira and 
Citovsky 2006). There are many techniques of 
gene transformation in plants which include in- 
planta transformation procedure. 


27.7.5.1 Particle Bombardment 

Particle bombardment is a mechanical method of 
introducing the desired gene. The desired genetic 
sequence is cloned into a plant DNA vector and 
introduced into the plant using the gene gun or 
particle gun. As in the common gun, the gene gun 
uses minute particles of tungsten or gold as the 
bullet. These particles are coated with the DNA 
solution and fired to the plant cells through the 
force of the helium gas inside a vacuum-filled 
chamber. The DNA and the _ tungsten/gold 
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Fig.27.10 Gene construct components (Source: ISAAA, Agricultural Biotechnology Biotech Booklet-3 (http://www. 


isaaa.org/). Source: Ye et al. (2000)) 


particles get inside the cell, and within 12 h, the 
inserted DNA gets inside the nucleus and inte- 
grated with the plant DNA. The tungsten/gold 
particles are sequestered to the vacuole and elim- 
inated later. Transformed cells are cultured 
in vitro and induced to form small plants (regen- 
eration) that express the inserted gene. 


27.7.5.2 Agrobacterium tumefaciens- 
Mediated Transformation 
The ‘sharing’ of DNA among living forms is well 
documented as a natural phenomenon. For thou- 
sands of years, genes have moved from one organ- 
ism to another. For example, Agrobacterium 
tumefaciens, a soil bactertum known as ‘nature’s 
own genetic engineer’, has the natural ability to 
genetically engineer plants. It causes crown gall dis- 
ease in a wide range of broad-leaved plants, such as 
apple, pear, peach, cherry, almond, raspberry and 
roses. The disease gains its name from the large 
tumour-like swellings (galls) that typically occur at 
the crown of the plant, just above soil level. 
Basically, the bactertum transfers part of its 
DNA to the plant, and this DNA integrates into the 
plant’s genome, causing the production of tumours 
and associated changes in plant metabolism. 
Molecular biologists have utilised this biological 


mechanism to improve crops. The genes that cause 
the galls are removed and replaced with genes 
coding for desirable traits. Plant cells infected with 
the bacterium will not form galls but produce cells 
containing the desired gene, which when cultured 
in nutrient medium will regenerate into plants and 
manifest the desired trait. 

The main goal in any transformation proce- 
dure is to introduce the gene of interest into the 
nucleus of the cell without affecting the cell’s 
ability to survive. If the introduced gene is func- 
tional, and the gene product is synthesised, then 
the plant is said to be transformed. Once the 
inserted gene is stable, inherited and expressed in 
subsequent generations, then the plant is consid- 
ered a transgenic. 


27.7.6 Step 6:In Vitro Regeneration 
of Plant 


The parts of the plant to be made transgenic, 1.e. 
explants are treated with Agrobacterium containing 
desire gene then grown on selective media and 
regenerated into whole plant in vitro on MS 
media or other defined media. After complete 
regeneration and establishment, they are tested for 
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presence of a transgene using various techniques 
of molecular biology. 


27.7.7 Step 7: Detection 
of Inserted Genes 


Molecular detection methods have been devel- 
oped to determine the integrity of the transgene 
(introduced gene) into the plant cell. 


27.7.7.1 Polymerase Chain Reaction 
Polymerase chain reaction or PCR is a quick test 
to determine if the regenerated transgenic cells or 
plants contain the gene. It uses a set of primers 
(DNA fragments), forward and backward prim- 
ers, whose nucleotide sequences are based on the 
sequence of the inserted gene. The primers and 
single nucleotides are incubated with the single- 
stranded genomic DNA, and several cycles of 
DNA amplification are conducted in a PCR 
machine. Analysis of the PCR products in aga- 
rose gel will show if the plants are really trans- 
formed when DNA fragments equivalent in size 
with the inserted gene are present and amplified 
in the form of bands. 


27.7.7.2 Southern Blot Analysis 

Determines the Integrity 

of the Inserted Gene 
Southern blot analysis determines the integrity of 
the inserted gene whether the gene is complete 
and not fragmented, at the correct orientation, 
and with one copy number. The DNA coding 
sequence is the probe binding to the single- 
stranded genomic DNA of the transgenic plant 
which is implanted on a nitrocellulose paper. 
Autoradiography will reveal the transgenic status 
of the plant. One copy of the transgene is desired 
for optimum expression. 


27.7.7.3 Northern Blot Analysis 

Northern blot analysis determines whether the 
transcript or the messenger RNA (mRNA) of the 
introduced DNA is present and is correctly tran- 
scribed in the transgenic plant. The messenger 
RNA of the transgenic plants is isolated and pro- 
cessed to bind to the nitrocellulose membrane. 
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Labelled DNA is used to bind to the mRNA and 
can be visualised through autoradiography. 


27.7.7.4 Western Blot Analysis 

Western blot analysis or protein immunoblotting 
is an analytical technique used to detect whether 
the transgenic plants produce the specific protein 
product of the introduced gene. Protein samples 
are extracted from the transgenic plants, pro- 
cessed into denatured proteins and transferred to 
a nitrocellulose membrane. The protein is then 
probed or detected using the antibodies specific 
to the target protein. 


27.7.7.5 Quantitative PCR Analysis 
Quantitative PCR analysis is the latest method 
where we can monitor real-time data of PCR 
amplification reaction. In this method we use 
very specific primers for the gene, and they were 
tagged by fluorescence dye which helps us to 
monitor real-time development in the reaction. 
This method can be used to study gene expres- 
sion and copy number detection of the gene. 


27.7.8 Step 8: Transgenic Plant 
Testing 


Controlled-environment testing in a greenhouse 
or growth chamber allows for rapid screening of 
many transgenic events. For example, transgenic 
plants expressing a protein with insecticidal 
properties can be screened via a high-throughput 
insect assay in a greenhouse, and events that are 
being developed for herbicide tolerance can be 
treated with the appropriate herbicide to select 
for events that exhibit the desired level of toler- 
ance. The controlled-environment screening can 
be combined with high-throughput imaging sys- 
tems to measure and quantify plant growth and 
development under these specific conditions. 
Transgenic events that meet performance cri- 
teria under controlled conditions are then tested 
in field trials and closely monitored under regu- 
lated conditions. These trials include efficacy 
tests to evaluate the performance of the devel- 
oped trait; in the case of insect-control traits, the 
events are tested by infesting test plants with the 
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target insect, either artificially or via natural 
insect pressure, and then scoring the level of 
damage on the plants after insect feeding. The 
agronomic performance of the event is tested in 
addition to trait efficacy. To be considered for 
commercialisation, an event that shows excellent 
protection against insect-feeding damage must 
also demonstrate agronomic and phenotypic 
characteristics that are equivalent or superior to 
those of a non-transgenic isoline in the absence 
of or with minimal insect pressure. 


27.7.9 Step 9: Backcross Breeding (If 
Needed) 


Genetic transformation is usually conducted in 
elite or commercial varieties which already pos- 
sess the desired agronomic traits but lack the 
important trait of the transgene. Thus, once suc- 
cessfully conducted, the genetically modified 
plant will be easily recommended for commer- 
cialisation if it shows stability in several genera- 
tions and upon successfully passing and fulfilling 
varietal registration requirements. 

However, some plant transformations may 
have been performed in plant varieties which are 
amenable to genetic transformation but are not 
adapted nor important in the target country. There 
may also be sterility problems in the transgenic 
plant. In such cases, conventional plant breeding 
is performed where the transgenic plant becomes 
the pollen source in the breeding programme and 
the elite lines or commercial varieties as the 
recurrent parent. Backcross breeding enables the 
combination of the desired traits of the recurrent 
parent and the transgenic line in the offspring. 

The length of time in developing transgenic 
plant depends upon the gene, crop species, avail- 
able resources and regulatory approval. It varies 
from 6 to 15 years before a new transgenic plant 
or hybrid is ready for commercial release. 


27.8 Biosafety 


The theoretical framework for safety evaluation 
of genetically engineered crops: 
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Through a structure of science-based risk 
assessment and risk management measures, 
regulatory agencies evaluate GM crops and 
derived products, globally. Assessment stan- 
dards are established by regulatory agencies 
around the world. GM crops must meet all 
these standards. This process is composed of 
four steps: hazard identification, hazard char- 
acterisation, exposure assessment and integra- 
tive risk characterisation (Codex Alimentarius 
2001; Eur Comm 2000; Eur Food Saf Auth 
2006). The safety of GE crops is determined as 
with conventional crops, based on the standard 
that there must be ‘a reasonable certainty that 
no harm will result from intended uses under 
the anticipated conditions of consumption’ 
(Organ Econ Co-op Dev 1993a, b). This stan- 
dard is important because no food, either con- 
ventional or genetically engineered, is safe in 
absolute terms. For example, many conven- 
tional foods, such as potatoes, soybeans and 
celery, contain toxins and/or anti-nutrients, 
and, like most traditional foods, they have not 
been subject to food or feed safety studies. 
Instead, the safety of these crops and foods has 
been established by virtue of their long history 
of safe use (Constable et al. 2007). 


27.8.1 A Comparative Approach 
to Risk Assessment 


Safety of a GE crop assessment historically 
begins with a comparative assessment, also 
referred to as the concept of ‘substantial equiva- 
lence’ (Chassy et al. 2004; Organ Econ Co-op 
Dev 1993a, b). This tactic helps categorise simi- 
larities and differences between the newly 
developed crop and a conventional counterpart 
that has a long history of safe use (Constable 
et al. 2007; Kok et al. 2008). Any diversion from 
the conventional plant will be focused of the 
food, feed and environmental safety assessment, 
which then prove whether GM crop can be con- 
sidered as safe or not (Codex Alimentarius 
2009; Food Agric Organ UN, World Health 
Organ 1996, 2000; Organ Econ Co-op Dev 
1993a, b). 
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27.8.2 Safety Assessment Strategy 
for Genetically 
Engineered Crops 


There is a rigorous process of safety insurance 
for GM crops when to use as food, as animal feed 
and as an integrated part of the environment. 
Many important assumptions were drawn during 
the safety studies undertaken during the develop- 
ment of the GM crops. In the history of mankind, 
GM crops go through much difficult and thor- 
ough safety assessments. This assessment is 
revived by multiple agencies under strict rules 
with national and international safety assessment 
guidelines (Paoletti et al. 2008). 


27.8.3 Food and Feed Safety 


A number of comparative and toxicological 
approaches and guidelines which are internation- 
ally approved are followed to assess the safety of 
food and feed of GM crop (Codex Alimentarius 
2003a, b, 2009; Eur Food Saf Auth 2008; Faustman 
and Omenn 2008; (Food Agric Organ UN, World 
Health Organ 1996; Organ Econ Co-op Dev 1998). 
Safety and nature of the introduced protein is a 
main concern in this assessment. 

Other important point is assessing any poten- 
tial compositional differences in GM crop com- 
pared with a conventional control. This includes 
assessing the levels of numerous nutrients, anti- 
nutrients, toxins, allergens and other important 
compounds normally present in crops—e.g. mac- 
ronutrients such as carbohydrates, proteins 
(including all amino acids), oils (including fatty 
acids) and vitamins and minerals—to ensure that 
the GE crops are compositionally equivalent to 
conventional varieties (Organ Econ Co-op Dev 
2001, 2002, 2004). 90-day animal toxicity stud- 
ies adapted from Organisation for Economic 
Co-operation and Development (OECD) Test 
Guideline 408 are also performed using the grain 
or another harvested component of the GE crop 
(Organ Econ Co-op Dev 1998). 
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27.8.4 Environmental Risk 
Assessment (ERA) 


Potential harmful effects on environment of GM 
crop product are assessed through ERA. It mea- 
sures the ecological impact of GM crop on envi- 
ronment. The guidance from the US 
Environmental Protection Agency is consistent 
with other international scientific guidance docu- 
ments (Aust Gov Dep Health Ageing Off Gene 
Technol Regul 2009; Can Food Insp Agency 
2012; Eur Food Saf Auth 2010; Organ Econ 
Co-op Dev 1993a, b) and publications for ERAs 
of GE crops Nickson 2008; Raybould 2006, 
2007; Wolt et al. 2010). 


27.9 Pros and Cons of GM Crops 


So far we have seen that there are great goals that 
can be achieved by this rDNA technology and 
GM crops. Increased production, greater nutri- 
tional value, greater resistance to biotic and abi- 
otic stress, better taste and good for human health 
are some of the main characteristics/traits that 
GM crops can provide over natural crops. But, on 
the other hand, there is a great concern about the 
odd effects like unforeseen hostile health effects, 
potential damage to environment, commercial 
exploitation by private firms, etc. 


27.9.1 Pros 
27.9.1.1 Improved Tolerance to Biotic 
and Abiotic Stresses 

As we have seen with the selection of proper can- 
didate gene, crops can be made tolerant to biotic 
stresses like they are made resistant to insects, 
pest and diseases (like Bt cotton, Bt corn, etc.) 
which ultimately reduces the use of insecticides, 
fungicides, pesticides and other chemicals in the 
agriculture which helps to improve gross income 
of the farmers and also helps to reduce the load of 
harmful chemicals in the environment. 
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27.9.1.2 Biotech Crops Contribute 

to Food Security, 

Sustainability and Climate 

Change 
From 1996 to 2012, economic gains at the farm 
level of US$116.0 billion were generated glob- 
ally by biotech crops, due to reduced production 
costs and substantial yield gains. Biotech crops 
have reduced the amount of pesticides used by 
497 million kg. In 2012 alone, fewer insecticide 
sprays reduced CO, emissions by 26.7 billion kg, 
equivalent to taking 11.8 million cars off the road 
for a year (James 2006). 


27.9.1.3 Increased Crop Productivity 
GM crops have significantly contributed to the 
food, feed and fibre security. GM crops have pro- 
vided more affordable food by reducing produc- 
tion cost. GM crops allow less ploughing, less 
pesticide uses and less labour which help farmers 
to add in their gross income. 


27.9.1.4 GM Crops Help 

in Conservation 

of Biodiversity 
GM crops are a land-saving technology. We have 
GM crops that can provide more yield in the lim- 
ited availability of land. This protects the defor- 
estation needed for new agriculture land and 
helps to protect biodiversity. 


27.9.1.5 Reduce Agricultural 
Eco-Footprint 

Because of insect- and pest-resistant GM crops, 

the pesticide spraying was reduced by 503 mil- 

lion kg. This has tremendously reduced the 

chemical load in environment due to agriculture 

practices. 


27.9.1.6 Resistance to Herbicides 
Herbicide-resistant GM crops have made it easy 
to control the troublesome weeds in the field. 
This has limited the use of herbicides, which has 
decreased environmental impact from herbicides 
by 18.7 %. 
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27.9.1.7 Help Mitigate Climate 
Changes 

Fewer uses of pesticides, insecticides and herbi- 
cides in the field help to reduce the fuel used in 
the process of development, transport and appli- 
cation. This reduced the CO, emission equivalent 
to removing 11.9 million cars from the road for 
completely 1 year. 


27.9.1.8 Faster Growth 

Crops can be engineered to grow faster, and they 
can be cultivated in the areas where shorter grow- 
ing period is available and also multiple harvest 
seasons can be taken. 


27.9.1.9 Improved Nutritional Quality 
Crops can also be engineered to produce larger 
and better essential vitamins and minerals. They 
can also be altered to increase amount of carbo- 
hydrates, proteins and saturated and unsaturated 
fat content. 


27.9.1.10 Medicine and Vaccine 
Production 

GM crops can also produce medicines and vac- 

cines, so that human treatment can be achieved 

cheaply and efficiently through diet in very poor 

countries. 


27.9.1.11 Improved Taste 

Crops can be engineered to produce better tasting 
food, which may encourage people to eat more 
healthy food that are currently not likable (Fig. 27.11). 


27.9.2 Cons 


27.9.2.1 Unexpected Side Effects 

There are concerns about the unpredictable side 
effects of GM food on human health. The prob- 
lem is allergic reactions; genetic modification 
often mixes or adds new proteins that weren’t 
indigenous to the original plant, which may cause 
new allergic reactions to humans or livestock. 
Most of the GM crops have antibiotic resistance 
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Fig.27.11 Pros of GM crop shows us the promising positive effects of GM crops on human society and world environ- 
ment (Source: Brookes et al. 2014; James 2013) 
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genes in them. It is speculated that when humans 
eat them, these antibiotic features persist in our 
bodies and make actual antibiotic medications 
less effective. But the possibility of this to hap- 
pen is not there or very less. We eat vegetables 
and meat in our day-to-day life for thousands of 
years, but have we seen any example that a human 
or animal show characters of vegetables and ani- 
mals? When we eat something, the nuclear mate- 
rial get digested and fabricated completely. 


27.9.2.2 Labelling of GM Food 

Labelling of GM food is a matter of concern. 
Nowadays so many products are available in the 
market; customer may never know what is he or 
she eating? It may be GM or non-GM food. Not 
all countries have strict rules on labelling of GM 
foods or their ingredients or GM-processed food. 
Vegetarians and vegans might unknowingly eat 
plant-based foods containing genes from 
animals. 


27.9.2.3 Threat to Diversity 

Toxic genes for specific insects may kill other 
beneficial insects, which may have effects on ani- 
mals further up the food chain. Another risk is 
that these foreign genes may escape into wild. If 
like herbicide gene cross passed into wild weed 
species, it can produce super weed which will be 
very hard to control. Making plants resistant to 
bacteria, fungi, viruses and insects can cause 
these pathogens to become more virulent which 
will cause more damage to crops. 


27.9.2.4 Effect on Non-GM Crops 

Pollens from GM crops can transfer on non-GM 
counterpart which may genetically modify the 
non-GM crop which will spread in wild 
varieties. 


27.9.2.5 Overuse of Herbicides 

It is very essential to educate the farmers about 
the GM product they are planting. Herbicide- 
resistant crop may encourage them to spray more 
herbicides for quick results; it may increase the 
load of chemicals in the soil and pollute rivers 
and other waterways. This may poison the fish 
and other animals, birds and plants and can 
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accumulate in all food chains in ecosystem caus- 
ing lethal damage. 


27.9.2.6 Benefits May Not 

Be Available to Everyone 
Due to high price and top-end technology 
involved in the production and due to IPR issues, 
it may be possible that GM technology is being 
monopolised by a small number of private 
companies. 
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Abstract 

For centuries humans have used plants as a source of food, fiber, fuel, and 
medicine because they have the ability to synthesize a vast array of complex 
organic compounds using light, carbon dioxide, and water. Advances in 
recombinant DNA and transgenic technologies during the last several 
decades have opened many new avenues to further exploit plants for 
production of many novel products. The potential to use plants to synthe- 
size diverse native and nonnative industrial and pharmaceutical products 
coupled with the depletion of fossil fuels that are the source of many com- 
mercially important products and the adverse effects of chemical synthesis 
of platform chemicals on the environment have renewed considerable 
interest in using plants for large-scale production of chemicals and value- 
added compounds. To accomplish this, different genetic engineering and 
transformation strategies have been developed for introducing multiple 
genes (gene stacking), modulating their expression with regulatable 
promoters, and targeting the products to a specific compartment in the 
cells. Successful metabolic engineering of plants should lead to sustained 
production of platform chemicals, pharmaceuticals, and biopolymers. In 


M. Golovkin 

Foundation for Advancement of Science, Technology 
and Research, PA Biotechnology Center, 
Doylestown, PA 18902, USA 


A.S.N. Reddy (Db) 

Department of Biology, Program in Molecular Plant 
Biology, Program in Cell and Molecular Biology, 
Colorado State University, 


Faculty of Science, Botany Department, Zagazig Fort Collins, CO 80523-1878, USA 
University, Zagazig 44519, Egypt e-mail: reddy @lamar.colostate.edu 
Bir Bahadur et al. (eds.), Plant Biology and Biotechnology: Volume II; Plant Genomics 551 


and Biotechnology, DOI 10.1007/978-8 1 -322-2283-5_28, © Springer India 2015 


552 


S.E. Abdel-Ghany et al. 


this chapter, we present an overview of different methods that are currently 
used to introduce and manipulate expression of one or more genes into 
plants and discuss some of the recent achievements in producing value- 
added products and pharmaceuticals in plants. 


Keywords 


Metabolic engineering * Pharmaceuticals ¢ Platform chemicals * Value- 
added compounds * Biomass ¢ Butanetriol * Bioenergetics * Bioplastics « 
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28.1 Introduction 

Plants, as photoautotrophs, convert light energy 
into chemical energy and use it for their growth, 
development, and synthesis of many complex 
organic compounds. Humans have been using 
plants for centuries as a renewable source of 
food, fiber, fuel, pharmaceuticals, and other bio- 
materials. Since plants naturally produce a vast 
array of complex organic compounds and sec- 
ondary metabolites, they have been used exten- 
sively to produce many commercially important 
industrial and consumer products (United States 
Department of Agriculture 2008; Saito and 
Matsuda 2010; Yoon et al. 2013). During the last 
century, petroleum-based chemicals were widely 
used to produce diverse products ranging from 
plastic polymers to cosmetics. However, increasing 
demand for petroleum and petroleum-derived 
products due to population growth (United 
Nations 2013), the diminishing supplies of fossil 
fuels, and the adverse impact of producing 
industrial chemicals from fossil fuels on the 
environment renewed the interest in using plants 
to produce industrial chemicals and other value- 
added products in a sustainable and environmen- 
tally safe manner. It is anticipated that plants will 
play an important role in bio-based economy in 
reducing reliance on petrochemicals and in 
producing many unconventional chemicals that 
are currently produced from _ petrochemical 
feedstock and also in producing biofuels (United 
States Department of Agriculture 2008). This 
bio-based approach offers several advantages for 
the production of industrial chemicals. These 
include sustainability as plants are renewable 


sources and the likelihood of being highly cost- 
effective as they use light energy and CO, to pro- 
duce sugars and a vast repertoire of organic 
compounds that can serve as precursors for the syn- 
thesis of desirable carbon compounds. 
Furthermore, bio-based synthesis of industrial 
chemicals is environmentally friendly as compared 
to chemical synthesis. In addition, the development 
of technologies in the 1980s to stably introduce and 
express genes in plants has opened new avenues to 
use plants to produce novel platform chemicals 
and pharmaceutical products that are not normally 
produced in plants (Poirier et al. 1992, 1995; 
Bohmert-Tatarev et al. 2011; Golovkin 2011; 
Abdel-Ghany et al. 2013). 

Advances in plant biotechnology have allowed 
engineering of complex biochemical pathways not 
only by manipulating expression of a single gene 
but also by introducing multiple genes from other 
plants and non-plant systems including bacteria. 
Coordinated expression of multi-subunit proteins 
and manipulation of metabolic pathways have been 
widely used for diverse applications (Table 28.1). 
These include improvement of food and feed qual- 
ity (Fitzpatrick et al. 2012; Tieman et al. 2012), 
increased tolerance of crop plants to abiotic and 
biotic stresses (Rhodes 1994; Nanjo et al. 1999; 
Munns 2005), development of weed and insect 
resistant crops, and use of plants for production of 
polymers (bioplastics), biofuels, platform chemicals, 
and pharmaceuticals (Nikolau et al. 2008; Andrianov 
et al. 2010; Bohmert-Tatarev et al. 2011; Abdel- 
Ghany et al. 2013; Golovkin 2011). 

This chapter will focus on production of com- 
mercially important products such as platform 
chemicals and pharmaceuticals in plants using 
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Table 28.1 Some examples of different applications of plant biotechnology in agriculture, medicine, and other areas 


Agriculture 
Herbicide resistance 
Pathogen resistance 


Manipulate growth and development 
Yield and product quality 
Senescence and stress control 


plant biotechnological approaches. Introduction 
and manipulation of foreign genes in plants is 
fundamental to all biotechnological approaches; 
hence, we will first provide an overview of differ- 
ent methods that are currently used to introduce 
and manipulate expression of one or more genes 
in plants. We will then discuss how these methods 
are used to engineer plants to produce many 
valuable products by manipulating existing meta- 
bolic pathways or introducing novel pathways. 


28.2 Commonly Used Methods 
to Introduce Genes into 
Plants 


Prior to the 1980s, people used to introduce 
desired genes/traits into crop species through 
conventional breeding between the domesticated 
varieties and the wild relatives that have the traits 
of interest, and then new cultivars with desirable 
characteristics are obtained by backcrossing (Yan 
et al. 2010; Powell et al. 2012). This is very 
laborious, time consuming, and limits gene trans- 
fer between sexually compatible plant species. 
However, genes of many desirable traits may not 
be found in crossable relatives of the interested 
crop. The development of a method in the early 
1980s to stably introduce and express a foreign 
gene from other organisms into plants has 
changed the way to introduce new traits into 
plants. As discussed below, it is now possible to 
introduce multiple genes (gene stacking) from 
any organism into the nuclear or chloroplast 
genome of plants; produce proteins constitu- 
tively, in a particular cell type or in response to a 
chemical or environmental cue; and target the 
proteins to a desired cellular location. These 
improvements allowed high level of expression 


Molecular farming 
Pharmaceutical proteins 
Medicinal compounds 


Polymers 


Other areas 
Phytoremediation 
Biofuels 

Energetic materials 


Industrial chemicals/proteins 


Cosmetics 


of multiple genes and allowed engineering of 
plant metabolism to produce desired chemicals. 
In Table 28.2, we summarized various methods 
to introduce and manipulate expression of genes 
in plants and their advantages and disadvantages. 


28.2.1 Agrobacterium-Mediated 
Transformation 


The most commonly used method for introducing 
a gene into a plant genome is Agrobacterium- 
mediated transformation using the soilborne 
pathogen Agrobacterium tumefaciens (Tzfira and 
Citovsky 2003; Valentine 2003; Gelvin 2012). In 
Agrobacterium-mediated transformation, the 
bacterium represents the biological machinery 
that delivers the gene(s) of interest into the plant 
genome. This process involves proteins derived 
from both the bacterial pathogen and the host 
plant (reviewed in Gelvin 2012). In nature, 
Agrobacterium infects some plant species and 
causes crown gall tumors. During infection, the 
Agrobacterium transfers a portion of the DNA, 
called I-DNA, from the tumor-inducing (Ti) 
plasmid into the host where it subsequently 
integrates into the host genome. Wild-type 
T-DNA encodes genes involved in hormone and 
opine biosynthesis that are expressed in host cells 
and cause cell proliferation and tumor formation 
(Gelvin 2012). To introduce desired genes into 
plants, the T-DNA was engineered by replacing 
the hormone and opine biosynthetic genes with 
selectable marker genes and multiple cloning 
sites for insertion of gene(s) of interest flanked by 
the T-DNA left and right border sequences. The 
engineered T-DNA was also moved into a small 
plasmid for easy cloning of genes using an 
Escherichia coli system, and then this vector with 
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Table 28.2 Summary of current plant transformation and expression systems 


Expression 
Major system levels 
Stable Nuclear 
Constitutive expression Modest 
Transgenic plants and (<1 % TSP) 
cell cultures (Seeds >1 %) 
Chloroplast 
Constitutive expression High 
Transplastomic plants (>5 % TSP) 
Transient Plant viruses 
Transient expression! High 
modern systems/ (>10 % TSP) 


wild-type plants 


Advantages 


Inheritable gene 

Broad host range 

Possible direct consumption 
Plant-specific PTM 
(glycosylation) 

High yield and scale-up 


High expression and 
accumulation 

Inheritable gene 

No gene transfer though pollen 
No position effect and gene 
silencing 

Scale-up capacity 


High expression and 
accumulation 

Easy molecular manipulations 
Simple and quick transformation 
Short production times 

High scale-up capacity 

Less environmental risks 
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Disadvantages 


Low expression 
Lengthy procedure 
Positional effect and 
gene silencing 
Environmental risks 


Prokaryotic machinery 
No glycosylation 
Narrow host range 
Somatic propagation 
Lengthy procedure 


Not inheritable 
Restricted to some hosts 
Need of purification 
(endotoxins) 

Some environmental 
risks 


PTM posttranslational modification, TSP total soluble protein 


genes of interest is introduced back into an 
Agrobacterium strain (helper strain) containing a 
Ti plasmid harboring the virulence (vir) genes but 
without T-DNA (Anami et al. 2013). To stably 
introduce gene(s) into plants, the Agrobacterium 
containing the engineered plasmid 1s cocultivated 
with the explant. During the cocultivation, 
Agrobacterium transfers T-DNA into the host 
cell, which then integrates randomly into the 
nuclear genome by nonhomologous recombina- 
tion. Transgenic calli or somatic embryos are 
selected and regenerated on selection medium. 
In Arabidopsis, transgenic lines can be generated 
by submerging or spraying the floral buds with 
Agrobacterium culture and then selecting the 
transgenic lines by germinating the seeds on 
selection plates (Clough and Bent 1998). In 
nature, Agrobacterium mainly infects dicotyle- 
donous plants (De Cleene and De Ley 1976). 
However, under laboratory conditions, it is the 
method of choice for genetic manipulation of 
many plant species including monocots (Tzfira 
and Citovsky 2006). Some genotypes and plant 


species are not transformable with Agrobacterium, 
a barrier that limits the use of Agrobacterium- 
mediated transformation in these species. 


28.2.2 Direct Gene Transfer 


Direct gene transfer by high-velocity micropro- 
jectiles (also called particle bombardment or 
biolistic method) and electroporation has also 
been widely used to introduce genes into plants, 
especially in those species that are not amenable 
for transformation with Agrobacterium (Altpeter 
et al. 2005; O’ Kennedy et al. 2011). In case of 
particle bombardment, metal particles coated with 
plasmid DNA containing the gene(s) of interest 
are introduced into plant cells in a pressurized 
helium stream using a gene gun. Physical and 
chemical parameters such as rupture pressure, 
type of particles, particle travel distance, and DNA 
concentration can be optimized to different 
tissues and different species. In case of electropora- 
tion, plasmid DNA 1s introduced into protoplasts 
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by subjecting the cells to an electric shock 
(Anami et al. 2013). There are advantages and 
disadvantages of both Agrobacterium-mediated 
transformation and particle bombardment meth- 
ods. The ability of Agrobacterium to introduce a 
single or a few copies of gene per genome is an 
advantage over particle bombardment that results 
in insertion of multiple copies, which might 
result in recombination and instability (Gao 
and Nielsen 2013). On the other hand, some 
advantages make particle bombardment a method 
of choice for engineering of recalcitrant plant 
species, different cell types, organelle transfor- 
mation, and simultaneous transformation of 
multiple genes with no need for complex cloning 
or crossing of lines carrying different transgenes 
(Altpeter et al. 2005). 


Stable Versus Transient 
Transformation 

Once foreign DNA is integrated into the host 
genome (stable transformation) by either method, 
it is inherited and segregated in the progeny 
according to Mendelian laws (De Block et al. 
1984; Horsch et al. 1984). However, during prop- 
agation of these lines, complete loss of function 
of the transgene or significant variation of gene 
expression among lines might occur as a result of 
gene silencing, multiple integration events, or 
complex genetic rearrangements (Slater et al. 
2003; Schiermeyer et al. 2004). Analysis of a 
large number of independent transgenic lines for 
each gene is necessary to achieve the highest 
expression/accumulation of introduced gene 
product. In stable transformation, the effect of 
transgenes can be studied at the biochemical, 
physiological, morphological, genetic, and cell 
biological levels to gain insight into their func- 
tions and biological roles. This is different from 
transient transformation where gene expression, 
subcellular localization, or protein targeting can 
be tested within hours to days, thus avoiding in 
vitro plant regeneration procedures (Sheen 2001). 
Transient transformation can be performed by 
shooting DNA-coated beads into epidermal 
peels, cell cultures, and tissues, electroporation 
with protoplasts, or agroinfiltration of leaves 
(Kirienko et al. 2012; Mangano et al. 2014). 


28.2.2.1 
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Agroinfiltration of Nicotiana benthamiana leaves 
has been used widely to study protein-protein 
interactions by means of bimolecular fluores- 
cence complementation technology (BiFC) and 
fluorescence resonance energy transfer (FRET) 
(Ali et al. 2008; Boruc et al. 2010). Recently, an 
Arabidopsis proteome chip prepared from 
thousands of transiently expressed Arabidopsis 
cDNAs in Nicotiana benthamiana leaves was 
used to perform protein-protein interaction 
studies, which resulted in identification of targets 
of calmodulin and MAP kinases (Popescu et al. 
2007; Lee et al. 2011). 


28.2.3 Organelle Transformation 


There is a public concern about the insertion of a 
transgene in the nuclear genome as the trans- 
genes could move — “transgene flow” — to wild 
species or to weedy relatives growing in natural 
or seminatural communities (Wilkinson et al. 
2003; Singh et al. 2010). In recent years, research- 
ers have begun to introduce genes into plastids as 
an alternative approach to nuclear transformation 
as it offers several advantages over transferring 
genes into nuclear genomes (Bock 2007; Wani 
et al. 2010). First, plastid transformation has the 
potential to eliminate the gene flow through 
pollen grains since plastids are maternally inher- 
ited in a majority of angiosperms (Wani et al. 
2010). Second, plastid transformation allows 
high levels (up to 40 %) of protein expression 
compared to 0.5—3 % in case of nuclear transfor- 
mation (Daniell et al. 2002). The high production 
of expressed protein in transplastomic plants 
(plants with transformed plastid genome) is pos- 
sible due to the high copy number of plastids per 
plant cell, and also the expressed proteins are less 
affected by pre- and posttranscriptional gene 
silencing. Third, plastid transformation has the 
advantage of expressing multiple genes from 
polycistronic mRNA with the absence of epigen- 
etic effect and gene silencing (Sidorov et al. 
1999; Maliga 2001). However, there are a few 
drawbacks in introducing genes into the plastid 
genome. The expressed proteins in the plastids 
are not glycosylated because of the prokaryotic 
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nature of the plastids. This is important especially 
for production of pharmaceuticals (Golovkin 
2011). Another problem is the lack of tissue- 
specific and developmentally regulated control 
mechanisms (Wang et al. 2009). This is neces- 
sary when expression of the transgene 1s desired 
to be limited to a specific tissue, organ, and/or 
developmental stage or the high level of foreign 
proteins in plastids may have deleterious pleio- 
tropic effects (Magee et al. 2004). Recently, the 
toxic effects of transgene expression in plastids 
have been addressed by using an inducible 
expression system that requires coordination 
between nucleus and plastids. A T7 RNA poly- 
merase gene (IT7RNAP) fused to a chloroplast 
transit peptide at the N-terminus was cloned 
under the control of an alcohol-inducible pro- 
moter and introduced into the nuclear genome of 
tobacco. Induction of T7RNAP by ethanol appli- 
cation leads to its expression and transport to 
plastids where transcription of the plastid transgene 
under the control of the T7 promoter is turned on 
(Lossl et al. 2005). Also, an external control of 
plastid transgene expression was developed using 
modified promoters containing binding sites for 
bacterial Lac repressor. This allows chemical 
induction of plastid transgene in intact plants or 
after harvesting (Muhlbauer and Koop 2005). 
Plastid transformation has been used to introduce 
new traits and also for basic research to under- 
stand processes in plastids such as RNA editing 
(Ruf and Bock 2011). Transgenic chloroplasts 
have been tested for the production of many 
therapeutic proteins. This method avoids the 
expensive fermentation procedures, and, in 
addition, the oral delivery of plastid-derived 
therapeutic proteins could eliminate the expensive 
purification, cold storage, and cold transportation 
(Singh et al. 2009). More than 20 vaccines were 
successfully expressed in chloroplasts (Golovkin 
2011), suggesting that chloroplast contains the 
mechanisms that allow correct folding and disul- 
fide bond formation that result in fully functional 
proteins. However, further studies are needed to 
establish a feasible production system for bio- 
pharmaceuticals in chloroplasts. If functional 
glycoproteins could be expressed in chloroplasts, 
then plastid engineering would be of great use in 
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expressing many commercially important proteins. 
Plastid engineering was also used for enhancing 
the nutritional value of food crops such as pro- 
ducing fruits with elevated contents of B-carotene 
(Apel and Bock 2009), for phytoremediation 
(Ruiz et al. 2003), for reversible male sterility 
(Ruiz and Daniell 2005), for biotic and abiotic 
stress tolerance (Wani et al. 2010), and for 
production of platform chemicals such as the bio- 
plastic polyhydroxybutyrate (PHB) (Nakashita 
et al. 2001; Lossl et al. 2003) (Sect. 28.7.4.2). 

Although the first transgene delivery to plastids 
was achieved using the Agrobacterium-mediated 
method (Block et al. 1985), particle bombardment 
is the method of choice for plastid transforma- 
tion. To obtain a genetically stable transplastomic 
line, all genome copies have to be uniformly 
transformed (homoplasmic), a process that takes 
a long time. Plastid transformation occurs through 
the following steps: First, introduction of the 
plasmid containing the selectable marker (e.g., 
an antibiotic resistance) and the gene of interest 
by the biolistic process or by polyethylene 
glycol; second, integration of the foreign DNA 
by homologous recombination into the plastid 
genome; third, gradual elimination of wild-type 
genome copies during repeated cell division on a 
selective medium; and fourth, regeneration of 
the transplastomic shoot. In some cases, the anti- 
biotic resistance used in selection is not desirable 
in the final products; therefore, many strategies 
have been developed to eliminate the necessity 
of using these selectable markers (Maliga 2002, 
2003; Wang et al. 2009). 


28.2.4 Viral-Based Transient 
Expression of Proteins 


Modular transient expression systems using plant 
viral vectors are offering a robust industrial 
method to achieve high yields of heterologous 
proteins. The advantage with this method is that 
the transgene 1s not heritable. Two strategies have 
been used for transient expression using viral 
vectors. The first is called a “full virus strategy.” 
The protein of interest, e.g., small protein fragment 
such as immunogenic epitope, 1s expressed as a 
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fusion to the viral coat protein usually in tobacco 
(Grevich and Daniell 2005). The major limitation 
of this strategy is that inserts larger than | kb 
cannot usually be expressed. In the second strategy 
that uses so-called deconstructed viral vectors, 
there is no “20-25 amino acid barrier’ for coat 
protein fusion and longer proteins up to 80 kDa 
can be expressed. In this strategy, only the viral 
elements required for efficient expression of the 
interested gene were maintained, while the 
components needed for infectivity, amplification/ 
replication, cell-to-cell movement, assembly of 
viral particles, and systemic spread are provided 
using nonviral components (Gleba et al. 2007). 
Depending on the vector used and the host organ- 
ism, the process takes 4-16 days, and depending 
on the gene of interest, up to 5 g recombinant 
protein per kg fresh leaf biomass can be produced 
(Gleba et al. 2007). The viral-based expression 
system has already been used to produce func- 
tional platform proteins of different origin such 
as biologically active human growth hormone 
and antigens (reviewed in Gleba and Giritch 
2011). Overall, this industrial process has a rapid 
scale-up and high-yield production of diverse 
heterologous proteins, thereby greatly reducing 
the cost and time. 


28.3 Manipulation of Transgene 
Expression 


Unlike in bacteria, in plant metabolic engineering 
great care must be taken to control transgene 
expression. Plants have organelles that affect 
resources compartmentalization. Also, plants have 
numerous specialized and differentiated organs 
in which gene expression may differ dramati- 
cally. In addition, temporal and developmental 
processes can significantly influence when and 
where the transgene will be active. Moreover, the 
aim of the engineering process also influences the 
design strategy. For example, is the engineering 
intended to impact native physiological processes 
or synthesis of metabolites or is the goal to over- 
produce new products or proteins? Such consid- 
erations mandate the type of promoters that drive 
the transgene and its level of expression. 


Dos 


The promoter, a portion of DNA upstream of a 
gene transcription start site, controls the gene 
expression levels and its specificity 1.e., constitu- 
tive, inducible, tissue-specific, or developmen- 
tally regulated. In transgenic plants, promoters 
are used to regulate the level and specificity of 
the transgene expression (Anami et al. 2013). 
Various types of promoters that differ in strength 
and regulation have been used in plant metabolic 
engineering. 


28.3.1 Constitutive Promoters 


Constitutive promoters are used to produce large 
quantities of the transgene product in all plant 
parts and throughout the growth and develop- 
ment period. Several constitutive promoters have 
been used successfully to express transgenes in 
plants. The most commonly used are those of 
viral origin such as the cauliflower mosaic virus 
35S promoter (35S CaMV) that confers a high 
expression in most cell types except pollen. The 
expression of these promoters is independent of 
environmental conditions, and they are active in 
various monocotyledonous and dicotyledonous 
plants (Odell et al. 1985; Benfey and Chua 1990). 
Although a very high level of expression is 
achieved by using the constitutive promoters of 
viral origin, gene silencing via co-suppression has 
been reported (Koosha etal. 1989). Co-suppression 
phenomena can be avoided by using constitutive 
promoters of plant origin such as maize ubiquitin 
(maize Ubi-1) (Shrawat and Lorz 2006), rice 
actins (OsActl and OsAct2) (McElroy et al. 
1990), Brachypodium distachyon elongation fac- 
tor, ubiquitin (BdEFla and BdUBI) (Coussens 
et al. 2012), and switchgrass ubiquitin promoters 
(PyvUbil and PvUbi2) (Mann et al. 2011). Other 
constitutive promoters that are also widely used 
are nopaline synthase (nos), octopine synthase 
(ocs), Mannopine synthase (mas) (von Lintig 
et al. 1994), and maize alcohol dehydrogenase 1 
(Adh1) (Kyozuka et al. 1994). To introduce more 
than one gene in a transgenic plant, different 
promoters and terminators should be used to 
avoid the co-suppression and silencing of the 
introduced genes (Abdel-Ghany et al. 2013). 
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28.3.2 Inducible Promoters 


In some cases, constitutive expression of an 
introduced gene can have adverse effects on plant 
growth and development when the overproduc- 
tion occurs at the wrong time in development or 
in a tissue where it is not normally expressed. For 
instance, in metabolic engineering of Arabidopsis 
for cellulose production, constitutive overex- 
pression of glucanases and cellulases (cellulose- 
degrading enzymes) reduced cellulose crystallinity 
and the glucose content in the cell wall (Takahashi 
et al. 2009), modified leaf shape and growth 
(Petersen and Bock 2011), and reduced environ- 
mental stress tolerance (Taylor et al. 2008). 
Therefore, under certain circumstances, it is 
desirable to use inducible promoters to regulate 
foreign gene expression to a distinct developmen- 
tal stage or for a certain duration (Cai et al. 2007). 
Inducible systems are usually based on two com- 
ponents. The first component is a native or chimeric 
transcription factor (activator) that when induced 
specifically binds to a specific promoter. The 
second component contains the binding sites for 
the activator to activate transcription of a gene 
fused to this promoter (Borghi 2010). Inducible 
promoters are activated in response to various 
external stimuli, which can be chemical factors 
such as tetracycline, ethanol, steroids, metal ions, 
and herbicides or physical factors such as heat, 
cold, and light (Zuo and Chua 2000). Those 
promoters that respond to specific chemical 
compounds, not found naturally in the organism 
of interest, are of particular interest in metabolic 
engineering because they offer very stringent 
regulation of gene expression. Tetracycline 
inducers are particularly attractive because they 
are small lipophilic compounds that enter easily 
into eukaryotic cells by passive diffusion. 
They have been used successfully to produce 
valuable pharmaceuticals or industrial proteins in 
plant cell suspension culture (Bortesi et al. 2012). 
Another antibiotic, which has been used widely 
in bioengineering of cell culture, is pristinamycin. 
The use of a pristinamycin-responsive promoter 
to drive transgene expression has the advantage 
of controlling bacterial contamination as well 
as easy induction of the transgene within the 
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bioreactor (Frey et al. 2001). The ethanol-inducible 
system (Roslan et al. 2001), derived from the 
filamentous fungus Aspergillus nidulans, has 
been optimized for the production of proteins in 
plants and studying the function of proteins (Van 
Hoewyk et al. 2007). Steroid-inducible systems 
have also been used in studying the function of 
plant transcription factors that control plant 
developmental pathways (Aoyama et al. 1995; 
Gunl et al. 2009). Also, the promoter of the 
Arabidopsis heat shock protein (HSP18.2) has 
been used successfully to induce expression of 
desired genes in several plants after heat shock 
(37 °C) (Matsuhara et al. 2000; Masclaux et al. 
2004). In the absence of heat stress, the HSP18.2 
promoter is repressed (Yoshida et al. 1995). 


28.3.3 Organ-, Tissue-, Domain-, 
and Cell-Type-Specific 
Promoters 


Rather than overexpressing the transgene in all 
tissues, another option is to target the transgene 
expression to a specific organ or tissue that is rich 
in precursors of a desired metabolic pathway or is 
easy to harvest, e.g., leaves, tubers, or fruits 
(Lessard et al. 2002). Several endosperm-specific 
promoters have been used to express single or 
multiple enzymes of biochemical pathways in this 
tissue either to dissect a metabolic pathway or to 
improve the nutritional value of seeds (Naqvi 
et al. 2009; Coussens et al. 2012). Expression of 
phytoene synthase (psy) in indicia rice under the 
control of seed-specific glutelin promoter (Gt-1 
P) in combination with lycopene B-cyclase (/cy) 
and phytoene desaturase (crt/) fused to the transit 
peptide sequence of the pea-Rubisco small subunit 
and, driven by the CaMV35S promoter, caused the 
seeds to accumulate high levels of carotenoids 
and become yellow (Datta et al. 2003). Other tis- 
sue-specific promoters that have been used in 
plant biotechnology are the tapetum-specific pro- 
moters for generation of male sterility in canola, 
brassica, tobacco, Arabidopsis, rice, and others 
(Mariani et al. 1990; Tsuchiya et al. 1995; Mitsuda 
et al. 2006; Zhan et al. 1996), the green tissue- 
specific Rubisco small subunit promoter 
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(Rubisco SSU) (Bakhsh et al. 2011; Abdel-Ghany 
et al. 2013), and the floral whorl-specific API pro- 
moter (Byzova et al. 2004). 


28.4 Suppression of Gene 
Expression 


In addition to overexpression of transgenes that 
confer desirable traits, suppression of the levels 
of endogenous gene(s) expression is also of 
interest for plant biotechnology. Studies in gene 
silencing-related mechanisms started in early the 
1990s based on a surprising observation in which 
the introduction of a transgene into the genome 
led to the silencing of both the transgene and 
homologous native gene (Napoli et al. 1990; van 
der Krol et al. 1990a), a phenomena known as 
posttranscriptional gene silencing (PTGS). 
Different methods have been applied for gene 
silencing in plants. 


28.4.1 Antisense RNA 


The antisense RNA strategy is a powerful tool for 
the selective suppression of gene expression in 
plants (reviewed in Bird and Ray 1991). It has 
been used to study genes involved in a wide range 
of physiological and biochemical processes in a 
variety of tissue types such as flower pigmenta- 
tion (van der Krol et al. 1990b), photosynthesis 
(Stockhaus et al. 1990), fruit-ripening, and viral 
resistance (Hemenway et al. 1988; Powell et al. 
1989). The principle of antisense RNA is based 
on the introduction of a gene corresponding to a 
desired gene in a reverse orientation so that an 
antisense RNA molecule that is complementary 
to whole or part of the sequence of the mRNA of 
the target gene is produced. Inside the nucleus, a 
duplex is formed between the sense and antisense 
transcript, which interferes with several aspects 
of gene expression at the transcriptional, post- 
transcriptional, and translational levels. 
Antisense RNA has several limitations. Its 
effectiveness varies between cells and organs 
depending on the promoter employed even in the 
case of the 35S CaMV constitutive promoter 
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because its expression was found to vary in 
different cell types (Wilkinson et al. 1997; Tang 
et al. 1999). A further limitation is that gene 
silencing methods will potentially target all 
homologous genes, either members of a family or 
closely related genes (Thorneycroft et al. 2001). 
Also, it is not possible to eliminate an enzyme 
completely by antisense RNA, and in many cases 
the transcript is reduced to undetectable levels 
without apparent phenotype, making it difficult 
to derive a firm conclusion about its function as it 
is usually argued that a small amount of residual 
enzyme is sufficient to fulfill the required role in 
vivo (Takaha et al. 1998). 


28.4.2 Insertional Mutagenesis 


Gene knockout is one of the important tools in 
plant functional genomics. It is based on the 
insertion of a foreign DNA (either T-DNA or 
transposon) into the gene of interest. Insertion of 
foreign DNA into a structural gene (whether into 
an exon or intron) in most cases will disrupt gene 
expression completely and result in a null muta- 
tion. In some cases, insertion of foreign DNA in 
the promoter or in the 3’-UTR reduces the expres- 
sion (knockdown) rather than producing null 
mutation. The foreign DNA not only disrupts the 
expression of the gene into which it is inserted 
but also acts as a marker for subsequent identifi- 
cation of the mutation. One advantage of such 
insertional mutagenesis is that it allows the 
recovery of mutations in essential genes, as the 
plants can be maintained in the heterozygous 
state. Another advantage of this method is that 
only a single gene within a family of closely 
related genes is affected, allowing functional 
analysis of individual members in a gene family. 
Using Agrobacterium-mediated transformation, 
a large number of T-DNA-transformed lines have 
been generated, and, therefore, one has a very 
high chance of finding a plant carrying a T-DNA 
insert within any gene of interest. Many T-DNA 
lines have been developed in the two most impor- 
tant model plant species, Arabidopsis and rice 
(Krysan et al. 1999; Jeon et al. 2000; Alonso 
et al. 2003; Sallaud et al. 2004). These T-DNA 
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stocks have served as a foundation for functional 
characterization of thousands of genes in these 
two species. Unfortunately, in many plant species, 
the Agrobacterium-mediated transformation is 
not developed yet or is not efficient enough to 
produce a sufficient population of insertion lines 
that cover the whole genome (Duangpan et al. 
2013). In these species, the transposable elements 
(TEs) are the choice for insertional mutagenesis. 
Several TEs have been used in plants including 
the Activator and Mutator transposons in maize 
(Walbot 1992), the Tam3 transposon in Antirrhinum 
majus (Luo et al. 1991), the Jos/ 7retrotransposon 
in rice (Hirochika 2010), and Tnt/retrotransposon 
in tobacco, potato, soybean, and other species 
(Grandbastien et al. 1989; Cui et al. 2013; 
Duangpan et al. 2013). 


28.4.3 Artificial microRNAs 
(amiRNAs) 


Since insertional mutagenesis approaches are 
available for a limited number of model plants, 
targeted gene silencing using amiRNA technology 
is a valuable tool for gene manipulation and 
functional analysis in plants (Li et al. 2013). 
amiRNA technology exploits the endogenous 
miRNA biogenesis and silencing machineries to 
generate miRNAs that direct silencing of a 
desired gene. In plants, the miRNA precursor 
transcript is 80-250 nucleotides in length with 
some structural features such as mismatches, 
bulges, and one or several imperfect foldbacks 
and when processed produces an miRNA/miRNA* 
duplex. When both duplex sequences are altered 
without changing the structural features, a high- 
level accumulation of miRNAs of a desired gene 
can be produced in different plant species 
(Parizotto et al. 2004; Alvarez et al. 2006; Schwab 
et al. 2006; Ossowski et al. 2008; Sablok et al. 
2011). Therefore, artificial miRNA is built on a 
native miRNA precursor by replacing the origi- 
nal miRNA/miRNA* duplex with a customized 
sequence to silence a gene of interest, thereby 
generating a loss-of-function phenotype for the 
targeted gene (Parizotto et al. 2004; Alvarez et al. 
2006; Schwab et al. 2006; Ossowski et al. 2008). 
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A Web-based miRNA designer (WMD) (http:// 
wimd3.weigelworld.org/cgi-bin/webapp.cgi?pag 
e=Home;project=stdwmd), which works for 
>100 plant species, allows the design of gene- 
specific amiRNA candidates within a given 
transcriptome (Schwab et al. 2006; Ossowski 
et al. 2008). WMD computationally generates 
hundreds of amiRNAs for each target that are 
ranked based on the complementarity and 
hybridization energy without knowing the in vivo 
efficiency. The efficacy of amiRNAs has been 
tested in Arabidopsis (Schwab et al. 2006), rice 
(Warthmann et al. 2008; Schmidt et al. 2012), 
Physcomitrella patens (Khraiwesh et al. 2008), 
Chlamydomonas reinhardtii (Molnar et al. 2009), 
and other species. The advantages of amiRNA 
compared to other RNA silencing approaches 
such as RNA interference (RNAj) and virus- 
induced gene silencing (VIGS) are that it has 
minimal off-target effects and a unique capacity 
for multigene silencing (Alvarez et al. 2006; Niu 
et al. 2006; Schwab et al. 2006; Ossowski et al. 
2008). Since miRNA silencing depends on many 
unpredictable in vivo factors such as target 
mRNA structure and mRNA binding proteins 
(Fabian et al. 2010; Pasquinelli 2012) and WMD 
prediction does not consider these factors, many 
amiRNAs should be tested to find an optimal 
amiRNA. Recently, an epitope-tagged protein- 
based amiRNA (ETPamir) screen was developed 
for efficient amiRNA gene silencing in plants, in 
which target mRNA encoding epitope-tagged 
proteins were coexpressed in protoplasts with 
amiRNA candidate targeting single or multiple 
genes (Li et al. 2013). These screens were applied 
to diverse plant species including Arabidopsis, 
tobacco, tomato, sunflower, maize, and rice with 
the potential for improving crop engineering 
with highly predictable and efficient amiRNAs 
(Li et al. 2013). 


28.5 Metabolic Engineering Using 
Multiple Transgenes 


In plants, metabolic pathways can be enhanced 
by overexpressing the rate-limiting enzymes 
or introducing enzymes that are insensitive to 
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negative feedback regulation. However, introduc- 
tion of a novel pathway or metabolic engineering 
of a plant for production of platform chemicals 
often requires expression of multiple transgenes. 
In prokaryotes, multigene engineering is possible 
through coexpression of an operon as a polycis- 
tronic transcript (Steen et al. 2010). But in plant 
cells, except in chloroplasts, the polycistronic 
mRNA is not efficient in translation because of 
the scanning mode of the eukaryotic ribosome 
for recognizing the 5’ cap and the initiation 
codon in the transcript and the need for a proper 
sequence context in the 5’ UTR (Kozak 2005). 
Therefore, different strategies have been devel- 
oped to introduce multiple transgenes into plants. 

The common strategy for introducing more 
than one gene is stacking of expression cassettes 
in one vector. Each cassette consists of its own 
promoter, gene of interest, and a terminator. To 
avoid gene silencing through co-suppression, dif- 
ferent promoters and terminators should be used 
and the cassettes are arranged in tandem and in 
the same transcriptional orientation (Hamilton 
and Baulcombe 1999; Kooter et al. 1999). For 
example, introduction of B-carotene pathway 
in rice was achieved by co-transformation with 
two plasmids, each with a two-gene cassette 
(Ye et al. 2000). One plasmid contained the phy- 
toene synthase gene PSY under the control of 
the endosperm-specific glutelin promoter and the 
phytoene desaturase gene crt/ under the control 
of the CaMV 35S promoter. The second vector 
contained the lycopene B-cyclase gene and the 
selectable marker gene. The three genes for 
carotenoid biosynthesis additionally contained 
an N-terminal transit peptide sequence to direct 
the proteins to the chloroplast, the sites for carot- 
enoids biosynthesis. Another example is the 
introduction of the butanetriol metabolic path- 
way in Arabidopsis (Abdel-Ghany et al. 2013). 
Production of butanetriol from its precursor, 
xylose or arabinose, requires four enzymes. The 
genes for the first three enzymes in the pathway 
were cloned in binary vectors with each gene 
driven by a different promoter and terminator 
(Fig. 28.1). Each gene was cloned into a promoter 
vector, and then the gene with the promoter/ 
terminator was moved sequentially into a binary 
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vector that was used for A grobacterium-mediated 
transformation (Abdel-Ghany et al. 2013). 

Vectors containing different cassettes can also be 
introduced into plants by co-transformation, serial 
transformation, or crossing. Co-transformation 
with direct gene transfer is a straightforward 
method but more challenging if Agrobacterium- 
mediated transformation is to be used due to the 
requirement to maintain two plasmids in one 
Agrobacterium cell or the need to infect a single 
cell with two Agrobacterium strains (Ebinuma 
et al. 2001). In serial transformation, referred to 
as super transformation, a transgenic plant will 
be transformed successively with additional 
transgenes (Bock 2013). This method is more 
time consuming and requires a new selectable 
marker gene for each super transformation step 
unless the selectable marker gene is recycled 
using marker excision techniques (Hare and 
Chua 2002). Crossing is also a time-consuming 
process especially if the plant generation time 
is long, and this is not an option in crop plants 
that propagate vegetatively such as potato and 
sugarcane. 

Another strategy for multigene transformation 
is combinatorial transformation (Zhu et al. 2008). 
This method takes advantage of the particle 
bombardment method, which induces a few 
double-stranded DNA breaks into which multiple 
copies of the transforming plasmid’s DNA can 
co-integrate before DNA repair (Altpeter et al. 
2005). Individual transgenes required for a meta- 
bolic pathway as well as the selectable marker 
are cloned in expression cassettes, mixed, and 
loaded onto gold or tungsten particles. Particle 
bombardment with these microprojectiles some- 
times introduces copies of all constructs present 
in the mix. How many of them are introduced 
into the genome depends on the efficiency of the 
integration process as well as on the stability of 
the DNA released from the particles in the 
host cells. Following the selection, transgenes 
co-transformed with the selection marker can be 
analyzed, and transgenic lines can be generated. 
The most attractive feature of combinatorial 
transformation is that there is no theoretical limit 
to the number of transgene cassettes that can 
be co-transformed. Combinatorial transformation 


562 


al D-xylose 


J XDG 


D-xylonic acid 


por 


D-3-deoxy-glycero- 
pentulosonic acid 


BFD 
D-3,4-dihydroxy-butanal 


ou 


D-1,2,4-butanetriol 


PI-Pspl 


—FEJ- sctection (BAR/Kan) — 
PI-Pspl 


I-JScel 


Fig.28.1 Schematic diagrams of 1,2,4-butanetriol pathways 
and design of gene constructs for plant transformation. 
(a) Schematic representation of the biochemical pathways 
showing the synthesis of D-1,2,4-butanetriol and L-1,2,4- 
butanetriol from xylose and arabinose. XDG, D-xylose 
dehydrogenase; ADG, L-arabinose dehydrogenase; XDT, 
D-xylonate dehydratase; ADT, L-arabinonate dehydratase; 
BFD, benzoylformate decarboxylase; DH, dehydrogenase. 
(b) Binary vector construct containing the three genes 
encoding the first three enzymes used in the butanetriol 


has been used to improve the nutritional value of 
maize by introducing the biosynthetic pathways 
for three vitamins (vitamin A, C, and B9) (Naqvi 
et al. 2009). 


28.6 Targeted Genome 
Engineering 


Although the co-transformation, re-transformation, 
and sexual crossing methods have been proven to 
be useful for metabolic engineering of plants, 
these methods suffer from several flaws (reviewed 
in Dafny-Yelin and Tzfira 2007; Naqvi et al. 
2010). For instance, the integration of genes in 
the genome is completely random, which can have 
marked effects on the host, such as inactivation 
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synthesis pathways from xylose and arabinose. Each 
vector had a set of genes amplified from bacteria or 
codon-optimized synthetic genes for the xylose pathway 
or the set of codon-optimized synthetic genes for the arab- 
inose pathway. RbcS-P and RbcS-T Rubisco small subunit 
promoter and terminator, respectively, mas-P and mas-T 
mannopine synthase promoter and terminator, respec- 
tively; and 35S-P and 35S-T cauliflower mosaic virus 
promoter and terminator, respectively (From Abdel- 
Ghany et al. 2013) 


of an endogenous gene, different levels of transgene 
expression among lines, and even silencing of 
the transgene when inserted into a heterochro- 
matic region. To overcome these issues, a tech- 
nology that allows stacking of a number of 
genes in a single multigene array, introducing it 
at a precisely identified chromosomal locus, and 
then manipulating the array by genome editing 
is needed. Zinc-finger nucleases (ZFNs) and 
transcription activator-like effector nucleases 
(TALENs) offer potential for efficient targeted 
plant genome engineering (Bogdanove and Voytas 
2011; Zeevi et al. 2012; Zhang et al. 2013). ZFNs 
and TALENs are sequence-specific nucleases that 
create in vivo double-stranded breaks at target 
loci followed by nonhomologous end joining 
(NHEJ) and homologous recombination, resulting 
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in targeted gene insertion or gene replacement 
(Wright et al. 2005; Zhang et al. 2013). Using 
these systems, binary vectors that can carry up to 
nine genes and integrate them in the genome 
were developed (Zeevi et al. 2012). In plants, the 
use of these sequence-specific nucleases for 
targeted genome modifications has many appli- 
cations, ranging from dissecting gene function 
(Curtin et al. 2012) to creating plants with new 
traits such as disease resistance, altered metabo- 
lite profile, and tolerance to herbicides (Shukla 
et al. 2009; Townsend et al. 2009). 


28.7. Applications of Metabolic 
Engineering 


28.7.1 Improving the Nutritional 
Value of Crops 


Multigene transformation for metabolic engineering 
helped to generate plants with high nutritional 
values. For example, to develop golden rice, the 
entire B-carotene (pro-vitamin A) biosynthetic 
pathway was introduced into rice endosperm 
(Ye et al. 2000). The transgenic rice produced 
seeds with yellow endosperm due to the accumu- 
lation of B-carotene (Ye et al. 2000). Substantial 
improvements of B-carotene were achieved by 
testing enzymes from different sources, some of 
which turned out to have superior catalytic activi- 
ties (Paine et al. 2005). A similar technology has 
been used to improve the content of other vita- 
mins that are limiting in diet and important in 
maintaining optimal health (DellaPenna 2007). 
Iron biofortification is another application of 
metabolic engineering. Iron is the most com- 
monly deficient micronutrient in the human diet 
and about 25 % of the world population suffers 
from iron deficiency. To increase the iron content 
in seeds, the iron storage protein ferritin and 
Aspergillus phytase were expressed in the endo- 
sperm of rice and maize, respectively, which 
resulted in significant increase in the levels of 
bioavailable iron (Goto et al. 2000; Lucca et al. 
2002; Drakakaki et al. 2005; Aung et al. 2013). 
Similar technology was applied for lettuce (Goto 
et al. 2000). Other nutritionally improved traits 


563 


that are introduced into agricultural crops and 
intended to provide health benefits for human 
and animals include protein quality and level, 
essential amino acids, oil and fatty acids, fibers, 
vitamins, and minerals (reviewed in Newell- 
McGloughlin 2008). 


28.7.2 Enhancement of Biomass 
and Photosynthetic Efficiency 


Metabolic engineering has also been employed to 
improve the photosynthetic efficiency of crop 
plants (Kebeish et al. 2007). Phosphoglycolate 
produced from the oxygenation reaction of 
Rubisco is recycled into phosphoglycerate during 
photorespiration, which 1s then used in the Calvin 
cycle. This recycling process results in net loss 
of fixed carbon, thereby reducing the yield of 
biomass and sugars (Kebeish et al. 2007). 
Introduction of a glycolate catabolic pathway in 
the chloroplast enabled the release of CO, in 
close proximity of the Rubisco enzyme, which in 
turn suppresses the unwanted oxygenation 
reaction and increases the CO), fixation capacity 
(Kebeish et al. 2007). The five transgenes 
required for the glycolate metabolic pathway 
were introduced into Arabidopsis chloroplasts by 
the re-transformation strategy (Kebeish et al. 
2007). Transgenic lines showed reduced photo- 
respiration rate and increased biomass produc- 
tion (Kebeish et al. 2007), a potential for 
improving the efficiency of photosynthesis in 
crop plants. Also, in a green house simulation 
experiment using antisense technology, it was 
possible to produce transgenic plants with 
reduced Rubisco activity (Makino et al. 1997). 
These plants showed 15 % higher photosynthetic 
rates and efficient nitrogen use in an elevated par- 
tial pressure of carbon dioxide and high irradia- 
tion, although 65 % reduction was observed 
under normal condition (Makino et al. 1997). 
Also, overexpression of other enzymes such as 
maize phosphoenolpyruvate carboxylase in rice 
and cyanobacterial fructose-1,6-/ 
sedoheptulose-1,7-bisphosphatase in tobacco 
enhanced photosynthesis and growth (Ku et al. 
1999; Miyagawa et al. 2001). 
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28.7.3 Resistance to Biotic 
and Abiotic Stresses 


Several biotic and abiotic stresses such as patho- 
gens, insects, salinity, drought, high temperature, 
and freezing severely limit plant growth and 
development. Plant biotechnology has allowed 
the development of transgenics that are more 
resistant to stresses. In response to osmotic stress, 
plants accumulate compatible solutes or osmo- 
lytes such as glycine betaine, proline, and sugar 
alcohols, suggesting that overproduction of 
these solutes in transgenic plants may result in 
improved stress tolerance. Overexpression of 
bacterial choline monooxygenase (CMO), the 
first enzyme in the betaine biosynthesis pathway, 
in plants resulted in increased resistance to 
various stresses such as salt and extreme temper- 
ature (Sakamoto and Murata 2001). P5CS 
(At Pytoline_5-carboxylate) is the rate-limiting 
enzyme in the proline biosynthetic pathway, and 
transgenic tobacco plants overexpressing the 
P5CS gene produced 10- to 18-fold more proline 
and exhibited better performance under salt 
and freezing stresses (Kishor et al. 1995; 
Konstantinova et al. 2002). Another way to 
increase the proline content is to downregulate its 
catabolism. Arabidopsis plants expressing the 
antisense mRNA for proline dehydrogenase 
(AtProD#), the first enzyme in the proline degra- 
dation pathway, constitutively accumulated pro- 
line and showed tolerance to freezing (—7 °C) and 
to NaCl (600 mM) (Nanjo et al. 1999). Rice 
plants engineered for trehalose overexpression 
showed higher capacity for photosynthesis, 
decrease in photooxidative damage, and tolerance 
to multiple abiotic stresses (Garget al. 2002). 
The dehydration-responsive transcription 
factors (DREB) and C-repeat binding factors 
(CBF) are conserved transcription factors that 
are induced in response to stresses and induce the 
expression of stress-regulated genes. Increased 
expression of Arabidopsis CBF/1, CBF2, and 
CBF3 induced the expression of many cold- 
regulated genes and increased the freezing toler- 
ance of non-acclimated plants (Jaglo-Ottosen 
et al. 1998; Kasuga et al. 1999; Gilmour et al. 
2000). ABA signaling plays a vital role in drought 
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response as many of the drought-inducible 
genes are induced by ABA (Wang et al. 2003). 
Constitutive overexpression of ABA-responsive 
element-binding proteins ABF3 and ABF4 in 
Arabidopsis demonstrated enhanced drought 
tolerance with altered expression of ABA- and 
stress-responsive genes (Kang et al. 2002). 
Similarly, overexpression of heat shock tran- 
scription factors HSFI and HSF3 led to the 
expression of several hsp genes, conferring 
thermotolerance in transgenic plants (Lee et al. 
1995), while the tomato plants expressing 
HsfAl antisense mRNA were extremely sensitive 
to elevated temperature (Mishra et al. 2002). 
Engineered plants overexpressing antioxidants 
such as catalases, ascorbate oxidases, and super- 
oxide dismutases showed increased resistance 
to oxidative stress caused by high light, low 
temperatures, and UV radiation (Gupta et al. 
1993; McKersie et al. 2000). 

Application of plant biotechnology led to 
improved resistance to insects, diseases, and 
herbicides (reviewed in Muthurajan and 
Balasubramanian 2010). For example, transgenic 
plants that express both Br and cry endotoxins 
were found to exhibit resistance against insects 
that feed by chewing or biting. Also, simultane- 
ous introduction of three genes expressing 
insecticidal proteins Cry/Ac, Cry2A, and gna 
into indica rice controlled three of the major 
rice pests (Maqbool et al. 2001). Coexpression of 
two PR-protein genes (chitinase and glucanase) 
was shown to confer resistance to several 
fungal diseases (Muthurajan and Balasubra- 
manian 2010). 


28.7.4 Production of Platform 
Chemicals 


According to the US Department of Energy, 12 
chemicals and chemical classes could be produced 
from plants, and these chemicals can function 
as building blocks or platform chemicals from 
which many value-added compounds such as 
biopolymers, biofuels, energetic materials, and 
pharmaceuticals can be produced (United States 
Department of Agriculture 2008). 
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28.7.4.1 Energetic Materials 
1,2,4-Butanetriol (butanetriol) can be nitrated 
to produce butanetrioltrinitrate (BTTN), an 
energetic material and a useful building block for 
the synthesis of various drugs and important 
chemicals (Niu et al. 2003; Sato et al. 2003; 
Yamada-Onodera et al. 2007; United States 
Department of Agriculture 2008). BTTN is used 
as a propellant and as an energetic plasticizer in 
mining and in a variety of Department of Defense 
and Department of Energy applications. In addi- 
tion, it is anticipated that BTT'N could replace 
nitroglycerin as a vasodilator for the treatment of 
angina. Advantages of BTTN over nitroglycerin 
are that it is less hazardous, less shock sensitive, 
less volatile, more thermally stable, and resistant 
to degradation by nitrate reductase, minimizing 
the number of metabolites generated from 
degradation. 

Chemical synthesis of D, L-butanetriol needs 
petroleum-derived substrates and requires expen- 
sive metal catalysts that adversely impact the 
environment (Niu et al. 2003). On the other hand, 
bacterial synthesis of butanetriol from precursors 
D-xylose or L-arabinose requires continuous 
supply of these sugars (Niu et al. 2003), a process 
that limits the bacterial production. Four enzymes 
are necessary for synthesis of butanetriol 
from either xylose or arabinose: The first two 
enzymes D-xylose dehydrogenase (XDG) and 
D-xylonate dehydratase (XDT) for xylose path- 
way or L-arabinose dehydrogenase (ADG) and 
L-arabinonate dehydratase (ADT) for arabinose 
pathway are substrate specific, whereas the last 
two enzymes benzoylformate decarboxylase 
(BED) and dehydrogenase (DH) are common for 
both pathways (Fig. 28.1). 

Plants synthesize the pentose sugars (xylose 
and arabinose), which can be converted into buta- 
netriol by expressing microbial enzymes allow- 
ing plant-based production of this platform 
chemical. To produce butanetriol in plants, the 
native bacterial genes or the codon-optimized 
ones for both xylose and arabinose pathways 
were cloned under different promoters and intro- 
duced into Arabidopsis plants (Abdel-Ghany 
et al. 2013). Stable transgenic lines accumulated 
butanetriol (Fig. 28.2) and showed an altered 
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metabolic profile. Manipulation of the growth 
medium significantly increased the butanetriol 
production. 

Another energetic bio-based intermediate 
with potential for production of propellants/ 
explosives is phloroglucinol. In addition to 
being the starting material for biosynthesis 
of 1,3,5-triamino-2,4,6-trinitrobenzene (TATB), 
a stable energetic material used by the US mili- 
tary, phloroglucinol has medical, agricultural, 
and industrial applications. Chemical synthe- 
sis of phloroglucinol involves oxidation of 
2,4,6-trinitrotoluene (TNT), a process that pres- 
ents an explosion hazard and generates carcino- 
genic chromates as well as other waste streams. 
On the other hand, bacterial synthesis of phloro- 
glucinol requires a continuous supply of glucose 
to generate the precursor malonyl-CoA. Plants 
have malonyl-CoA as the precursor for fatty acid, 
flavonoids, and _ phytoalexins biosynthesis 
(Baud et al. 2003) and, therefore, can be used for 
sustainable production of phloroglucinol. We 
generated transgenic plants expressing the bacte- 
rial PhID gene targeted to either chloroplast or to 
cytosol for phloroglucinol production (Abdel- 
Ghany, Day, Heuberger, Broeckling, and Reddy, 
MS in preparation). 


28.7.4.2 Polyhydroxyalkanoates 
(PHAs), Bioplastics 

For a long time, there has been an interest in the 
development and production of biodegradable 
plastics to address problems associated with 
plastic waste and its harmful effect on the envi- 
ronment (Poirier et al. 1995). In 1992 Somerville’s 
group demonstrated the production of polyhy- 
droxybutyrate (PHB), a 100 % biodegradable 
polymer, in Arabidopsis by expressing two bacte- 
rial proteins required to convert acetoacetyl-CoA 
to PHB (Poirier et al. 1992). PHB is the most 
common and best-characterized member in a 
group of polyhydroxyalkanoates (PHAs), which 
are biopolymers made entirely from bacterial 
fermentation. In bacteria, three enzymes are 
necessary for PHB synthesis: first, a B-ketothiolase 
that catalyzes the reversible condensation of 
two acetyl-CoA molecules to form acetoacetyl- 
CoA; second, an acetoacetyl-CoA reductase 
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Fig. 28.2 Detection of 
1,2,4-butanetriol in a 
transgenic line. 
Metabolites from 
vacuum-dried tissues of 
wild-type and transgenic 
lines were extracted and 
detected by GC-MS. (a) 
Extracted ion chromato- 
gram (EIC, 103.1 m/z) of 
1,2,4-butanetriol standard 
solution and calibration 
curve. Butanetriol eluted at 
6.45 min. (b) EIC of wild 
type. (c) EIC chromato- 
gram of AAB,,, 6.6.2 
transgenic line showed b 
large peak at 6.45 min with 
a mass spectrum that 
matches the commercial 
butanetriol standard (From 
Abdel-Ghany et al. 2013) 
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which reduces acetoacetyl-CoA to R-(—)-3- 
hydroxybutyryl-CoA; and third, a PHA synthase 
which subsequently polymerizes R-(—)-3- 
hydroxybutyryl-CoA to form PHB. By changing 
the carbon source and the bacterial strain used in 
the fermentation process, it was possible to 
produce biopolymers having physical properties 
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ranging from stiff and brittle plastic to rubbery 
polymers (Anderson and Dawes 1990). 

Despite its basic attractiveness as a substitute 
for petroleum-derived polymers, the major draw- 
back of bacterial PHA synthesis is the high cost 
of bacterial fermentation, making bacterial 
PHAs five to ten times more expensive than the 
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synthetic polymer (Poirier et al. 1995). In this 
respect, crop plants were seen as an attractive 
system for the sustained production of large 
amounts of polymers at low cost as plants have 
the precursor acetyl-CoA and the enzyme 
B-ketothiolase in the cytoplasm. Through cre- 
ation of novel metabolic pathways targeted to the 
cytoplasm, plastid, or peroxisome, varieties of 
PHAs with different physical properties have 
been produced in a number of transgenic plants. 
To date, the highest levels of PHB have been 
achieved in plastids, likely due to the high flux of 
the PHB pathway substrate acetyl-CoA through 
this organelle during fatty acid metabolism 
(Suriyamongkol et al. 2007; Van Beilen and 
Poirier 2008; Bohmert-Tatarev et al. 2011). In 
Arabidopsis plants engineered with plastid- 
targeted enzymes, PHB has been produced as 
granules at levels up to 40 % dry weight but with 
significant growth penalties such that plants were 
dwarf and produced no seeds (Bohmert et al. 
2000). These deleterious effects might be caused 
by the depletion of acetyl-CoA and acetoacetyl- 
CoA from the endogenous flavonoid and isopren- 
oid pathways (Van Beilen and Poirier 2008). To 
circumvent the growth penalties associated with 
constitutive expression of the PHA biosynthetic 
pathway, the introduced genes were fused to an 
inducible promoter that can be activated at later 
stages of growth when the plant accumulates 
sufficient biomass for PHB production (Kourtz 
et al. 2007). PHAs synthesis in biomass crops of 
industrial interest such as switchgrass, sugarcane, 
maize, tobacco, cotton, flax, and others was also 
reported (reviewed in Bornke and Broer 2010), 
although more investigation is needed to channel 
the precursors toward PHA synthesis without 
deleterious effects on plant growth. In addition to 
being bioplastics, PHAs can also be converted 
through simple processes to a range of platform 
chemical intermediates such as acrylic acid, cro- 
tonic acid, propylene, and butanol (Zhong and 
Whitehouse 2005; Peterson and Fischer 2010), 
which have valuable industrial applications. 
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28.7.4.3 Cyanophycin, a Poly-Amino 
Acid Polymer 

Cyanophycin is another important water-soluble 
and biodegradable polymer that contains only 
two amino acids, L-arginine and L-aspartate, in a 
ratio of 1:1 (Simon 1987; Elbahloul et al. 2005). 
It is synthesized via non-ribosomal polypeptide 
synthesis in many cyanobacteria in a polymeriza- 
tion reaction catalyzed by an enzyme cyanophy- 
cin synthetase (cphA). Cyanophycin is a major 
source of several platform chemicals that can be 
further processed into many other chemicals of 
industrial, agricultural, and medical applications 
(Joentgen et al. 2001) complementing the 
application ranges of PHB. Mild hydrolysis of 
cyanophycin results in homopolymers and copo- 
lymers of polyaspartate and L-arginine, which 
serve as the starting point for the syntheses of a 
range of compounds such as 1,4-butanediamine 
that can be used for the synthesis of nylon-4,6; 
3-aminotetrahydrofuran that is used heavily in 
the polymer industry; fumaric acid that is used 
for polyester resin; and acrylamide which is used 
in papermaking and manufacturing dyes (Bornke 
and Broer 2010). Polyaspartate polymer could 
also replace the nondegradable synthetic polyac- 
rylates in various processes such as water treat- 
ment and plastics (reviewed in Bornke and Broer 
2010). For sustainable production of this impor- 
tant polymer in plants, the cyanophycin synthe- 
tase gene from cyanobacteria was introduced 
into tobacco and potato under the control of the 
35S CaMV promoter with modest accumulation 
(0.24-1.14 % per unit DW) and deleterious 
effects on plants (Neumann et al. 2005). Targeting 
the cyanophycin synthesis to the plastids or to 
tubers significantly enhanced its accumulation up 
to 7.5 % DW without noticeable effect on plant 
growth (Huhns et al. 2008, 2009). Similar bio- 
technological approaches have been applied for 
bio-production of other important poly-amino 
acids such as poly-y-glutamate and poly-lysine 
using bacterial operons (Oppermann-Sanio et al. 
1999; Oppermann-Sanio and Steinbuchel 2002). 
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28.7.5 Production 
of Pharmaceuticals in Plants 


The growing need for safer and cheaper drugs 
and the availability of tools to express foreign 
proteins of interest in plants led to the idea of 
using plants to produce vaccines and other phar- 
maceuticals. A schematic representation of the 
concept is shown in Fig. 28.3. Various pharma- 
ceutical proteins can be produced in planta in 
their original or genetically modified form. The 
molecular design of the correspondent expres- 
sion cassettes is largely dependent on the choices 
of expression system (Schiermeyer et al. 2004). 
The key considerations are the size, solubility, 
folding parameters, as well as possible glycosyl- 
ation and other posttranslational modifications. 
Recombinant products can also be designed as 
protein fusions or chimeras with affinity tags that 
can aid in detection and purification (Gomord 
et al. 2004; Benchabane et al. 2008). Many 
proteins (>150) of interest to the pharma industry 
have already been tested for production in planta. 
The advantages of using plants for pharmaceu- 
tical production are that it 1s easy to scale up 
and is expected to incur low production costs. 
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Furthermore, plants are considered as a safe and 
pathogen-free biomass source. 

Plants have been tested for vaccine production, 
often referred to as “edible,” “oral,” or “green vac- 
cines” (Mor et al. 1998; Streatfield and Howard 
2003). The main idea is to express immunogenic 
antigens in edible parts of plants and feed animals 
or humans to immunize them. These antigens are 
capable of inducing strong and protective immune 
responses sufficient to neutralize the pathogens 
(Bae et al. 2009), especially the ones against the 
reemerging infectious diseases, cancer, autoim- 
mune diseases, and others (Ulmer et al. 2006). 
The pioneering work of Dr. Charles Arntzen and 
his colleague Dr. Mason in this area has led to 
many studies and establishment of a novel branch 
of plant biotechnology (Goldstein and Thomas 
2004; Schiermeyer et al. 2004; Ma et al. 2005a). 
Successful oral feeding experiments with some 
strong antigenic proteins such as E. coli entero- 
toxin B (LTB), Cholera toxin B (CT-B), and 
Hepatitis B surface antigen (HBsAg) (Tiwari 
et al. 2009) that do not require immune stimula- 
tors (adjuvants) (Reed et al. 2009) confirm the 
potential for using plant-based vaccines. Plant 
vaccines together with plant-derived functional 
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Fig. 28.3 Schematic representation of the key stages 
for production of pharmaceuticals in plants. The gene(s) 
of interest is cloned in a binary vector, which is then trans- 
ferred into plant cells by either Agrobacterium-mediated 
transformation or direct gene transfer. In the case of stable 


transformation, the transgenic lines are selected and used 
for production of the recombinant protein. Production of 
recombinant protein is usually followed with functional 
testing in lab animals (Modified from Golovkin 2011) 
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antibodies (Hood et al. 2002; Stoger et al. 2002; 
Fischer et al. 2003) have reached the initial stages 
of testing in animals and humans (Table 28.3). 
The first officially approved trial was conducted 
in 1997 with LTB in potato tubers (Tacket et al. 
1998). Some products can be used as a simple 
processed material directly in needle-free immu- 
nization (Ma et al. 2005b; Streatfield 2006; Floss 
et al. 2007; Tiwari et al. 2009). These ground- 
breaking studies and many other studies that fol- 
lowed suggest the potential for engineered plants 
in providing safe and, probably, cheap therapeu- 
tic products. The technology is now transitioning 
from academic labs to industrial production of 
valuable recombinant proteins like subunit vac- 
cines, antibodies, and other therapeutic proteins. 

The “edible” pharmaceutical concept has 
some issues that need to be addressed. For 
instance, the dosage control is difficult because 
of variable quantities of recombinant protein in 
plant tissues (fruits, vegetables, grains, etc.). 
Another concern is a potential environmental risk 
(Mascia and Flavell 2004; Peterson and Arntzen 
2004). The release and use of edible crops, like 
vegetables, fruits, and grains, may lead to possible 
genetic cross-contamination of other agriculturally 
important crops (Sparrow and Twyman 2009). 
Special regulatory measures called “current good 
manufacturing practice” (CGMP) are currently in 


Table 28.3 Plant-derived pharmaceuticals in clinical trials 


569 


place (FDA 2002) to guide the entire research/ 
production process. These regulatory measures 
are applicable to the plant-based pharmaceutical 
products also. A recent approval of the first US 
plant-based (tobacco) recombinant vaccine 
against Newcastle disease in chickens in 2006 
(Vermij and Waltz 2006) indicates that plant 
vaccines are safe and efficacious at least for 
veterinary use. Simple downstream processing 
of biomass (dry or dry/freezing) could also help 
to normalize the active recombinant components, 
e.g., for low-concentration microbicides. The final 
products could be extracts, powders, capsules, 
tablets, and ointments (Daniell et al. 2005; Scott 
2005; Yusibov and Rabindran 2008; Rybicki 
2009; Tiwari et al. 2009). Scientists remain cau- 
tiously optimistic that the latest developments in 
this area will garner the attention of the pharma 
industry and bring needed support to establish 
plant biopharming. 


28.8 Conclusions 


As discussed above, significant progress has been 
made in recent years in engineering plants to 
produce commercially important products. This 
field has moved from a single-gene manipulation 
to introduction of whole new pathways from 


Product Targeted disease Plant source 
Vaccines 

E. coli enterotoxin B (LTB) Diarrhea Potato and maize 
Hepatitis B surface antigen (HBsAg) Hepatitis B Potato and lettuce 
Norwalk virus antigen Sickness/diarrhea Potato 

Rabies glycoprotein G Rabies Spinach 
Antibodies 

LSBC scFVs Non-Hodgkin’s lymphoma Tobacco 
Avicidin Colorectal cancer Maize 

CaroRx Dental caries Tobacco 

Other products 

Gastric lipase Cystic fibrosis, pancreatitis Maize 

Human intrinsic factor Vitamin B12 deficiency Arabidopsis 
Lactoferrin Gastrointestinal infections Maize 

Human glucocerebrosidase Gaucher disease Carrot 


Enzyme 


LTB Heat-labile enterotoxin B subunit, LSBC Lifesaver for Babies and Children, scFVs single-chain Fv fragments 
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heterologous systems. The current toolbox allows 
stacking of a large number of genes at the same 
chromosomal locus as a multigene array and its 
manipulation by genome editing. The choice of 
the tool used depends on several factors such as 
plant species, number of genes to be engineered 
into the plant, the desired expression level, cellular 
compartmentalization of the pathway of interest, 
and the available information about the metabolic 
pools, fluxes, and regulation of the metabolic 
pathway. Reduction in gene synthesis costs and 
progress in assembling larger pieces of synthetic 
DNA in recent years have made engineering 
complex networks possible. 

Although much progress has been made in 
engineering plants to produce several commer- 
cially important products, there are still significant 
challenges in bringing these plant-based products 
to market. The main strategy that has been used 
for engineering plants is based on introduction of 
one or more genes under constitutive, regulat- 
able, or tissue-specific promoters. This approach 
works well in producing mRNA from the 
introduced gene, but it does not always result in 
the required amount of protein due to many post- 
transcriptional and posttranslational regulatory 
mechanisms (e.g., MRNA stability, translatability 
of mRNAs, stability of proteins) that cells employ 
to modulate the levels of proteins and their func- 
tion. Because of this, often-desired compounds 
are produced at very low levels. Incorporating 
elements that impact expression of a gene after 
mRNA biogenesis into gene construction design 
may lead to increased production of desired 
proteins. Also, production of native or novel 
chemicals and compounds in plants at high levels 
often is detrimental to plant growth and develop- 
ment. This is mainly due to lack of our under- 
standing of the interconnections between 
metabolic pathways and insufficient knowledge 
about the multilevel regulation of metabolic 
pathways. These limitations must be addressed in 
order to make the outcome of metabolic pathway 
engineering more predictable and at the same 
time avoid unwelcome surprises from adverse 
phenotypes of transgenic plants. Comprehensive 
global studies on transgenic lines using “omics” 
approaches together with the knowledge obtained 
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from systems biology should help in addressing 
some of the current issues associated with plant 
metabolic engineering. Furthermore, application 
of emerging tools in synthetic biology to manipulate 
existing gene circuits and create novel circuits 
to plant metabolic engineering is also expected to 
mitigate some of these issues. 
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Abstract 

Crop plants are affected by a variety of abiotic stresses such as salinity, 
drought, extreme temperatures, and oxidative stress and cause a significant 
yield loss (more than 50 %). In the near future, these abiotic stresses might 
increase because of global climate change. Abiotic stresses lead to dehy- 
dration or osmotic stress through reduced availability of water for vital 
cellular functions and maintenance of turgor pressure and also result in 
high production of reactive oxygen species (ROS). Plants are evolved with 
various mechanisms such as changes in cellular and metabolic processes 
to cope with the stress condition. Recent developments in molecular genet- 
ics have contributed greatly to our understanding of the biochemical and 
genetic basis of abiotic stress tolerance. This has led to the development of 
abiotic stress-tolerant plants with yield advantage by modulation of the 
expression of the genes that encode for enzymes involved in the biosyn- 
thesis of osmoprotectants (e.g., proline, sugars, sugar alcohol, glycine 
betaine, and polyamines), antioxidant enzymes, protective proteins (e.g., 
LEAs and HSPs), transporters, regulatory proteins, kinases, and transcrip- 
tion factors. More recently, posttranscriptional and posttranslational 
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regulation mechanisms of the abiotic stress response, like microRNAs and 
ubiquitination, appear as promising new modulation targets to develop 
abiotic stress-tolerant plants and contribute to the development of more 
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productive crops to feed the growing mass. 
Keywords 
Transcription factors * miRNAs 

29.1 Introduction 


Plants, as sessile organisms, have had the need 
both to develop and evolve mechanisms that 
allow them to be informed about their changing 
environment, as well as systems and strategies to 
react and adapt to these changes. These capabili- 
ties have allowed plants not only to survive 
against environmental and biological threats but 
also have accorded them the ability to face differ- 
ent types of environment and, thus, colonize 
many habitats on earth. 

Abiotic stress is a term that refers to those 
unfavorable environmental conditions _ that 
adversely affect plant growth and development. 
Some examples of abiotic stresses that a plant 
may face include decreased water availability 
(due to drought or osmotic stress), soil salinity 
(salt stress), or extreme temperatures (heat or 
cold stress), which are considered to be among 
the most threatening environmental conditions 
for crop yield. To mitigate and recover from dam- 
aging effects of abiotic stress, plants have evolved 
various adaptive strategies at the cellular and 
metabolic levels that come into play at the onset 
of stress. 


29.1.1 Abiotic Stress Effects 


One of the most threatening situations that plants 
have to face is water deficit, due to drought or 
osmotic stress, which can result from low and 
erratic rainfall, low water-holding capacity of soil, 
and situations where transpiration rate exceeds 


the rate of water uptake. Drought implies changes 
in water potential gradients between soil and 
plant, such that the plant’s ability to absorb water 
is compromised, leading to loss of cell turgor, 
change in cell volume, denaturation of proteins, 
and changes in several physiological and molecu- 
lar components as well as membrane integrity. In 
response to this situation, the plant triggers cellu- 
lar and metabolic processes including inhibition 
of cell expansion, cell wall synthesis, protein syn- 
thesis, stomata conductance, and photosynthetic 
activity. Also, the plant triggers adaptation mech- 
anisms that include biosynthesis of the phytohor- 
mone abscisic acid (ABA) and production of 
compatible solutes as well as gene products 
involved in osmotic adjustment and protection/ 
repair of cellular structures (Bartels and Sunkar 
2005; Mahajan and Tuteja 2005; Bhatnagar- 
Mathur et al. 2008; Janska et al. 2009). 

Salt stress develops from excessive concentra- 
tion of salt, especially sodium chloride (NaCl) in 
the soil. In a situation of high salinity, soil water 
potential decreases, making the plant face an 
osmotic stress situation initially, which triggers 
responses similar to drought. Later, the salt that 
has entered the root is transported via the xylem 
to the shoot and additional ion-specific toxicity 
effects arise, contributing to growth inhibition 
and accelerated leaf senescence. To face this sec- 
ond phase, the plant sets into motion mechanisms 
concerning 10n uptake and compartmentaliza- 
tion, with an objective of reestablishing ion 
homeostasis (Munns and Tester 2008). 

Salinity and drought stress are often associ- 
ated with temperature situations above the 
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normal/optimum, causing heat stress. In this 
stress situation, many proteins suffer denatur- 
ation and dysfunction, membrane fluidity, and 
permeability; cellular homeostasis is disturbed, 
leading to severe retardation in growth, develop- 
ment, and even death (Hasanuzzaman et al. 
2013). 

Cold or low-temperature stress in normally 
faced by plants growing in temperate and frigid 
areas. Chilling (<20 °C) or freezing (<0 °C) 
temperatures can induce ice formation in plant 
tissues, also leading to a situation of water deficit 
in the cell. Also, as enzyme activities are 
extremely temperature dependent, low tempera- 
tures affect the rate of most biochemical reac- 
tions of the plant cell, including photosynthesis. 
On the other hand, membrane fluidity properties 
also change. Some of the responses triggered by 
cold stress include production of more energy by 
activation of primary metabolism and induction 
of molecular chaperones, which contribute by 
stabilizing proteins against freeze-induced dena- 
turation (Chinnusamy et al. 2007; Theocharis 
et al. 2012). 


29.1.2 Abiotic Stress Perception 
and Response 


The very first event experienced by a plant, when 
one or more environmental factors change from 
their optimum, is sensing. Abiotic stress sensing 
is a complex issue where there is no single sens- 
ing mechanism common to all stresses. In the 
case where a chemical agent is related to the 
stress situation (like heavy-metal stress or nutri- 
ent depletion), a model of ligand-receptor could 
be hypothesized. This model, however, could not 
be applied in the same way to situations where 
stress is produced by changes in physical param- 
eters. For example, primary sensing of tempera- 
ture stress (heat stress or chilling/freezing 
conditions) does not involve any chemical ligand, 
although the existence of receptor proteins that 
change its conformation with temperature could 
be hypothesized. In this sense, several evidences 
support the existence of membrane-associated 
thermosensors that can perceive and respond 


appropriately to mild changes in the fluidity of 
the membrane, which could be generated by 
changes in temperature or in the lipid composi- 
tion of those membranes (Saidi et al. 2011). In 
addition, as stated above, high salinity, drought, 
and cold (freezing) situations produce similar 
changes in the osmosis cell conditions, increas- 
ing water potential inside the plant cell, and thus 
cell turgor decreases. Yeast membrane protein 
SLNI acts as an osmosensor by reacting to 
changes in cell turgor pressure and triggers a 
two-component system, which activates mitogen- 
activated protein kinase (MAPK) _ pathways 
(Maeda et al. 1994; Reiser et al. 2003). The isola- 
tion of the Arabidopsis thaliana histidine kinase 
(HK) receptors ATHK1, homologue to SNLI1 
(Urao et al. 1999), as well as the membrane cyto- 
kinin receptor CRE1, which is able to act as an 
osmosensor when expressed in yeast (Reiser 
et al. 2003), has opened the possibility of the 
existence of osmosensor systems similar to yeast 
in higher plant cells (Fig. 29.1). 

Sensing mechanisms activate a complicated 
interplay of downstream signaling cascades, which 
transmit, amplify, and finally trigger cell responses 
that lead to counterreactions against harmful 
effects of the stress condition. The knowledge of 
stress signaling pathways leading from stimulus to 
the end response in plants has increased over the 
recent years. These downstream signaling path- 
ways are composed by several active participators 
including second messengers, phytohormones, 
and phosphoprotein (protein kinases and protein 
phosphatases) cascades. 

Second messengers are small intracellular sig- 
naling molecules or ions, normally located in the 
cytoplasm of a cell, whose levels change in 
response to a signal received by a membrane sen- 
sor. Second messengers spread by diffusion and 
regulate activity of proteins, helping to distribute 
and amplify the signal received. Second messen- 
gers involved in stress signaling pathways include 
calcium (Ca?*) (Tuteja and Mahajan 2007), reac- 
tive oxygen species (ROS) (Suzuki et al. 2012), 
cyclic nucleotides, and polyphosphoinositides, 
among others (Tuteja and Sopory 2008). 

On the other hand, phytohormones activate 
signaling pathways that could act independently 
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or synergistically with other stress-triggered 
pathways. In this sense, ABA constitutes the 
most studied stress-responsive hormone and 
plays a central role in abiotic stress responses. 
Other phytohormones and signaling molecules 
such as cytokines (CK), salicylic acid (SA), eth- 
ylene (ET), jasmonic acid (JA), nitric oxide 
(NO), sugars, and polyamines (PAs) play a sub- 
stantial direct or indirect role in abiotic stress 
response (Tuteja and Sopory 2008). 

Protein kinases and phosphatases play a fun- 
damental role in coordinating the activity of 
many known signal transduction pathways. 
Those proteins control phosphorylation status of 
their target proteins by catalyzing reversible 
phosphorylation processes. Cell signal transduc- 
tion pathways often rely on phosphoprotein cas- 
cades, including stress responses. They are 
divided into several categories according to their 
structural and functional characteristics. Several 
families are known to be involved in downstream 
stress signaling cascades in plants, including 
MAPK cascades, sucrose non-fermenting 
1(SNF1)-related kinases (SnRKs), — cyclin- 
dependent protein kinases (CDKs), Ca’- 
dependent protein kinases (CPKs), and 
calcineurin B-like(CBL)-interacting protein 
kinases (CIPKs), among others (Boudsocq and 
Lauriére 2005; Boudsocq and Sheen 2013). 

All these elements also interact among them- 
selves, forming a complex network, whose end 
result is modification of target proteins that may 
have enzymatic or structural function, leading to 
quick physiological cell responses. Alternately, 
signaling pathways can also lead to production 
on new proteins by modifying the activity of tran- 
scription factors (TFs) which modulate gene 
expression by binding to specific DNA sequences 
in the promoters of target genes. TFs classify as a 
big family depending upon the presence of 
known DNA-binding domains. Stress-related 
TFs include members of APETALA2/ethylene- 
responsive factor (AP2/ERF), basic helix-loop- 
helix (bHLH), basic leucine zipper (bZIP), 


homeodomain leucine zipper (HD-ZIP), 
myelocytomatosis (MYC),  myeloblastosis 
(MYB), and NAC and WRKY _ families 


(Lindemose et al. 2013). 
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Transcriptome studies have revealed that the 
genes induced by those signaling cascades trig- 
gered by abiotic stress could be divided into two 
major groups according to the functions of their 
products (Bohnert et al. 2001; Fowler and 
Thomashow 2002; Seki et al. 2002). The first 
group, often called functional proteins, consists 
of a large number of proteins with enzymatic and 
structural functions which help to protect the 
cells from the diverse effects of stress, such as 
membrane proteins that help to reestablish cell 
homeostasis (as water channel proteins and mem- 
brane transporters), biosynthesis enzymes of 
metabolites for osmotic adjustment or with pro- 
tective functions (also called compatible solutes), 
ROS detoxification enzymes (as glutathione 
S-transferases, hydrolases, superoxide  dis- 
mutases, and ascorbate peroxidases), and other 
proteins for macromolecular protection [such as 
late embryogenesis abundant (LEA) proteins, 
chaperones, and mRNA-binding proteins]. The 
second group comprises a variety of regulatory 
proteins (transcription factors, protein kinases, 
receptor protein kinases, ribosomal-protein 
kinases, and _ transcription-regulation protein 
kinase) and signal transduction proteinases 
(phosphoesterases and phospholipase C) involved 
in the regulation of signaling cascades that con- 
trol expression of additional genes whose prod- 
ucts could belong, in turn, to each of the two 
groups (Agarwal et al. 2006; Shinozaki and 
Yamaguchi-Shinozaki 2007). 

Apart from TFs, modulation of gene expres- 
sion in response to abiotic stress may be per- 
formed at other levels. In this sense, there are 
some reports where involvement of epigenetic 
mechanisms has been pointed out, as the altera- 
tion of methylation status of DNA or modifica- 
tions of nucleosomal histones, in the response to 
environmental cues and to different types of abi- 
otic stresses (Boyko and Kovalchuk 2008; Kim 
et al. 2010). A second level of tuning of gene 
expression could be performed by posttranscrip- 
tional regulation mechanisms. In this sense, most 
of the alternative splicing events that have been 
described in response to abiotic stress concern 
genes with regulatory roles (Mastrangelo et al. 
2012; Seo et al. 2013). Additionally, the role of 
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miRNAs in stress responses has been demon- 
strated (de Lima et al. 2012). By the other hand, 
posttranslational modification of the gene prod- 
ucts constitutes a third level of control. Apart 
from protein phosphorylation, it has been 
described that protein sumoylation (Miura et al. 
2007; Xu and Yang 2013) and protein ubiquitina- 
tion (Lyzenga and Stone 2012; Mazzucotelli 
et al. 2008) are processes that play major roles in 
the modulation of plant response to abiotic stress 
(Fig. 29.1). 

The concerted actions of those control mecha- 
nisms ensures that gene expression events down- 
stream stress perception could be carried out in 
adequate spatial and temporal patterns and thus 
shape a specific transcriptome and proteome 
response by the plant, adjusted to each stress situ- 
ation and focused to reach to a new balance 
between growth, development, and survival. The 
complete understanding of the interplay of these 
regulatory systems 1s crucial for the comprehen- 
sion of the molecular mechanisms governing 
plant adaptation to environment as well as for the 
design of strategies to improve plant stress 
tolerance. 


29.1.3 Strategies to Improve Plant 
Resistance 


As previously stated by Mizoi and Yamaguchi- 
Shinozaki (2012), there are three possible strate- 
gies that could be undertaken to improve plant 
stress resistance: 

1. A stress escape strategy that implies to change 
the plant life cycle, e.g., by making plants 
grow and flower only during the more favor- 
able seasons 

2. An avoidance strategy that implies to modify 
the architecture of organs or tissues in order to 
prevent the formation of stressful internal 
environments in plants, such as extremely 
high-salt concentrations or high osmotic 
potentials inside tissues 

3. A tolerance strategy that implies to improve 
tissue and plant cell viability in order to toler- 
ate stressful internal environments 
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Regarding abiotic stress, a majority of the 
physiological and molecular biology studies on 
plant tolerance have been undertaken using the 
third strategy above, by researching whose are 
the cellular responses to abiotic stress and how 
they are regulated. In some cases, knowledge has 
been gained using transgenic approaches, which 
also resulted in generation of stress-resistant 
plants. In other cases the knowledge gained has 
served as basis for the design of alternative trans- 
genic strategies that also had led to the produc- 
tion of resistant plants. 


29.2 Transgenic Approaches 
for Improving Abiotic Stress 
Tolerance 


29.2.1 Overproduction of Functional 
Proteins 


The adaptation of the plant to abiotic stress 
includes a series of cell physiological processes 
including the production of metabolites and pro- 
teins involved in protective functions. Therefore, 
some of the strategies used for obtaining plants 
with increased stress tolerance have been aimed 
at the modulation of the expression of genes 
involved in the biosynthesis of these metabolites, 
as well as those coding for those proteins that 
play a direct part in the mechanisms of protective 
function of the plant cell (Fig. 29.1). 


29.2.2 Enzymes for Production 
of Protective Metabolites 


In response to different stresses, plants use a 
common physiological survival mechanism that 
consists in the accumulation of low-molecular- 
weight organic compounds that are highly solu- 
ble in water and nontoxic at high concentrations. 
Those compounds, often called osmoprotectants 
or compatible solutes, protect plants from stress 
by contributing to cellular osmotic adjustment as 
well as by helping to protect cell structures from 
damage and denaturation by acting as ROS 
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scavengers and/or as _ low-molecular-weight 
chaperones that protect membrane integrity and 
stabilize proteins (Chen and Murata 2002). A 
wide range of metabolites have been identified as 
osmoprotectants, including amino acids (e.g., 
proline), quaternary and other amines [e.g., gly- 
cine betaine (GB) and polyamines (PAs)], and a 
variety of sugars and sugar alcohols (e.g., man- 
nitol and trehalose). Overproduction of such 
osmoprotectants has been extensively used as 
strategy to attempt to improve tolerance to abi- 
otic stresses (Bhatnagar-Mathur et al. 2008). 


29.2.3 Proline 


It is well known that many plants and other 
organisms, including eubacteria, protozoa, as 
well as marine invertebrates, accumulate proline 
to a high concentration in response to various 
stress conditions. Apart from acting as an osmo- 
lyte and contributing to the maintenance of cell 
turgor or osmotic balance, other roles have been 
proposed for proline in plant stress response, 
such as molecular chaperone, being a metal che- 
lator, forming part of the antioxidative defense 
mechanism, or even playing a role in signaling. 
On the other hand, proline accumulation could 
also be a sink of carbon and nitrogen for use after 
relief of water deficit (Ashraf and Foolad 2007; 
Verbruggen and Hermans 2008). Plants are able 
to synthesize proline from two different precur- 
sors. One biosynthesis pathway converts gluta- 
mate to proline by two successive reductions 
catalyzed by pyrroline-5-carboxylate synthase 
(P5CS) and pyrroline-5-carboxylate reductase 
(PS5CR), respectively (Delauney and Verma 
1993). Proline could also be obtained from orni- 
thine (Orn), which is transaminated to P5C by 
Orn-6-aminotransferase (OAT), a mitochondrial 
enzyme. By the other hand, proline is degraded 
by a process reverse to proline biosynthesis and 
catalyzed by proline dehydrogenase (PDH) and 
PSC dehydrogenase (PSCDH). Proline biosyn- 
thesis is a cytosolic and plastidial process, while 
proline degradation takes place in mitochondria 
(Verbruggen and Hermans 2008). 
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The accumulation of proline in plants under 
stress 1s caused either by the induction of expres- 
sion of proline biosynthesis genes or by the 
repression of the genes of its degradation path- 
way. During osmotic stress, glutamate pathway is 
the main route used by plants for proline accumu- 
lation, although in young Arabidopsis plants, the 
Orn pathway seems also to contribute by 
enhancement of OAT activity (Verbruggen and 
Hermans 2008). Proline overproduction by trans- 
genic approaches has been achieved by the same 
strategies: favoring proline synthesis by overex- 
pression of P5CS, P5CR, or OAT genes or inhibi- 
tion of proline degradation by PDH silencing by 
antisense or knockout mutants. An excellent 
review by Kavi Kishor and Sreenivasulu (2014) 
includes an updated list of 65 transgenic plants 
where some of these strategies have led to proline 
accumulation and abiotic stress tolerance. 


29.2.4 Amines and Polyamines 


GB is a quaternary amine compound widely dis- 
tributed in higher plants and is synthesized in 
many plant species at elevated rates in response 
to various types of environmental stress (Chen 
and Murata 2008). In plants, GB is synthesized 
from choline via two sequential enzymatic reac- 
tions catalyzed by choline monooxygenase 
(CMO) and betaine aldehyde dehydrogenase 
(BADH), while in the soil bacterium Arthrobacter 
sp., GB synthesis from choline is performed by a 
single enzyme, choline oxidase, which is encoded 
by the codA gene (Chen and Murata 2002). Also, 
in extreme-halophytic phototrophic — bacteria 
Actinopolyspora halophilaand Ectothiorhodospira 
halochloris, GB is synthesized by a third pathway 
from glycine catalyzed by two enzymes, namely, 
glycinesarcosine methyltransferase (GSMT) and 
sarcosine dimethylglycine methyltransferase 
(SDMT), by a three-step N-methylation reaction 
(Chen and Murata 2002). Several plants species, 
known as natural accumulators of GB, normally 
contain low levels of GB, but they increase sig- 
nificantly their GB levels when subjected to abi- 
otic stress. Indeed, many of those plants are 
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salt-tolerant plants (Chen and Murata 2008; 
Rhodes and Hanson 1993). In contrast, many 
other plant species including Arabidopsis, rice, 
tobacco, potato, and tomato show no detectable 
GB levels under normal or stressful conditions 
and consequently have been chosen as good can- 
didates for the engineering of GB biosynthesis. 
In a variety of cases transgenic lines generated 
from several plant species, being most of them 
from natural non-accumulators of GB plants, have 
been obtained overexpressing GB-biosynthetic 
genes from bacteria and plants. The various trans- 
genic plants accumulate GB at a variety of levels 
and exhibit enhanced tolerance to various types of 
stresses (Park et al. 2007; Zhou et al. 2008; He 
et al. 2010; Zhang et al. 201 1a, b). An extensive 
list could be found in the reviews by Chen and 
Murata (2011) and Wani et al. (2013). Several 
mechanisms have been proposed to explain this 
enhanced tolerance. By one side, GB could act as 
a stabilizer of certain protein complexes, prevent- 
ing its denaturation when plant cells are exposed 
to stress conditions. Also, it has been observed 
that GB is an inducer of the expression of specific 
genes that encode ROS-scavenging enzymes and 
thus could prevent the accumulation of excess 
ROS that could accumulate when plant are 
exposed to abiotic stress. Combination of both 
actions will result in conservation of the integrity 
of cell membranes and protein complexes such as 
photosynthetic machinery, ion channels, or trans- 
lation machinery, helping to prevent their dena- 
turation when exposed to abiotic stress conditions 
(Chen and Murata 2011; Wani et al. 2013). 

PAs are a group of low-molecular-weight 
amine-containing aliphatic molecules. The most 
widespread forms of PAs in living organisms are 
the diamine putrescine (Put), triamine spermi- 
dine (Spd), and tetramine spermine (Spm), which 
differ in the number of aminopropyl moieties 
added to the carbon skeleton of Put and thus in 
their number of positively charged amine groups 
at physiological pH. PAs have been implicated as 
key players in a wide array of plant growth and 
developmental processes as well as in biotic and 
abiotic stress responses (Bhattacharya and Rajam 
2006; Alcazar et al. 2010; Kusano et al. 2008; 
Takahashi and Kakehi 2010). 
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Intracellular levels of PAs in plants are mostly 
regulated by anabolic and catabolic processes as 
well as by their conjugation to hydroxycinnamic 
acids and macromolecules like proteins and 
DNA. Plants synthetize the essential diamine Put 
either directly from ornithine (Orn) decarboxyl- 
ation in a reaction catalyzed by ornithine decar- 
boxylase (ODC) or by an alternative pathway 
that involves arginine (Arg) decarboxylation by 
arginine decarboxylase (ADC) followed by two 
additional successive steps catalyzed by agmatine 
iminohydrolase (AIH) and N-carbamoylputrescine 
amidohydrolase (CPA). Spd and Spm are the 
result of sequential additions of aminopropyl! moi- 
eties to Put and Spd, respectively, by the enzymes 
Spd synthase (SPDS) and Spm synthase (SPMS). 
Decarboxylated S-adenosylmethionine (dcSAM) 
is used by both enzymes as donor molecule of 
aminopropyl groups and is synthetized from the 
decarboxylation of S-adenosylmethionine (SAM) 
in a reaction catalyzed by SAM decarboxylase 
(SAMDC). Spd and dcSAM can also form a 
structural isomer of spermine, thermospermine 
(tSpm) in a reaction catalyzed by tSpm synthase 
(tSPMS). PAs are catabolized through oxidative 
deamination in reactions catalyzed by diamine 
oxidase (DAO) and polyamine oxidase (PAO) 
activities. DAOs display high affinity for 
diamines like Put, producing A!-pyrroline, 
hydrogen peroxide (H,O,), and ammonia, while 
PAOs oxidize secondary amine groups from Spd 
and Spm, leading to the formation of 4-aminobutanal 
or (3-aminopropyl)-4-aminobutanal, along with 
1,3-diaminopropane (DAP) and H,0,. PAOs are 
also able to catalyze the back-conversion of Spm 
to Spd, producing 3-aminopropanal and H,Q,. 
Some PAO isoforms are also involved in back- 
conversion processes of tSpm to Spm, and Spm 
to Put, which lead to the production of H,O, 
(Alcazar et al. 2010; Bitrian et al. 2012; Moschou 
et al. 2012). 

A large number of studies have shown 
increases in PA levels in different plant species 
under a number of environmental stress condi- 
tions, and its accumulation correlates with 
enhanced stress tolerance (Bouchereau et al. 
1999; Alcazar et al. 2006; Bhattacharya and 
Rajam 2006; Groppa and Benavides 2008). Gene 
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expression studies have also been pointed out 
that the expression of some of the PA biosynthe- 
SIS genes increases in response to stress, although 
with different levels, suggesting a differential 
regulation of PA pathway genes during stress, 
consistent with different pathways involved in 
the regulation of PA _ biosynthesis (Alcazar 
et al. 2010). 

Modulation of PA metabolism has also been 
achieved by heterologous or homologous overex- 
pression of ODC, ADC, SAMDC, SPDS, and 
SPMS in tobacco (Kumria and Rajam 2002; Waie 
and Rajam 2003), eggplant (Prabhavathi and 
Rajam 2007), tomato (Hazarika and Rajam 
2011), pear, sweet potato, Arabidopsis, and rice 
(Pujni et al. 2007) (a detailed list could be found 
in Marco et al. 2012). Although the degree of 
modification of one or more specific PAs obtained 
is not the same in all plants, all transgenic lines 
share a common enhancement of the tolerance 
against one or more abiotic stresses. Indeed, the 
improvement in tolerance observed always cor- 
relates with elevated levels of Put and/or Spd and 
Spm (Kumar et al. 2006; Marco et al. 2012). 
Thus, the elevation of PA levels in the plant as a 
response to stress seems to have a protective role. 
However, the question about what are the precise 
mechanisms of action by which PAs are able to 
exert this protection against challenging environ- 
mental conditions remains still unresolved, 
although some roles have been put forward 
(Alcazar et al. 2010; Gill and Tuteja 2010). 
Several protective roles proposed have been 
related to their chemical structure. As polyca- 
tions, PAs are able to modulate ion balance of the 
cell and interact with anionic molecules such as 
DNA, RNA, proteins, and membrane lipids 
(Feuerstein and Marton 1989; Schuber 1989). In 
this sense, PAs could act as chaperone molecules, 
which bind to those macromolecules not only to 
protect them from degradation but also to help to 
maintain the most stable conformation under 
Stress conditions. On the other hand, the dual 
anion- and cation-binding properties of PAs con- 
fer them radical-scavenging properties (Bors 
et al. 1989) and open the possibility that PAs 
could play an antioxidative role in the cell. 
Additionally, PA catabolism produces H,QO,, 
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which could act as a signaling molecule and 
trigger the activation of the antioxidative defense 
response of the plant. However, the fact that this 
peroxide production could also be a source of 
oxidant species makes the PA antioxidant role 
still a matter of controversy (Groppa and 
Benavides 2008). More recently, gene expression 
studies have pointed out that modification of 
endogenous PA levels alters the expression of 
some important genes, most of them stress 
related. Therefore, other mechanisms by which 
PAs can perform their protective action against 
stress could be through the activation at tran- 
scriptional level of plant defense mechanisms, 
which in turn would render stress tolerance. In 
this regard, some evidences point out that some 
of the changes in gene expression observed could 
be a direct or indirect consequence of cross talk- 
ing between PAs with other signaling pathways, 
including ABA, JA, gibberellic acid (GA), Ca’, 
and NO, among others (Alcazar et al. 2010; 
Marco et al. 2012). 


29.2.5 Sugars and Sugar Alcohols 


Mannitol is a sugar alcohol that accumulates 
upon salt and water stress in plants and appears to 
function in osmotic stress tolerance by serving as 
a compatible solute or osmoprotectant (Bohnert 
and Jensen 1996). Among plants, celery (Apium 
graveolens L.) is the major mannitol producer, as 
well as a Salt-tolerant plant. Indeed, mannitol 
biosynthesis in celery is increased by salt stress 
(Everard et al. 1994). The main transgenic 
approach to obtain plants with elevated mannitol 
levels has been by the introduction of the bacte- 
rial mannitol-1P dehydrogenase (encoded by 
mtlD gene in E. coli) in several plant systems. 
Mannitol accumulation has been observed when 
mtlD gene is overexpressed in_ tobacco 
(Tarczynski et al. 1992; Karakas et al. 1997), 
Arabidopsis (Thomas et al. 1995), eggplant 
(Prabhavathi et al. 2002), poplar (Hu et al. 2005), 
wheat (Abebe et al. 2003), and rice (Pujni et al. 
2007). The simultaneous expression of two 
genes, mtID and glucitol-6-phosphate dehydro- 
genase, in transgenic loblolly pine has conferred 
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enhanced tolerance to salt stress (Tang et al. 
2005). Interestingly, marker-free transgenic 
tomatoes were also developed for tolerance to 
multiple abiotic stresses like salinity, drought, 
and heavy metal by engineering mannitol accu- 
mulation (Gupta and Rajam 2013). An alternative 
successful approach has been the transformation 
of Arabidopsis plants with the celery gene for 
mannose-6P reductase (M6PR) that usually con- 
verts mannose-6P to mannitol-1P as part of the 
path to mannitol biosynthesis (Zhifang and 
Loescher 2003). In all cases, transgenic lines 
showed an increase in mannitol levels and an 
improved tolerance to water and/or salinity stress. 
However, another common observation on those 
transgenic plants is that the amount of accumu- 
lated mannitol is too small to account for its 
effect as an osmolyte, and some authors suggest 
that this protective effect could be exerted through 
its ROS quencher properties (Abebe et al. 2003; 
Shen et al. 1997). More recently, transcriptome 
analysis of Arabidopsis overexpressing celery 
MO6PR gene indicates that mannitol-enhanced 
stress tolerance is due at least in part to increased 
expression of a variety of stress-inducible genes, 
including members of the ROS-detoxifying sys- 
tem (Chan et al. 2011). 

Trehalose is a nonreducing disaccharide con- 
sisting of two units of glucose bound by an a-a- 
(1-1) linkage. It is present in diverse organisms 
including algae, fungi, bacteria, lower and higher 
plants, as well as insects and invertebrates (Elbein 
et al. 2003). In some organisms, like bacteria and 
yeast, this sugar accumulates in response to 
osmotic stress, heat stress, and drought stress, 
playing a protective role (lordachescu and Imai 
2008). Trehalose is also present in high concen- 
trations in anhydrobiotic organisms (Drennan 
et al. 1993). Those organisms, among which are 
those called “resurrection plants’, are able of sur- 
viving the loss of most of their water content until 
a quiescent stage is achieved and upon watering 
rapidly revive and are restored to their former 
state. In those plant species, trehalose accumula- 
tion helps to survive complete desiccation by sta- 
bilizing dehydrated proteins, lipid membranes, 
and biological structures. While in resurrection 
plants trehalose constitutes the main soluble 
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sugar, the levels of this sugar in other higher 
plants are very low to consider that could exert 
the same full-scale protective role, and it has 
been suggested that in most plant species sucrose 
has taken over the role of trehalose as a preserva- 
tive molecule during desiccation (Goddijn and 
van Dun 1999). Besides, since in vitro studies 
have demonstrated that trehalose can stabilize 
dehydrated biological structures such as lipid 
membranes or enzymes, more effectively than 
other sugars, in recent years there has been a 
growing interest in trehalose metabolism as a 
means of engineering stress tolerance in higher 
plants, including crops. Higher plants, as well as 
various eukaryotic and prokaryotic organisms, 
synthesize trehalose from UDP glucose by a two- 
step pathway catalyzed by trehalose phosphate 
synthase (TPS) and trehalose phosphate phos- 
phatase (TPP). On the other hand, trehalose deg- 
radation is performed by the enzyme trehalase 
(TreH), forming two glucose residues. Different 
groups have attempted to create stress-tolerant 
plants by introducing TPS genes in tobacco 
(Almeida et al. 2005), potato (Stiller et al. 2008), 
tomato, Arabidopsis, and rice (Redillas et al. 
2012) (a detailed list could be found in 
Iordachescu and Imai 2008). In most cases, an 
improvement in abiotic stress tolerance, mainly 
drought, has been reported. However, the levels 
of trehalose accumulated in these transgenic 
plants are generally low (compared to those mea- 
sured in resurrection plants) and do not correlate 
well with the level of tolerance, suggesting that 
trehalose in plant stress response may have a 
function other than direct protective roles 
(lordachescu and Imai 2008). Some possible 
roles have been proposed, like ROS scavenger, 
signaling molecule, an antiapoptotic role, or act- 
ing as compatible solute but localized in specific 
organs, cells, or organelles (Fernandez etal. 2010). 


29.2.6 Enzymes for Membrane Lipid 
Biosynthesis 


Membranes are critical sites of injury by chilling, 
freezing, and heat and osmotic stresses. Hence, 
transgenic approaches have also been attempted 


29 Genetic Engineering Strategies for Abiotic Stress Tolerance 


to engineer cell membranes with an improved 
stress tolerance. Adaptation of living cells to low 
temperatures involves alterations in the 
membrane lipid composition, for example, by 
decreasing membrane fluidity through increase 
of their unsaturated fatty acid percentages 
(Nishida and Murata 1996). Therefore, several 
researchers have tried to modulate the level of 
membrane lipid unsaturation by transgenic 
approaches. Plastidial cis-unsaturated phosphati- 
dylglycerol levels have been increased by over- 
expression of chloroplastic glycerol-3-phosphate 
acyltransferase (GPAT) in tobacco (Murata et al. 
1992), rice (Yokoi et al. 1998; Ariizumi et al. 
2002), and tomato (Sui et al. 2007), leading to 
plants with an improved photosynthesis tolerance 
to cold. On the other hand, modulation of unsatu- 
rated membrane lipids have also been achieved 
by overexpression, co-suppression, or mutation 
of fatty acid desaturase genes, leading to changes 
in cold or heat tolerance of several plants 
(Matsuda and Iba 2005; Upchurch 2008). 


29.2.7 Antioxidant Enzymes 


Although plant cells use reactive oxygen species 
(ROS) as a part of their signal transduction path- 
ways (Miller et al. 2010, 2011), its accumulation 
above a certain threshold causes oxidative damage 
to cell components and leads to cellular impair- 
ment (Gill and Tuteja 2010; Suzuki et al. 2012). To 
protect against oxidative stress, plant cells have 
developed an extensive ROS-scavenging network, 
which involves nonenzymatic antioxidants, includ- 
ing vitamin C, vitamin E, glutathione, carotenoids, 
and flavonoids, and also numerous enzymatic 
mechanisms such as multiple ascorbate peroxi- 
dases (APX), dehydroascorbate reductase 
(DHAR), catalases (CAT), superoxide dismutases 
(SOD), glutathione peroxidases (GPX), glutathi- 
one S-transferases (GST), alternative oxidases, and 
peroxiredoxins (Halliwell 2006). Since the major- 
ity of abiotic stresses induce the accumulation of 
active ROS, which in turn cause oxidative stress 
(Apel and Hirt 2004; Laloi et al. 2004), one of the 
classical approaches explored toward obtaining 
increased stress tolerance has relied on boosting 
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cell enzymatic and nonenzymatic ROS-scavenging 
systems (Gill and Tuteja 2010; Miller et al. 2010). 
In this sense, an improved stress tolerance has 
been observed in several transgenic plants that 
overexpress several antioxidant enzymes such as 
GPX, SOD, APX, and GST (John et al. 2010). 
SOD enzymes catalyze the conversion of anion 
superoxide O, to peroxide H,O, and are present 
in the plant cell as two isoforms, one found in 
the chloroplast (Cu/ZnSOD), whereas a 
Mn-containing enzyme is located in the mito- 
chondria (MnSOD). Tobacco plants transformed 
with pea Cu/ZnSOD under the control of a con- 
stitutive promoter show an increased tolerance to 
chilling (Gupta et al. 1993). Also, overexpression 
of Avicennia marina Cu/ZnSOD enhances salt 
and drought tolerance of transgenic rice plants 
(Prashanth et al. 2008). Drought tolerance has 
also been observed in pepper plants overexpress- 
ing a pea Cu/ZnSOD (Chatzidimitriadou et al. 
2009). Similarly, overexpression of MnSOD 
enhances salt and cold tolerance in Arabidopsis 
(Wang et al. 2004) as well as improves drought 
tolerance in rice (Wang et al. 2005), alfalfa 
(McKersie et al. 1996), Brassica napus (Gusta 
et al. 2009), and potato (Waterer et al. 2010). 
H,O, generated by SOD or by other metabolic 
processes could be detoxified by the action of 
APX/DHAR/GST enzymes in the glutathione- 
ascorbate cycle, which decomposes peroxide to 
water, by the action of GPX, leading to the pro- 
duction of oxidized glutathione, or by a dismuta- 
tion reaction catalyzed by CAT. Induction of the 
genes that code for APX isoenzymes in response 
to salinity and drought has been reported in sev- 
eral plant species, and overexpression of APX 
genes has led to stress tolerance in tobacco and 
Arabidopsis (Badawi et al. 2004; Kavitha et al. 
2008; Lu et al. 2007). Also, transgenic 
Arabidopsis plants overexpressing a rice DHAR 
are resistant to salt stress (Ushimaru et al. 2006). 
On the other hand, tobacco seedlings overex- 
pressing a cDNA encoding an enzyme with both 
GST and GPX activities show a better growth 
when control seedlings are exposed to chilling 
and salt stress (Roxas et al. 1997). Also, overex- 
pression of a GST in Arabidopsis improves seed 
germination and seedling salt tolerance (Qi et al. 
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2010). Moreover, heterologous overexpression of 
E. coli catalase KatE gene leads to improved salt 
tolerance in rice (Moriwaki et al. 2008) and 
tobacco (Al-Taweel et al. 2007), while transgenic 
rice Overexpressing a wheat catalase cDNA dis- 
plays increased capability to maintain low levels 
of peroxide and enhanced tolerance to low- 
temperature stress (Matsumura et al. 2002). 

Simultaneous overexpression of more than 
one ROS-detoxifying enzyme has also been 
reported in some studies. Transgenic rice plants 
constitutively co-expressing glutathione 
S-transferase (GST) and CAT genes show 
enhanced tolerance to salinity and oxidative 
stresses at the vegetative stage (Zhao and Zhang 
2006). Moreover, tobacco plants that simultane- 
ously express CuZnSOD, APX, and DHAR in 
their chloroplasts are more tolerant to salt and 
oxidative stress as compared to those transgenic 
lines expressing single- or double-gene transfer 
(Lee et al. 2007). 

Manipulation of ROS levels and improvement 
of stress tolerance have also been approached by 
using other proteins related to antioxidant mech- 
anisms. Aldehyde dehydrogenase (ALDH) cata- 
lyzes the oxidation of toxic aldehydes, which 
accumulate as a result of side reactions of ROS 
with lipids and proteins (Kirch et al. 2004). 
Overexpression of the aldehyde dehydrogenase 
AtALDH3 gene in Arabidopsis conferred toler- 
ance to dehydration, salt, and heavy metals, sug- 
gesting that ALDH can maintain membrane 
integrity under osmotic stress (Sunkar et al. 
2003). On the other hand, it has also been shown 
that ectopic expression of the cotton stress- 
responsive metallothionein MT3a gene, involved 
in the regulation of metal homeostasis and oxida- 
tive stress, increases tolerance to salt, drought, 
and low temperature in tobacco (Xue et al. 2009). 


29.2.8 Protective Proteins: LEAs, 
HSPs, and Other Chaperones 


LEA proteins constitute a diverse group of 
families of very hydrophilic proteins whose 
expression was first described during seed matu- 
ration (Galau et al. 1986). Their intracellular 
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accumulation is tightly correlated with acquisi- 
tion of desiccation tolerance in seeds of anhydro- 
biotic plants (Bartels 2005) as well as in response 
to other abiotic stress conditions such as drought 
and cold (Battaglia et al. 2008). LEA-type pro- 
teins are encoded by RD (responsive to dehydra- 
tion), ERD (early responsive to dehydration), 
KIN (cold inducible), COR (cold regulated), and 
RAB (responsive to ABA) genes in different plant 
species (Shinozaki and Yamaguchi-Shinozaki 
2000; Zhu 2002). Although their actual mode of 
function is unclear, it has been proposed that 
LEAs could perform their protective role due to 
their highly hydrophilic nature that could give 
them the capacity of sequestering ions that are 
concentrated during cellular dehydration, as well 
as performing a chaperone-like function by 
retaining water molecules and thus preventing 
protein aggregation and protecting cellular com- 
ponents from inactivation and denaturation dur- 
ing the water deficit situation common to 
high-salt, drought, and freezing conditions 
(Bartels and Sunkar 2005; Goyal et al. 2005; 
Vinocur and Altman 2005). 

Overexpression of some LEA genes has been 
correlated with desiccation and salt tolerance in 
several cases. Ectopic expression of the group 3 
LEA protein HVA1 from barley confers tolerance 
to drought and salt stress in rice (Rohila et al. 
2002; Xu et al. 1996) and improves biomass pro- 
ductivity and water use efficiency in transgenic 
wheat under water deficit conditions (Bahieldin 
et al. 2005; Sivamani et al. 2000) as well as leads 
to an enhancement of drought and salinity toler- 
ance in mulberry (Checker et al. 2012). Also, 
overexpression of Brassica napus gene ME- 
leaN4, which encodes another group 3 LEA pro- 
tein, in transgenic Chinese cabbage plants 
enhances their growth ability under salt and 
drought stress conditions (Park et al. 2005). In 
addition, homologous overexpression of 
OsLEA3-I gene improves transgenic rice yield 
under drought stress (Xiao et al. 2007). On the 
other hand, it has been reported in some cases 
that overexpression of dehydrins, a subfamily of 
group 2 LEA proteins, confers tolerance to freez- 
ing (Battaglia et al. 2008). Homologous overex- 
pression of Arabidopsis dehydrin CORI5a 
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enhances the freezing tolerance of chloroplasts 
and protoplasts (Steponkus et al. 1998). 
Overexpression of other dehydrins from 
Arabidopsis, such as RAB18, COR47, LT129, 
and LTI30, also leads to transgenic plants with an 
improved freezing tolerance (Puhakainen et al. 
2004). Further, tobacco plants overexpressing 
citrus dehydrin CuCOR19 show earlier germina- 
tion and better seedling growth under low tem- 
perature than control plants (Hara et al. 2003), 
and leaves from transgenic strawberry plants 
overexpressing a wheat dehydrin WCOR410 
show an improved freezing tolerance (Houde 
et al. 2004). More recently, it has been reported 
that overexpression of JcLEA, a group 5 LEA 
protein identified in Jatropha curcas, confers an 
increased resistance to both drought and salt 
stresses in Arabidopsis transgenic plants (Liang 
et al. 2013). 

The production of heat shock proteins (HSPs) 
is one of the most prominent responses of virtu- 
ally all cells under conditions of high tempera- 
ture. Five families of HSP have been described 
(Hsp100/ClpB, Hsp90/HtpG, Hsp70/Dnak, 
Hsp60/GroEL, and small HSP), and they are 
thought to act as molecular chaperones. Many 
plant HSPs are induced by heat stress, and their 
role in heat tolerance has been shown in trans- 
genic or mutant plants (Kotak et al. 2007). HSPs 
are also induced by water stress in several plants 
(Bartels and Sunkar 2005). It has been described 
in some cases that HSP overexpression enhances 
heat tolerance in plants. For instance, the cultured 
cells of carrot overexpressing HSP/7.7 gene per- 
form with better growth rate than control cultures 
at 37 °C (Malik et al. 1999). Also, the constitu- 
tive expression of HSP/IOJ gene in Arabidopsis 
helps seedlings tolerate sudden shifts to extreme 
temperatures better than the untransformed con- 
trol plants (Queitsch et al. 2000), and its overex- 
pression in rice also enhances the heat tolerance 
of the transgenic lines (Katiyar-Agarwal et al. 
2003). In other cases, this protective effect 1s 
extended to other stress conditions. In this sense, 
Arabidopsis plants overexpressing small heat 
shock protein show enhanced tolerance to 
drought and salinity (Sun et al. 2001). Also, 
overexpression of sHSPJ7.7 in rice leads to 


transgenic plants with an enhanced drought and 
osmotic stress tolerance (Sato and Yokoya 2008). 
More recently, it has been demonstrated that 
constitutive expression of a cotton HSP, GHSP26, 
enhances drought tolerance in transgenic cotton 
plants (Maqbool et al. 2010). 

Other reports have pointed out that stress tol- 
erance could be achieved by overexpressing other 
proteins with chaperone activity. The expression 
of two bacterial RNA chaperones, E. coli CspA 
and B. subtilis CspB, resulted in enhanced toler- 
ance to abiotic stress, by maintaining growth, 
photosynthesis, and development in rice, maize, 
and Arabidopsis (Castiglioni et al. 2008). More 
recently, heat tolerance has been obtained in 
Arabidopsis by overexpression of NADPH- 
thioredoxin reductase, type C (NTRC), a compo- 
nent of a high-molecular-weight complex with 
redox-dependent holdase-chaperone activity 
(Chae et al. 2013). On the other hand, tobacco 
plants overexpressing calnexin, a Ca**-dependent 
molecular chaperone, exhibit better germination 
under osmotic stress and survival under dehydra- 
tion stress conditions (Sarwat and Naqvi 2013). 


29.2.9 Transporters 


One of the main strategies for achieving greater 
tolerance to abiotic stress is to improve the capac- 
ity of plants to reestablish both ionic and osmotic 
homeostasis that are altered under stressful envi- 
ronments, like high-salt environments. Under 
saline conditions, Na* and Cl are the predomi- 
nant toxic ions for cell metabolism affecting 
plant growth and development, and the mainte- 
nance of a high cytosolic K*/Na* ratio is essential 
for plant salt tolerance (Glenn et al. 1999). Na* 
transporters are involved in reestablishing ionic 
homeostasis after salt stress, preventing Na* 
accumulation in cells either by increasing Nat* 
storage in the vacuole or by improving their 
excretion out of the root (Tuteja 2007). 
Manipulation of tonoplast Na* transport in order 
to improve salt tolerance has been one of the 
genetic engineering approaches with greatest 
success. Arabidopsis vacuolar Na‘’H* antiporter 
AtNHX1 has been overexpressed in Arabidopsis 
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(Apse et al. 1999), as well as in tomato (Zhang and 
Blumwald 2001), Brassica (Zhang et al. 2001), 
cotton (He et al. 2005), wheat (Xue et al. 2004), 
beet (Beta vulgaris) (Yang et al. 2005), and tall 
fescue (Festuca arundinacea) (Zhao et al. 2007), 
with a common result of increased salt tolerance. 
Likewise, overexpression of the rice ortholog, 
OsNHX1, in rice (Fukuda et al. 2004) and maize 
(Chen et al. 2007) showed improved salt stress 
tolerance. Similar results were observed when 
cotton GhNHX/ gene is overexpressed in tobacco 
(Wu et al. 2004). NHKs from halophytes Atriplex 
gmelini AZNHX1 (Ohta et al. 2002) and Suaeda 
salsa SSNHX1 (Zhao et al. 2006a) also improved 
salt tolerance when overexpressed in transgenic 
rice. Other members of the NHX family from 
Arabidopsis have also been tested, like AtLNHX3, 
whose overexpression in sugar beet confers an 
increased resistance to high salinity (Liu et al. 
2008), as well as AtNHXS5, which enhances salt 
and drought tolerance in rice (Li et al. 201 1c) and 
paper mulberry (Broussonetia papyrifera) (Li 
et al. 201 1b). 

On the other hand, the plasma membrane 
Na*H* antiporter salt overly sensitive 1 (SOS1) 
is involved in Na* extrusion and long-distance 
Na* transport from root to shoot (Shi et al. 2002) 
and improves salt tolerance of transgenic 
Arabidopsis when overexpressed (Shi et al. 
2003). Also, heterologous overexpression of 
plasma membrane Na*/H* antiporter SOD2 from 
yeast (Schizosaccharomyces pombe) in trans- 
genic rice increases salt tolerance (Zhao et al. 
2006b). 

Previously, another transporter from yeast, 
which facilitates intracellular K* accumulation 
and decrease of intracellular Na, HAL1, was 
overexpressed in tomato, leading to plants with 
increased fruit yield and enhanced K/Na* selec- 
tivity in leaves (Rus et al. 2001). 

Vacuolar proton pumps, which generate a pro- 
ton electrochemical gradient across tonoplast that 
is used by other vacuolar co-transporters, have 
been another point where transgenic strategies 
have been attempted to improve the plant capacity 
to withstand against osmotic disturbances. In this 
sense, transgenic plants expressing higher levels 
of the vacuolar proton-pumping pyrophosphatase 
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(H*-PPase), AVP1, have an increased capacity to 
retain more solutes and water and are more resis- 
tant to salt and drought than are wild-type plants 
(Gaxiola et al. 2001). AVPI has also been overex- 
pressed in cotton, leading to similar results 
(Pasapula et al. 2011). On the other hand, other 
reports have used H*t-PPase TsVP from halophyte 
Thellungiella halophila. Overexpression of TsVP in 
tobacco leads to transgenic plants that accumulate 
more solutes than wild-type plants and are more 
resistant to saline conditions (Gao et al. 2006). 
TsVP overexpression also increases cell solute 
content and improves their resistance to saline 
stress in cotton (Lv et al. 2008). Cotton TsVP- 
overexpressing plants are also resistant to drought 
(Lv et al. 2009), and similar results were also 
obtained in maize (Li et al. 2008). 

Also, co-overexpression strategies of vacuolar 
transporters have been tried. The overexpression 
of the Suaeda salsa SsNHXI and Arabidopsis 
AVP/ in rice transgenic plants exhibited an 
enhanced salt tolerance as compared to plants 
overexpressing SsNHX/ alone (Zhao et al. 
2006a). 

Plants exposed to saline environments not 
only face Na* toxicity, but also water loss caused 
by osmotic stress. Thus, the ability to maintain 
water content under stress conditions is critical 
for plant survival. Aquaporins are intrinsic mem- 
brane proteins implicated in water diffusion and 
regulated in response to environmental cues 
(Li et al. 2014). Overexpression of aquaporins 
has resulted in contrasting results, with 1mprove- 
ment of tolerance to salt and drought in 
Arabidopsis when aquaporins PgTIPI from 
Panax ginseng (Peng et al. 2007), TaNIP from 
wheat (Gao et al. 2010), cotton GhPIP2;7 (Zhang 
et al. 2013), and OsPIP1 (OsPIP1-1) or OsPIP2 
(OsPIP2-2) from rice (Guo et al. 2006) are over- 
expressed. Tolerance to salt is also observed in 
tobacco plants overexpressing rice TaAQPS8 
(Hu et al. 2012), as well as a better performance 
in water deficit conditions in the case of rice 
plants overexpressing aquaporin RWC3 (Lian 
et al. 2004). However it has been reported in 
other cases that overexpression of aquaporins 
increased plant susceptibility to stress, like the 
drought-sensitive tobacco plants overexpressing 
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Arabidopsis PIP1b (Aharon et al. 2003) or the 
salt-sensitive rice plants overexpressing wheat 
HvPIP2 (Katsuharaet al. 2003). Also, Arabidopsis 
plants that overexpress Glycine soja GsTIP2 
plants show an increased water loss under drought 
stress conditions (Wang et al. 2011). Since nega- 
tive effects on stress resistance were rather seen 
when an aquaporin of interest was overexpressed 
in a heterologous plant species, it has been specu- 
lated that in those cases the foreign aquaporin 
may not be properly recognized by the endoge- 
nous stress response machinery (Li et al. 2014). 


29.2.10 Overproduction 
of Regulatory Proteins 


The mechanisms of response and adaptation of 
plants to stress generally involve a combination 
of processes, which must occur by temporally 
and spatially appropriate patterns. To do that, the 
activities of the proteins that ultimately perform 
these processes are modulated by signaling cas- 
cades that ensure those patterns. Therefore, 
another transgenic strategy for obtaining stress- 
resistant plants has been the modulation of the 
expression of genes encoding the regulatory pro- 
teins that are part of those signal transduction 
pathways, in order to mimic or even enhance the 
plant adaptation mechanisms. 


29.2.11 Kinases 

Protein kinases are involved in various stages of 
plant development and adaptation. Among them, 
MAPK plays an essential role in the signal trans- 
duction pathways that link cell stimulus percep- 
tion with cell response. Indeed, MAPKs 
constitute one of the most studied signaling 
mechanisms in plants. The MAPK cascades are 
generally organized as modular pathways in 
which the activation of upstream MAPKKKs 
leads to the sequential phosphorylation and sub- 
sequent activation of downstream MAPKKs and 
MAPKs. Through their kinase activity, MAPK 
cascades are an integral part of the mechanisms 
that translate incoming environmental cues into 


393 


posttranslational modifications of target proteins, 
triggering a myriad of transcriptomic, cellular, 
and physiological responses in the plant. In this 
sense, MAPK pathways are known to be acti- 
vated by diverse abiotic stresses such as cold, 
salt, heat, drought, wounding, UV irradiation, 
osmotic shock, ozone, or heavy-metal intoxica- 
tion (Samajova et al. 2013). 

Modulation of MAPK cascades by transgenic 
modification of the expression and/or activity of 
some of its members has led to abiotic stress tol- 
erance in some cases. Tobacco MAPKKK NPKI1 
confers resistance to multiple environmental 
stress conditions when an active version of this 
MAPKKK is constitutively expressed in this 
plant (Kovtun et al. 2000). Moreover, the consti- 
tutive low-level of expression of this active 
tobacco NPK1 in maize, increases its tolerance to 
freezing (Shou et al. 2004a) and drought (Shou 
et al. 2004b). More recently, it has been reported 
that Arabidopsis plants overexpressing 
MAP3K64 exhibit enhanced tolerance to salt 
stress (Shitamichi et al. 2013). In addition, the 
overexpression of another rice MAPKKK from 
rice DSM1 increases the tolerance of transgenic 
rice seedlings to dehydration (Ning et al. 2010). 

Regarding MAPKK genes, Arabidopsis plants 
overexpressing AtMKK2 exhibit constitutive 
AtMPK4 and AtMPK6 activity, as well as a con- 
stitutively upregulated expression of  stress- 
induced genes, and show an increased tolerance 
to salt and cold (Teige et al. 2004). Also in 
Arabidopsis, AIMKK1 overexpression enhances 
drought resistance (Xing et al. 2008), and plants 
overexpressing AtMKK3 exhibit an enhanced 
tolerance to salt and an increased ABA sensitivity 
(Hwa and Yang 2008). In addition, maize 
MAPKK ortholog ZmMKKI1 confers chilling 
resistance when overexpressed in transgenic 
tobacco (Cai et al. 2014), while overexpression 
of cotton GhMKK1 increases salt and drought 
stress tolerance of Nicotiana benthamiana plants 
(Lu et al. 2013). On the other hand, tolerance to 
salt and cold stress is obtained in Arabidopsis 
plants by overexpressing maize ZmMKK4 (Kong 
et al. 2011). Overexpression of an active 
OsMKK6 enhances tolerance to salt stress in rice 
(Kumar and Sinha 2013). 
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Overexpression of MAPK genes has also lead 
to stress tolerance in other cases. Thus, ZmMPK7 
enhances protection against ROS-mediated 
injury during osmotic stress when it is overex- 
pressed in transgenic tobacco (Zong et al. 2009), 
while Arabidopsis plants constitutively overex- 
pressing ZmSIMK1 have an improved resistance 
against salt stress (Gu et al. 2010). Moreover, 
homologous overexpression of OSMAPKS leads 
to transgenic plants with increased multiple-stress 
tolerance (Xiong and Yang 2003). 

CPKs are Ca’* sensors involved in Ca’*- 
mediated signal transduction in plant cells and 
play vital regulatory roles in plant development 
and stress responses (Boudsocq and Lauriere 
2005; Boudsocg and Sheen 2013; Valmonte et al. 
2014). Overexpression of CPK genes AtCPK4 
and AtCPKI// renders Arabidopsis plants with an 
enhanced ABA-induced stomata closure and an 
improved capacity to retain water that makes 
them more tolerant to drought stress (Zhu et al. 
2007b). Transgenic plants with similar pheno- 
type have also been obtained by heterologous 
overexpression of maize genes ZmCPK4 (An 
et al. 2008) and ZmCPK12 (Wang and Song 
2013). On the other hand, Arabidopsis plants 
overexpressing AtCPK6 have an increased sur- 
vival rate to salt and drought stress (Xu et al. 
2010). As an example of a different approach, 
disruption of AtCKP23 gene expression by 
T-DNA insertion increased Arabidopsis tolerance 
to drought and osmotic stresses, suggesting its 
role as a negative regulator in plant signaling cas- 
cades involved in the response to those stresses 
(Ma and Wu 2007). CPK modulation by trans- 
genic approaches has also been described in rice, 
where overexpression of OsCDPK7 confers both 
cold and salt/drought tolerance (Saijo et al. 2000), 
whereas plants overexpressing OsCDPK21 
(Asano et al. 2011) and OsCDPK12 (Asano et al. 
2012) show an enhanced salt tolerance. Also, it 
has been described that OsCDPK13 overexpres- 
sion leads to rice plants with improved recovery 
rates after cold stress (Abbasi et al. 2004). 

Ca’*-mediated signal transduction pathways 
could also be mediated by CBLs (Boudsocq and 
Lauriére 2005). Binding of Ca** to CBLs changes 
their conformation, making them able to interact 
with a specific group of protein kinases designed 
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as CIPKs (Kim et al. 2000). CBL-CIPK signaling 
pathways have been thoroughly studied in the 
salt stress response of Arabidopsis. CBL SOS3 
and CIPK SOS2 are part of the SOS signaling 
pathway, which leads to sodium ion homeostasis 
in response to salt stress. The CBL-CIPK com- 
plex SOS2/SOS3 activates the Na*/H* antiporter 
SOS1 by phosphorylation (see section 29.2.9) 
(Qiu et al. 2002). Arabidopsis plants overex- 
pressing single genes SOS/, SOS2, and SOS3 or 
co-expressing them have been generated (Yang 
et al. 2009). The transgenic plants overexpressing 
SOS3 or SOS/ did exhibit a clear increase in salt 
tolerance compared to the control plants. 
However, overexpression of SOS2+SOS3_ or 
SOSI+SOS2+SOS3 did not result in an improve- 
ment of plant tolerance as compared to transgenic 
plants overexpressing only SOS3 or SOS/. 
Regarding other CBLs, it has been described 
that overexpression of AfCBL/ triggers an 
increase of the expression of drought and salt 
stress-related genes, which is correlated with an 
enhanced tolerance to both these stresses, 
whereas T-DNA knockout mutants display the 
opposite phenotype (Albrecht et al. 2003; Cheong 
et al. 2003). Similar results were obtained in 
Arabidopsis plants overexpressing AtCBL5 
(Cheong et al. 2010). Heterologous overexpres- 
sion of CBLs in Arabidopsis has also been 
reported. In this regard, maize ZmCBL4 overex- 
pression enhances the salt tolerance of 
Arabidopsis at the germination and seedling 
stages (Wang et al. 2007). Also, Arabidopsis 
plants overexpressing poplar CBLs PeCBL6 or 
PeCBL10 show enhanced tolerance to high salin- 
ity, drought, and low temperature (Li et al. 2013). 
With regard to other CIPKs, transgenic rice 
plants overexpressing OsCIPK03, OsCIPK12, 
and OsCIPKI5 genes display a significantly 
improved tolerance to cold, drought, and salt 
stress, respectively (Xiang et al. 2007). Also, 
chickpea CaCIPK6 overexpression increases salt 
tolerance and root development in_ tobacco 
(Tripathi et al. 2009). Overexpression of another 
CIPK isolated in apple MdCIPK6L also confers 
tolerance to salt, drought, and chilling stress in 
transgenic Arabidopsis, tomato, and _ apple 
(Wang et al. 2012). More recently, it has been 
reported that the identification of wheat 
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TaCIPK29, anew member of CIPK gene family, 
and its overexpression in tobacco transgenic 
plants displayed higher germination rates, longer 
root lengths, and better growth status than wild- 
type plants under salt stress conditions (Deng 
et al. 2013). 

Another family of kinases induced by hyper- 
osmotic and saline stresses in Arabidopsis and 
rice is the SNF 1-related protein kinase 2 (SnRK2) 
family (Boudsocq and Lauriere 2005). Two cases 
of transgenic overexpression of SnRK2 genes 
have been described. Overexpression of SRK2C 
gene in Arabidopsis enhances drought tolerance 
significantly in concert with the upregulation of 
stress-responsive gene expression (Umezawa 
et al. 2004). In addition, rice SnRK2 gene SAPK4 
has also been overexpressed in rice, which results 
in an improvement of germination and growth 
rates under salt stress (Diedhiou et al. 2008). 


29.2.12 Transcription Factors 


Transcription factors (TFs) are essential mem- 
bers of the cell signal transduction pathways that 
transduce the information carried previously by 
upstream signals (second messengers, kinases) to 
changes in gene expression. The importance of 
TFs as regulatory proteins in plants is evidenced 
by the fact that they represent an important part 
of their genomes, as in the case of Arabidopsis, 
which has 1,500 TFs in its genome (Ratcliffe and 
Riechmann 2002). Moreover, TF gene families 
have experienced higher expansion rates during 
evolution of plants, suggesting their main role in 
the adaptation of plants to various environments 
(Shiu et al. 2005). Those features have made TFs 
to be considered as good target candidates for the 
generation of stress-tolerant plants. 

Each single TF can control the expression of 
many target downstream genes in parallel through 
the specific union of the TF to cis-acting ele- 
ments in the promoter regions of their respective 
target genes, forming a transcription unit called 
“regulon” (Nakashima and Yamaguchi-Shinozaki 
2006). Several regulons have been described in 
the gene response of plants to abiotic stresses. 
The study of the expression mechanisms of 
osmotic stress- and cold stress-responsive genes 
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in Arabidopsis has allowed to identify multiple 
regulons. Among them, the dehydration- 
responsive element-binding protein/C-repeat 
(CRT)-binding factor (DREB1/CBF) regulon is 
involved in both osmotic stress- and cold stress- 
responsive gene expression, while other regu- 
lons, including DREB2, AREB/ABF, MYB/ 
MYC, and NAC, are involved in osmotic stress- 
responsive gene expression (Nakashima and 
Yamaguchi-Shinozaki 2006). 

DREB1/CBF and DREB2 are TF members of 
the AP2/ERF family that bind to the cis-element 
DRE/CRT. DREB1I/CBF refers to a set of three 
TFs (DREB1A, DREB1B, and DREBIC) highly 
similar in amino acid sequences and whose genes 
are located in tandem within the Arabidopsis 
genome (Nakashima and Yamaguchi-Shinozaki 
2006). Cold stress immediately induces the 
DREB1I/CBF genes, whose products activate the 
expression of multiple stress-inducible target 
genes. Dehydration induces DREB2 genes, lead- 
ing to the expression of various genes that are 
involved in drought stress tolerance (Liu et al. 
1998; Nakashima et al. 2000). Therefore, these 
TFs have been used as preferred targets to gener- 
ate transgenic plants with resistance to cold and 
drought stress. Thus, transgenic Arabidopsis 
lines overexpressing DREBIA, DREBIB, and 
DREBIC under control of constitutive promoters 
exhibit remarkably enhanced salt, drought, and 
cold tolerance (Kasuga et al. 1999; Liu et al. 
1998). Arabidopsis DREB1/CBF genes have also 
been found in a variety of plant species and have 
been shown to enhance abiotic stress tolerance in 
other plants either by heterologous or homolo- 
gous overexpression (an extensive list of exam- 
ples could be found in Hussain et al. 2011 and 
Agarwal et al. 2013). The DREB1/CBF genes 
apparently produce abiotic stress tolerance by 
upregulating a suite of native stress-responsive 
genes whose products produce physiological 
adaptations that enable the plant cells to cope 
with osmotic stress, but their ectopic expression 
usually results in dwarf phenotypes (Agarwal 
et al. 2013; Nakashima and Yamaguchi-Shinozaki 
2005). To overcome this problem, stress- 
inducible promoters with low background 
expression under normal growth conditions have 
been used to drive expression of the DREBI/CBF 
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genes to achieve increased stress tolerance with- 
out growth retardation. This approach has suc- 
ceed by using stress-inducible RD29A promoter 
and DREBIA in Arabidopsis (Kasuga et al. 
1999), tobacco (Kasuga et al. 2004), wheat 
(Pellegrineschi et al. 2004), and _ peanut 
(Bhatnagar-Mathur et al. 2007). Genes coding for 
DREB2 homologues have also been isolated 
from several plant species. However, in contrast 
to the DREBI/CBF genes, overexpression of 
DREB2 in Arabidopsis does not improve stress 
tolerance, suggesting that DREB2 proteins are 
tightly regulated by upstream mechanisms, 
including posttranscriptional modifications (Liu 
et al. 1998). DREB2A possess a negative regula- 
tory domain that could be eliminated by deletion, 
generating a constitutive active form 
(DREB2A-CA). Transgenic Arabidopsis plants 
overexpressing DREB2A-CA showed growth 
retardation and improved tolerance to drought 
and heat stress (Sakuma et al. 2006a, b). Similar 
results are obtained when maize ZmDREB2A 
(Qin et al. 2007) and rice Os DREB2B (Matsukura 
et al. 2010) are overexpressed in the same plant. 

The basic leucine zipper TFs ABRE-binding 
protein/ABRE-binding factor (AREB/ABF) bind 
to cis-element ABRE and activate ABA- 
dependent gene expression (Nakashima and 
Yamaguchi-Shinozaki 2006). Arabidopsis 
AREBI/ABF2 (AREBI) and AREB2/ABF4 
(AREB2) are induced by drought, NaCl, and 
ABA treatment and are activated by an ABA- 
dependent phosphorylation process (Uno et al. 
2000). Arabidopsis plants overexpressing 
AREB2 show an enhanced sensitivity to ABA, as 
well as reduced transpiration rate, and enhanced 
tolerance to drought (Kang et al. 2002). On the 
other hand, AREB1 is required for normal glu- 
cose response in Arabidopsis, and its overexpres- 
sion affects stress tolerance to salt, drought, and 
heat (Fujita et al. 2005; Kim et al. 2004). AREF/ 
ABF rice homologues TRAB1 and its close rela- 
tive OsbZIP23 have also been described (Hobo 
et al. 1999; Xiang et al. 2008). Overexpression of 
OsbZIP23 renders transgenic plants with ABA 
hypersensitivity and increased tolerance to 
drought and salt (Xiang et al. 2008). 

AtMYB and AtMYC have been described as 
TFs that recognize distinct cis-elements present 
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in the promoter of the drought-inducible gene 
RD22 and activate it in a cooperative way (Abe 
et al. 1997). Since both TFs are synthetized after 
accumulation of ABA levels, it has been pro- 
posed that they act in a late stage of the stress 
responses. ‘Transgenic plants overexpressing 
AtMYC2 and/or AIMYB2 cDNAs have higher 
sensitivity to ABA and an improved osmotic 
stress tolerance (Abe et al. 2003). Analysis of the 
transcriptome of those transgenic plants has 
allowed the identification of other target genes 
for AUMYB and AtMYC, including other ABA- 
and JA-inducible genes. 

The group of proteins NAM, ATAF, and CUC 
(NAC) were first identified as a group of TFs with 
roles in plant development. Several members of 
the NAC family have also been shown to be 
involved in abiotic stress responses in 
Arabidopsis, regulating both ABA-dependent 
and ABA-independent genes (Fujita et al. 2004; 
Tran et al. 2004). Three NAC TFs have been iso- 
lated by its binding ability to the promoter of the 
early responsive to dehydration | (ERD/) gene. 
Transgenic plants overexpressing those NAC 
cDNAs show an upregulation of several drought- 
inducible genes, as well as an increased drought 
tolerance, although ERD/ is not induced in those 
plants (Tran et al. 2004). Another NAC from rice, 
OsNAC6, is induced by ABA and abiotic stresses 
such as high salinity, cold, and drought 
(Nakashima et al. 2007) and when overexpressed 
under the control of both constitutive and stress- 
inducible promoters yields transgenic rice plants 
with an improved drought and salinity tolerance 
(Nakashima et al. 2007). Water-stress tolerance 
has also been observed in transgenic rice plants 
overexpressing other stress-inducible NAC 
genes, SNAC/ (Hu et al. 2006) and SNAC2 (Hu 
et al. 2008). In addition, overexpression of 
another rice NAC gene, ONAC045, increases tol- 
erance of the rice plants to drought and salt with- 
out negative effects on growth (Zheng et al. 
2009). 

Other transcription factors that do not belong 
to those regulons have also been used successfully 
to obtain plants with an abiotic stress resistance. 
Additional examples could be found in the 
reviews of Agarwal et al. (2013) and Hussain 
et al. (2011). 
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29.2.13 Other Levels of Regulation 


As described in section 29.1.2, recent findings 
have pointed out that gene expression in response 
to stress could be fine-tuned by posttranscrip- 
tional and posttranslational mechanisms. Some 
authors have reported successful approaches 
where components of those regulatory systems 
have been targeted to develop transgenic plants 
with stress tolerance. 


29.2.14 miRNAs 


miRNAs are the most abundantly expressed and 
well-studied class of the noncoding regulatory 
small RNAs (21-24 nucleotides) that are accu- 
mulated and produced in plants. miRNAs regu- 
late gene expression in plants by causing 
degradation or inhibition of the translation of 
their complementary mRNA targets (Sunkar 
2010). In plants, it has been pointed that miRNAs 
have diverse biological functions, playing roles 
in the regulation of optimal growth and develop- 
ment as well as in abiotic and biotic stress 
responses (de Lima et al. 2012; Khraiwesh et al. 
2012; Sunkar et al. 2012). Interestingly, TF genes 
are the major targets for several stress-regulated 
miRNAs, making them good candidates for new 
transgenic approaches focused to alter stress 
response. In this sense, tomato plants overex- 
pressing S/y-miR169 are more tolerant to drought 
stress (Zhang et al. 201 1a, b). Also, overexpres- 
sion of Osa-miR319 yields rice plants resistant to 
cold (Yang et al. 2013), as well as transgenic 
creeping bentgrass tolerant to salt and drought 
(Zhou et al. 2013). The details on the role of miR- 
NAs in abiotic stress tolerance have been covered 
in Chap. 36 of this book. 


29.2.15 Ubiquitination 


Defective proteins and proteins which are no 
longer required for cell are tagged by covalent 
union of chains formed by multiple units of the 
small protein ubiquitin. Ubiquitin attachment is 
mediated by the sequential action of ubiquitin- 
activating (El), ubiquitin-conjugating (E2), and 


597 


ubiquitin ligase (E3) enzymes, being E3 the 
enzyme which recognizes the target proteins and 
ensures ubiquitination specificity. A number of 
studies have pointed out the relevance of the 
ubiquitin-dependent protein degradation in 
stress-related signaling and response mecha- 
nisms (Zhang and Xie 2007; Zhu et al. 2007a). In 
fact, transcriptome and proteome studies from 
several plant species indicate that hundreds of 
ubiquitination-related transcripts/proteins are 
modified after exposure to abiotic stresses, sug- 
gesting a role for ubiquitination in_ stress 
responses (Mazzucotelli et al. 2008). 

Particularly, many components of the ubiqui- 
tination pathway have been identified or pre- 
dicted to be essential in ABA homeostasis and 
signaling (Lee and Kim 2011), and some exam- 
ples of upregulation of E3 ubiquitin ligases have 
led to transgenic plants with stress tolerance 
through ABA-related mechanisms. XERICO, an 
H2-type zinc-finger E3 ubiquitin ligase, controls 
the level of ABA by enhancing the transcription 
of the key ABA biosynthesis gene AtNCED3. 
Thus, upregulation of the XERICO gene in 
Arabidopsis enhances ABA levels and results in 
improved drought tolerance due to an increased 
stomatal closure (Ko et al. 2006). Moreover, TF 
ATHB6 is a negative regulator of ABA response, 
whose degradation is promoted by MATH-BTB, 
a component of an E3 ubiquitin ligase complex. 
Overexpression of MATH-BTB leads _ to 
Arabidopsis plants with an increased drought tol- 
erance with minimal phenotypic alterations 
(Lechner et al. 2011). Also, Arabidopsis E3 
ligase SDIR1 is induced by drought and salt 
stress, and it has been identified as a positive reg- 
ulator in ABA response. SDIR1 overexpression 
in Arabidopsis confers an enhanced ABA- 
induced stomatal closure, as well as a drought 
tolerance phenotype, which is also observed 
when this Arabidopsis gene is overexpressed in 
tobacco and rice (Zhang et al. 2008). On the other 
hand, AtAIRP1 and RHA2b have also been iden- 
tified as other Arabidopsis E3 ubiquitin ligases 
that act as a positive regulator in ABA-mediated 
drought responses, and its respective overexpres- 
sion leads to transgenic plants with an improved 
resistance to drought stress (Li et al. 201 1a; Ryu 
et al. 2010). 
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Regarding other drought-response mecha- 
nisms, it has been suggested that hot pepper E3 
ligase RmalH1 and its Arabidopsis homologue 
Rmal target Arabidopsis aquaporin PIP2;1, 
inhibiting its traffic from the ER to the plasma 
membrane, and play a positive role in drought 
tolerance mechanisms. In fact, RmalHI/ overex- 
pression in Arabidopsis increases tolerance to 
drought stress (Lee et al. 2009). 

Ubiquitination mechanisms have also been 
targeted to improve cold stress tolerance. ICE] is 
a TF involved in the expression of cold-inducible 
regulon CBF/DREB1. Degradation of ICE1] 
under cold is mediated by ubiquitin E3 ligase 
HOS1, leading to the attenuation of plant cold 
response. The overexpression of a point-mutant 
version of ICE] resistant to ubiquitination leads 
to Arabidopsis transgenic plants with high levels 
of this TF and an improved cold tolerance (Miura 
et al. 2011). 

Apart from E3 ligases, it has been reported 
that constitutive overexpression of peanut E2 
enzyme AhUBC2 in Arabidopsis upregulates the 
expression of |ABA-independent _ stress- 
responsive genes, increases the synthesis of Pro, 
and renders plants with improved drought toler- 
ance (Wan et al. 2011). Similar results were 
observed in Arabidopsis plants overexpressing 
soybean E2 gene GmUBC2 (Zhou et al. 2010). 
Additionally, drought tolerance has also been 
observed in tobacco plants overexpressing wheat 
polyubiquitin gene Ta-Ub2 (Guo et al. 2008). 


29.3 Limitations and Future 
Prospects of Transgenic 
Strategies 


As seen in the previous section, many genes asso- 
ciated with plant response(s) to abiotic stresses 
have been identified and used for generating 
stress-tolerant plants, thereby demonstrating the 
potential of transgenic strategies. However, it has 
also been noted by some authors that these strate- 
gies are not exempt from downsides. 

A majority of the overexpression strategies 
have been based on the use of constitutive and 
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strong promoters, such as cauliflower mosaic 
virus 35S (CaMV35S) (Odell et al. 1985), ubiqui- 
tin 1 (UBI/) (Holtorf et al. 1995), and actin 
(McElroy et al. 1990) promoters, which often 
render plants, though resistant to stress, to show 
undesired pleiotropic effects like growth retarda- 
tion and reduced fruit/seed number and fresh 
weight under normal conditions (Wang et al. 
2003; Cabello et al. 2014). For strategies that 
involve manipulation of signaling pathways, 
cross talk with other regulatory pathways (such 
as development) increases the probability of 
appearance of pleiotropic effects. In other cases, 
undesired effects could come from toxic levels of 
the overproduced protein or metabolite. 
Furthermore, although these pleiotropic effects 
may be absent, constitutive activation of stress 
response pathways imposes on the plant meta- 
bolic and energetic costs that have been compen- 
sated by diverting the plant from developmental 
programs, thus resulting in yield penalties and 
consequent limited benefit for agricultural crops. 
In addition, the use of strong constitutive promot- 
ers could accelerate the process of RNA silencing 
of the transgene (Dietz-Pfeilstetter 2010). The 
most common alternative proposed in literature 
to solve this problem is to use stress-inducible 
promoters that ensure activation of the desired 
gene only when needed. In this regard, some of 
the examples cited above have successfully used 
promoters of stress-inducible genes such as the 
Arabidopsis RD29A promoter, or barley HAV22 
promoter, for obtaining transgenic plants with 
desired stress tolerance and without the unde- 
sired effects in control conditions. However, 
widespread use of this strategy requires isolation, 
study, and development of a more extensive cata- 
log of stress-induced promoters with a better 
knowledge of their characteristics, such as thresh- 
old stress level under which the promoter can be 
activated, levels of expression, reversibility, and 
dose-dependence characteristics. 

Furthermore, although a majority of these 
transgenic lines have served as useful tools for 
gaining knowledge in the mechanism of plant 
response to stress, and this has even been applied 
in plants of agronomic interest such as crops, 
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there are still few reports of transgenic lines that 
maintain tolerance to stress in field trials. With 
the exception of drought-resistant maize line 
MON87460, developed by Monsanto Company 
(Monsanto 2012), there have not been any more 
reports of release to the market of stress-tolerant 
genetically engineered varieties. Some explana- 
tions for this limited success have been given. 
Most of the studies have been conducted under 
laboratory growth conditions focusing on short- 
term harsh, single stress conditions where stress 
tolerance/resistance is estimated as the survival 
rate (which is easier to screen). Moreover, in 
most studies, stress tolerance 1s tested at the veg- 
etative stages (seedlings) only. A majority of the 
crops, in contrast, are exposed to heterogeneous 
conditions during their life cycle in the field, 
which include mild episodes of combined stress 
conditions (Mittler 2006; Mittler and Blumwald 
2010), and are particularly susceptible to stress at 
their reproductive stage (during flowering and 
seed/fruit/grain maturation), which constitutes 
the most critical stage determining crop yield. 
Some authors have pointed out the need for stan- 
dardized protocols to assay stress tolerance and 
to phenotype plants which mimic better the situ- 
ations that plants have to face in the field 
(Bhatnagar-Mathur et al. 2008; Cabello et al. 
2014). On the other hand, since the plants in the 
field face stresses of diverse nature simultane- 
ously, additional efforts in the study of cross talk 
between different plant stress response pathways 
should be addressed. In this sense, application of 
new approaches as systems biology (Cramer 
et al. 2011) could help gain better knowledge of 
plant response mechanisms to stress as well as 
lead to isolation of novel targets for transgenic 
modification to enable development of plants 
without growth penalties under normal condi- 
tions and which are able to maintain high produc- 
tivity under stress. 
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Abstract 

Crop plants are exposed to a plethora of biotic and abiotic stresses. Biotic 
stresses such as pathogens (viruses, bacteria and fungi), insect pests, nem- 
atode parasites and weeds cause a significant loss of crop yield and quality. 
Although conventional strategies like breeding for resistant varieties and 
agrochemicals and biocontrol agents for control of diseases and pests have 
been in use for a long time, these have been met with limited success. 
During the last 10 years, technological advancements in genetic engineer- 
ing have led to the development of transgenic crop varieties resistant to 
various biotic stresses. A large number of transgenic crops have been 
developed and more are underway; however, the number of biotech crops 
reaching the field from labs is still limited. Transgenic crops developed 
against insect resistance and/or herbicide tolerance have been commercial 
success stories, an example being Bt cotton. 


Keywords 

Transgenic crops * Biotic stress * Disease resistance * Coat protein ¢ 
Pathogen-related proteins ¢ Insect resistance * Bt crops * Nematode resis- 
tance * Herbicide tolerance * RNAi 


30.1 Introduction 


In the present-day scenario, food availability, 
quality and safety are given top priority, keeping 
in view the changing climate which will, in the 
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agriculture (Easterling et al. 2000) besides 
directly affecting agricultural yield. Crop yield 
losses due to biotic stresses like pest infestation, 
weed overtake and disease occurrence together 
with abiotic factors like drought, salinity and 
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extreme temperature diminish considerably food 
production economics in many parts of the 
world. With climate change predicted to alter 
conditions in various parts of the world, research- 
ers started developing models to forecast adverse 
effects of abiotic and biotic factors on crop pro- 
ductivity. In a study by Collier et al. (2008), it 
was deduced that changes in life cycle of a pest 
with increased number of generations or with 
increase in number of survivors of a particular 
stage of instar could be a consequence of the 
changing climatic conditions in the future. Such 
changes are also predicted for plant pathogens 
(Lonsdale and Gibbs 2002; Turner 2008). Apart 
from this, the tremendous increase in human 
population is bound to elevate food demand in 
terms of both quality and quantity. In fact, it has 
been suggested that food production needs to be 
doubled by 2050 with available arable land to 
ensure food security for future generations 
(Baulcombe 2010; Datta 2013). Further, it was 
estimated that in wheat alone, production needs 
to increase by 2 % annually (Singh and 
Trethowan 2007). 

Enhancement of food production can be 
achieved by various strategies: (1) increasing the 
land area under cultivation, (2) developing 
improved varieties by traditional breeding meth- 
ods, (3) adopting better agricultural techniques 
and tools and (4) adopting transgenic approaches. 
The first approach is neither feasible nor sustain- 
able, as we can no longer convert scarce forest 
areas into arable lands. Although conventional 
breeding has contributed its share, it has limita- 
tions such as non-precise gene transfer, transfer 
of unwanted genes, time-consuming and labori- 
ous (Rajam et al. 1998; Bhatnagar-Mathur et al. 
2008; Hazarika and Rajam 2011). At the same 
time, increasing use of land for purposes other 
than agriculture (like housing, industrial build- 
ing, forest conservation and so on) also needs to 
be considered (Evans 2009). In the present con- 
text, an effective and challenging task is to max1- 
mise land use efficiency by increasing crop 
productivity per unit area. Therefore, better tools 
and techniques can only compliment conven- 
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tional agricultural productivity by use of other 
novel approaches. 

In this regard, transgenic technology has 
gained relevance for crop improvement and has 
added a new thrust to agriculture. Better under- 
standing of biotic and abiotic factors involved in 
crop improvement, and their manipulation for 
better results, is developing as a potential strat- 
egy. Area under cultivation of genetically modi- 
fied (GM) crops is increasing year by year (James 
2013). Although this is a positive sign for increas- 
ing acceptance of GM crops in the commercial 
market (with a notion that the productivity of 
these crop varieties can deal with the demand for 
quantity and quality of food) (Chakraborty et al. 
2000; Datta 2012), concerns regarding the effects 
of these GM products on human health do persist 
(Dona and Arvanitoyannis 2009). It is to be noted 
here that in both conventional breeding tech- 
niques and transgenic crops, change in the genetic 
make-up of the plant may be involved. However, 
in the production of GM varieties via the trans- 
genic technology, genetic changes that are 
brought about are small and defined, whereas in 
conventional breeding, these changes can be rela- 
tively large and may involve a set of ill-defined, 
uncharacterised genes (Datta 2013). In this chap- 
ter, we have given a general overview of biotic 
factors and transgenic approaches being used for 
alleviating these stresses for improving crop 
health and productivity. 


30.2 Types of Biotic Stress 


Plant species are under a constant threat from a 
number of biotic stresses that exist simultane- 
ously in nature. Attack by pathogens and para- 
sites such as bacteria, fungi, viruses and 
nematodes plays a crucial role in plant growth 
and productivity. Apart from these, insect her- 
bivory and theft of nutrients by weeds also cause 
serious economic losses in crop yield. These 
stresses very often coexist, leading to multiple 
stresses which might elevate the level of biotic 
stress incident on crop plants. 
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30.3 Conventional Strategies 
to Alleviate Biotic Stresses 


To protect plants against biotic stresses, various 
prevention and control strategies have been tested 
over the ages. Conventional breeding practices 
for the generation of disease-resistant lines 
against diverse biotic stresses in plants have been 
widely employed over several decades now. 
Besides conventional agricultural practices, 
farmers depend on chemical pesticides and herbi- 
cides to a great extent which leads to significant 
Increase in crop yields. However, the ever- 
increasing environmental and health concerns 
against use of chemicals in agriculture paved the 
way for the development of biological control 
agents (BCAs). Biological control measures 
include use of BCAs, i.e. use of antagonists that 
suppress plant diseases; use of predatory insects, 
entomopathogenic fungi, bacteria and nematodes 
for control of insect pests; and use of weed- 
specific pathogens for control of populations of 
specific weeds. Formulated natural plant prod- 
ucts are also used in integrated pest management 
(IPM) strategies. 


30.4 Why the Need 
for an Alternative? 


Although BCAs and resistant lines of crop plants 
are successfully used for some plant disease and 
insect pest control, emergence of resistance- 
breaking biotypes is rapid in pace. Conventional 
plant breeding techniques are laborious and 
time-consuming. To keep pace with the emer- 
gence of resistant biotypes of pests, breeders 
need to look for new resistant traits that are more 
durable. However, keeping the time required for 
developing a new cultivar in view, it becomes 
difficult to rely wholly on such a strategy. 
Advancements in the field of genetic engineer- 
ing and molecular biology over the past two 
decades have led to the use of transgenic tech- 
nology in crop improvement. 
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30.5 Transgenic Approaches 
to Biotic Stress Tolerance 


in Plants 


The high potential of genetic engineering in 
developing disease-resistant transgenic crop 
plants has been successfully tapped as a strategy 
alternative to conventional breeding. Over the 
years, success stories have thrown up transgenic 
plants resistant to various plant pathogens and 
insect pests, and these engineered crops have 
moved to the field (James 2013) instead of 
remaining as just laboratory curiosities. Thus, 
transgenic crop plants engineered for resistance 
against bacteria, fungi (Wally and Punja 2010), 
virus (Prins et al. 2008), pathogens and nema- 
todes (Atkinson et al. 2003; Tamilarasan and 
Rajam 2013) have become a reality, so also resis- 
tance against insect herbivory (Christou et al. 
2006; Gatehouse 2008) and for dealing with the 
unwanted weeds (Herouet-Guicheney et al. 
2009). 


30.5.1 Viruses 


Plant viruses cause significant losses in crop 
yield and productivity on a global scale. The 
most devastating infectious plant diseases are 
caused by viruses (Anderson et al. 2004) belong- 
ing to the Jospovirus group, causing crop losses 
of more than a billion dollar a year (Prins and 
Goldbach 1998). In view of the limited effective- 
ness of control mechanisms like the use of virus- 
free seeds, prevention by _phytosanitation, 
breeding for resistant lines or prevention of vec- 
tor transmission (Mawassi and Gera 2012), the 
transgenic approach has been used. Introduction 
of natural virus-resistant genes into an existing, 
desirable cultivar can be achieved using genetic 
engineering. Pathogen-derived resistance (PDR) 
is the most commonly used phenomenon. Here, 
transgenic plants expressing genes of a viral 
pathogen show resistance against the same or 
related viruses (Slater et al. 2003). The first 
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application of PDR phenomenon with respect to 
plant viruses was demonstrated by Powell-Abel 
et al. (1986) who showed that transgenic tobacco 
plants expressing tobacco mosaic virus coat 
protein gene were resistant to attack by the 
tobacco mosaic virus. Examples of pathogen- 
derived genes include the replicase protein 
(Rubino and Russo 1995), coat protein (Lindbo 
and Dougherty 1992; Nelson et al. 1993; Ganesan 
et al. 2009), proteinase (Vardi et al. 1993), move- 
ment protein (Cooper et al. 1995) or genes cod- 
ing for host cell lethal proteins like antiviral 
compounds  (ribosome-inhibiting proteins) 
(Wang et al. 1998) or those that target the viral 
genome or particle directly, viz., ribozymes and 
ribonucleases (Sano et al. 1997). Resistance 
against viruses can also be obtained by presetting 
the RNA silencing system in plants by introduc- 
ing and expressing fragments of sense and anti- 
sense viral nucleic acid sequences into 
chromosomes of the plants (Prins et al. 2008; 
Tyagi et al. 2008; Duan et al. 2012), or by 
expressing the virus mini-replicon (Brumin et al. 
2009), or by expression of artificial microRNA 
(Niu et al. 2006). Management of papaya ringspot 
virus (PRSV) is one of the remarkable success 
stories (Gonsalves et al. 2004). 


30.5.2 Bacterial and Fungal 
Pathogens 


Research over the years has led to the use of vari- 
ous strategies for the development of transgenic 
plants resistant to bacterial and fungal pathogenic 
attack (Salomon and Sessa 2012), the first among 
these being expression of resistance genes 
(R-genes). R-genes code for nucleotide-binding 
peptides harbouring leucine-rich repeats (LRRs) 
which are transmembrane in nature (Bent and 
Mackey 2007). These peptides recognise prod- 
ucts of avirulence (Avr) genes produced by the 
pathogen and subsequently induce activation of 
diverse downstream processes that include acti- 
vation of pathogenesis-related genes (PR genes), 
production of high amounts of inhibitory mole- 
cules and their accumulation and induction of the 
hypersensitive response (Kiraly et al. 2007). 
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Introduction of R-gene Rxo/ from maize into rice 
led to the latter’s resistance to bacterial streak 
disease caused by Xanthomonas oryzae pV. oryzi- 
cola (Zhao et al. 2005a), while expression of 
R-gene (RPI-BLB2) from wild potato conferred 
resistance to the cultivated potato against attack 
by Phytophthora infestans (van der Vossen et al. 
2005). 

Many fungal pathogens produce toxins called 
mycotoxins in general. Fusarium culmorum and 
F. graminearum produce deoxynivalenol, a 
trichothecene mycotoxin, which confers viru- 
lence to these fungal strains and also poses tox- 
icity threat to humans and animals (Kimura et al. 
2006). Looking into the way in which the fungus 
deals with the mycotoxin, researchers developed 
transgenic lines of wheat, rice and barley. These 
genetically modified (GM) plants overexpressed 
the trichothecene 3-O-acetyltransferase (tri/0/) 
gene, whose product is involved in acetylating 
deoxynivalenol which results in a much less 
active form of the toxin (Manoharan et al. 2006). 
The transgenic wheat line showed reduced wheat 
spike infection (Okubara et al. 2002), and trans- 
genic rice (Ohsato et al. 2007) and barley 
(Manoharan et al. 2006) yielded grains with far 
less mycotoxin contamination than in the wild 
type. 

Use of antimicrobial peptides and PR-proteins 
in engineering plants for resistance against patho- 
gens is the most common strategy. These proteins 
are expressed constitutively in plants; however, 
their levels shoot up upon pathogen attack (Rao 
et al. 1999; Wally and Punja 2010). Antimicrobial 
metabolites thus produced are involved in degrad- 
ing fungal components like the cell wall, cell 
membrane or RNA, or these induce secondary 
metabolite production or increase physical barri- 
ers in a plant cell (Ferreira et al. 2007). A wide 
range of such peptides have been overexpressed 
in plants for conferring resistance against differ- 
ent plant pathogens, to name a few, chitinases, 
B1-3 glucanases, defensins, thionins, thaumatin- 
like proteins (Rajam et al. 2007), lipid-transfer 
proteins and phenylalanine ammonia lyase 
(reviewed by Wally and Punja 2010). Recent 
reports show that introduction of alfalfa gluca- 
nase into eggplant led to increased resistance 
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against pathogenic fungi like Verticillium dahlia 
and Fusarium oxysporum (Singh et al. 2014). 

Interestingly, polyamines like diamine putres- 
cine, triamine spermidine and _ tetra-amine 
spermine have been implicated in abiotic (Rajam 
et al. 1998) and biotic (Hazarika and Rajam 
2011) stress tolerance (Hussain et al. 2011) 
besides their involvement in the regulation of a 
wide variety of biological processes including 
cell division, growth and development, stabilisa- 
tion of DNA and gene regulation (Igarashi and 
Kashiwagi 2010). Although considerable amount 
of work has been done on genetic manipulation 
of polyamine biosynthesis genes for abiotic stress 
tolerance in various plants (Marco et al. 2012), 
engineering polyamine biosynthesis for biotic 
stress tolerance has not been explored much so 
far, as evidenced by scarce literature on the sub- 
ject. For instance, enhanced fungal resistance 
was reported by overexpressing the polyamine 
biosynthesis gene, S-adenosylmethionine decar- 
boxylase, in tobacco (Waie and Rajam 2003) and 
tomato (Hazarika and Rajam 2011) and arginine 
decarboxylase in eggplant (Prabhavathi and 
Rajam 2007). 

RNA interference (RNAi) strategy involving 
plant-mediated silencing of fungal genes shows 
promise for developing potential tools in the fight 
against fungal attack on crop plants in the near 
future. Fungal uptake of synthetic siRNA specific 
to the ornithine decarboxylase (ODC) gene led to 
downregulation of ODC gene and in turn caused 
significant decrease in mycelial growth and spor- 
ulation in the fungal pathogen, Aspergillus nidu- 
lans (Khatri and Rajam 2007). Transgenic 
tobacco plants that expressed antisense RNA of 
heterologous (mouse) ornithine decarboxylase 
gene were demonstrated to be more resistant to 
Verticillium wilt compared to the wild type plants 
(Rajam 2012a). Also, development of powdery 
mildew causing fungus in wheat and barley, 
Blumeria graminis, was affected adversely by in 
planta expression of antisense transcripts and 
dsRNA specific to the fungus (Nowara et al. 
2010). These initial steps might pave the way for 
the development of a promising tool for control 
of fungal plant pathogens (Rajam 2012b). 
However, success of transgenic crops showing 
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resistance to bacterial and fungal pathogens 1s 
limited in comparison to insect-resistant trans- 
genic crops (Punja 2006). 


30.5.3 Insect Pests 


Success story of the transgenic plants for insect 
resistance started with transforming plants with 
cry genes from Bacillus thuringiensis (Vaeck 
et al. 1987). These genes code for highly insecti- 
cidal crystal proteins called Bt toxins. Upon 
ingestion, the toxin binds to specific receptors in 
the insect midgut epithelium, and further action 
of the proteases activates the toxin. This active 
toxin induces lytic pore formation in the midgut 
epithelial cells, leading to their lysis, and subse- 
quently kills the insect larva (Daniel et al. 2000). 
Over 400 genes coding for these toxin variants 
have been identified from different strains of B. 
thuringiensis (Crickmore et al. 2007). 
Insecticidal spectrum of each of these toxin vari- 
ants is distinct. Since the first transgenic tobacco 
and tomato plants were developed carrying cry 
genes (Vaeck et al. 1987), tissue-specific or con- 
stitutive expression of Bt toxins has been estab- 
lished in many crop plants against specific pest 
species (Pierpoint and Hughes 1996). Bt cotton 
is one among them and is a commercial success. 
Apart from Bt toxin, B. thuringiensis also 
expresses other proteins like vegetative insecti- 
cidal protein (Vip) which is, obviously, insecti- 
cidal in nature. Ingestion of Vip proteins leads to 
the swelling and osmotic lysis of midgut epithe- 
lial cells, causing death of the target insect. 
Unlike cry toxins, these are expressed both dur- 
ing vegetative growth and sporulation of the bac- 
terlum (Estruch et al. 1996; Crickmore et al. 
2007). Like Bt, different Vip toxins act against 
specific groups of insects. Although Vip3 showed 
very high insecticidal activity against major lepi- 
dopteran pests attacking maize and cotton 
(Estruch et al. 1996; Fang et al. 2007), licence 
granted for release of GM Vip cotton is limited 
and controlled (Crothers 2006). Other entomo- 
pathogenic bacteria like Photorhabdus lumines- 
cens also express insecticidal toxin complexes 
whose genes have been introduced into plants 
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(Liu et al. 2003). Transgenic Arabidopsis 
thaliana plants expressing tcdA gene encoding 
for toxin A showed resistance against tobacco 
hornworm and southern corn rootworm. 

Apart from genes from microbes, genes from 
the insect itself are used for killing the pest. One 
such gene is chitinase, which is a development- 
associated gene in insects and is expressed during 
larval moulting, 1.e. shedding of the peritrophic 
membrane and exoskeleton. Insect pests feeding 
on transgenic plants constitutively expressing 
insect chitinase are exposed to a constant level of 
this enzyme all through their development cycle, 
which disturbs their process of regular moulting. 
Ding et al. (1998) showed that transgenic tobacco 
plants expressing chitinase conferred resistance 
to lepidopteran pests. 

Plant transformation with plant genes encod- 
ing for insecticidal secondary metabolites like 
proteinase inhibitors, alpha-amylase and lectins 
has also been shown to confer resistance to insect 
attack. In insects, proteinases are among the 
digestive enzymes essential for catalytic release 
of amino acids from the ingested proteins which, 
thereby, contribute to the insect’s growth and 
development. Hence, ingestion of proteinase 
inhibitors could prove lethal to insects. Proteinase 
inhibitors are produced in high amounts and are 
accumulated by plants in response to herbivorous 
attack or mechanical injury (Ryan 1990). 
Although plants expressing different proteinase 
inhibitors, viz., BTI-CMe (barley trypsin inhibi- 
tor) and MMTI-2 (mustard trypsin inhibitor), 
showed significant insect resistance (Altpeter 
et al. 1999; De Leo et al. 2001), genes for pro- 
teinase inhibitors like cowpea trypsin inhibitor 
(CpTI) have been used in combination with Bt 
toxins to enhance protection of the transgenic 
crop against insects (Gatehouse 2011). 

Expression of protein inhibitors of proteases 
(Wolfson and Murdock 1987; Thomas et al. 
1995) in plants had a significant effect on repro- 
ductive output of target insect pests. Exploiting 
the neuropeptides and inhibitors of polyamine 
biosynthesis with potential insecticidal activity 
might prove beneficial for developing insect 
resistance in plants (Rajam 1991; Raina et al. 
1994; Kumar et al. 2009). 
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With ever-increasing knowledge in the field of 
RNAi, RNA-mediated (sense or antisense RNA, 
dsRNA, artificial microRNA) strategies are being 
employed to induce RNAi in transgenic plants as 
an effective defence mechanism against insect 
attack (Mao et al. 2007; Baum et al. 2007; 
Terenius et al. 2011). One of the major achieve- 
ments is the use of this strategy to control sap- 
sucking insects like aphids. Recent success with 
the use of plant-generated dsRNA and other 
small RNA-mediated approaches, for conferring 
resistance against aphids, is remarkable (Pitino 
et al. 2011; Guo et al. 2014). 


30.5.4 Nematode Parasites 


Plant parasitic nematodes can cause crop yield 
losses up to 20 % in a single crop, and in most 
cases, these losses are caused by attack of root- 
knot nematodes and cyst nematodes (Koenning 
et al. 1999; Bird and Kaloshian 2003; Tamilarasan 
and Rajam 2013). Conventional strategies like 
crop rotation do not hold good for control of 
nematodes with a wide host range, and chemical 
practices are expensive (Abad et al. 2003). 
Alternate transgenic approaches and RNAi, 
which was in fact discovered in the nematode 
Caenorhabditis elegans (Fire et al. 1998), pro- 
vide a better scope for control of plant parasitic 
nematodes. Some of the strategies used for devel- 
oping transgenic plants with resistance to nema- 
todes include anti-invasion and migration, 
anti-nematode feeding and development and 
feeding-cell attenuation strategies (Atkinson 
et al. 2003). Expression of cysteine proteinase 
inhibitor (cystatin) in transgenic plants helped 
potato plants resist nematode attack (Cowegill 
et al. 2002). The parasitism genes express para- 
sitism proteins which help the nematode infect 
the root and parasitise the plants (Davis et al. 
2004). Using RNAi approach, Arabidopsis plants 
were bioengineered to express dsRNA of one of 
the parasitism genes, /6D/0. These transgenic 
plants were resistant to four species of root-knot 
nematodes (Huang et al. 2006). Therefore, this 
strategy can be applied to agriculturally impor- 
tant crops for developing resistance against 
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nematodes. Similarly, the nematode genes 
involved in its development and reproduction and 
the nematode-responsive host plant genes have 
been targeted for developing effective methods of 
plant nematode control (for review, Tamilarasan 
and Rajam 2013). 


30.5.5 Weeds 


Herbicides are chemicals which are used to kill 
the weeds, but at the same time, they are deadly 
to the cultivated crop plants also. Hence, devel- 
opment of transgenic plants resistant to herbi- 
cides was taken up. One of the most commonly 
used herbicide, glyphosate, inhibits the 
enzyme _5-enolpyruvylshikimate-3-phosphate 
(EPSP) synthase. Overexpression of this enzyme 
in transformed Petunia plants conferred resis- 
tance to this herbicide (Shah et al. 1986). 
Expression of an insensitive form of EPSP syn- 
thase also showed similar results (Herouet- 
Guicheney et al. 2009). Since then, this strategy 
has been used for several crop plants. Glyphosate- 
tolerant soybean was successfully commer- 
cialised in 1996, and later, GM crops like 
herbicide-tolerant maize, cotton, etc. were also 
accepted by the farmers (James 2013). Recent 
studies showed that transformation with two 
genes, glyphosate acetyl transferase gat and 
EPSPS G2-aroA genes, conferred high level of 
tolerance towards glyphosate in tobacco plants 
(Din et al. 2014). Further, stacking of genes for 
herbicide tolerance and insect resistance has also 
been reported in crops like cotton, and these are 
being commercialised (James 2013). 


30.6 Concerns About Transgenic 
Plants and Novel Alternative 
Strategies 


In spite of the various advantages of the use of 
transgenic technology, there are some limita- 
tions. One major concern, in general, about the 
release of transgenic plants into the natural envi- 
ronment is transgene flow. Gene flow is a natural 
process among related species (vertical gene 
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flow) or related genera within a family (diagonal 
gene flow) or, even, among unrelated species 
(horizontal gene flow). However, transgenes that 
code for traits not found in related species, or 
even genera, would not integrate into these plants 
had there not been a transgenic. This unnatural 
event might pose a threat to the natural gene pool 
over time (Gressel 2012). The most common 
cases reported in this context are flow of 
herbicide-resistant genes from crops to weeds as 
reported in transgenic glufosinate-resistant rice 
(Song et al. 2011). The ability to control weeds in 
such cases, thus, becomes close to nil. Various 
techniques for transgene containment to prevent 
such transgene flow have been discussed recently 
(Gressel 2012). In the case of a leaky contain- 
ment mechanism where such a transgene move- 
ment takes place, methods of transgene mitigation 
to suppress its spread and establishment in the 
population have also been highlighted (Gressel 
1999, 2012). 

Development of resistant pest populations is 
another shortcoming of transgenic crops express- 
ing insecticidal genes. This has been well noted 
in Bt crops (Shelton et al. 2003; Moar et al. 
2008). Although development of resistant target 
pest populations against Bt crops has been slower 
than against that of Bt sprays (Fox 2003; Gressel 
2012), there are field reports of Bt-resistant pest 
populations, as observed in Spodoptera frugi- 
perda and Helicoverpa zea (Moar et al. 2008; 
Tabashnik et al. 2008). To this problem, “refugia’ 
have worked as a solution. Planting regular rows 
of Bt and non-Bt crops allows survival of the sus- 
ceptible pests. Bt-resistant trait is recessive, and 
as long as the susceptible mating population is 
within reach, selection of resistant genes in the 
population is expected to be very meagre (Bates 
et al. 2005). Pyramiding of two or more cry 
genes, or different insecticidal genes, is expected 
to delay eruption of resistant pest species, as the 
pest needs to then evolve resistance against two 
or more insecticidal toxins simultaneously (Zhao 
et al. 2005b; Watkins et al. 2012). 

A more recent advancement in this field is 
the RNAi approach, which is an alternative to 
expression of transgenes and might prove to be 
advantageous over the latter, considering the 
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limitations and concerns of the transgenic 
approach. RNAi technology can be applied for 
control of those insect pests where effective Bt 
genes are not available, e.g. sucking pests like 
aphids and jassids. One other important point is 
its species specificity which reduces the risk of 
negative effects on nontarget organisms 
(Huesing et al. 2009). Unlike the existing trans- 
genic crops which are based mainly on trans- 
gene protein, RNAi strategy works at the 
post-transcriptional level where there is no pro- 
tein in the transgenic plant, and therefore, it is 
very unlikely that the target pathogen or pest 
would gain resistance in the absence of the 
transgene protein (Rajam 2011; Rajam and 
Singh 2011). A case in point is Bt crops and 
other transgenic plants where target pathogens 
or pests have been shown to be resistant after a 
few generations (Moar et al. 2008; Rajam and 
Singh 2011). Further, a major concern with 
transgenic plants is the possible allergic 
response in the consumers against the expressed 
transgene protein (Niblett and Bailey 2012) 
which, on the other hand, in RNAi plants, might 
not be the case due to absence of any transgene 
protein. This makes these RNAi plants and their 
products safe for human and animal consump- 
tion and greatly reduces the regulatory issues 
related with their commercialisation (Auer and 
Frederick 2009). The beauty of RNAi approach 
is that more than one gene can be effectively tar- 
geted using chimeric RNAi constructs (Gupta 
et al. 2013; Sinha and Rajam 2013) to the patho- 
gen or pest by this technique. This would also 
avoid development of resistance by target organ- 
isms and would allow targeting of more than 
one pathogen or pest, as chimeric RNAi con- 
struct contains sequences of more than one tar- 
get gene (Rajam 2011; Rajam and Singh 2011). 
However, RNAi approach has at least one limi- 
tation, i.e. the sequence(s) introduced can have 
off-target effects on the host plant or other 
organisms, depending upon the extent of homol- 
ogy between the introduced sequence and the 
sequence of nontarget genes (Du et al. 2005). 
Details on the use of RNAi technology for 
crop improvement, including control of diseases 
and pests, are available in the following chapter. 
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30.7. Conclusions and Future 
Prospects 


The most important point which needs consider- 
ation in the use of transgenic approach for allevi- 
ating biotic stress in plants is stability of the 
resistance in engineered plants. Although a large 
number of transgenic plants resistant to various 
biotic stresses are under development, only a few 
have become commercially successful. Examples 
of such commercial successes are Bt crops. One 
of the main features responsible for acceptance 
of Bt crops in the commercial market by consum- 
ers was the proof provided for Bt toxin as having 
no effect on birds and mammals, including 
humans (Goldberg and Tjaden 1990; Mendelsohn 
et al. 2003). For large-scale public acceptance of 
transgenic technologies, the transgenics need to 
be of proven safety for human and animal con- 
sumption; also, the economic balance, 1.e. cost 
involved in developing, raising and registering 
the transgenic crops, should be balanced with 
profit margins of their producers, and finally, ease 
of their availability to the consumer must be 
ensured. 
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Introduction 


Abstract 

Scientific breakthroughs bring about significant advances in basic research, 
which ultimately lead to their applications for human welfare. One such 
discovery is RNA interference (RNAi), a highly conserved gene regula- 
tory mechanism that controls the expression of genes at posttranscriptional 
level. Since the discovery of RNAj, it has become evident that RNAi has 
immense potential for crop improvement. The technology has been 
employed successfully to study the gene function and to alter the gene 
expression in plants for desired traits. RNAi has proved to be a novel and 
potential tool in combating abiotic and biotic stresses. Enriched nutritional 
quality, male sterility, delayed ripening, and secondary metabolite manip- 
ulation add to the list of successful applications of RNAi technology. This 
chapter focuses on the potential of RNAi technology to produce improved 
crop varieties. 
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posing the demand for global sustainable agricul- 
tural practices. To address this demand, 


The crop yield is facing drastic reduction because 
of scarcity of land, water resources, climatic 
changes, and damages caused by abiotic and 
biotic stresses. Increasing population is also 
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conventional breeding is being employed from 
ages as one of the effective ways to raise quality 
plants, but this process is time consuming and 
laborious and also has other ecological, physio- 
logical, and biological constraints. However, 
genetic engineering is playing pivotal role in crop 
improvement. Novel traits can be effectively 
incorporated into crops through genetic manipu- 
lations, leading to increased yield, nutritional 
value, and also tolerance to biotic and abiotic 
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stresses. But, transgenic crops always had ethical 
issues in terms of biosafety especially in case of 
edible crops. Hence, safer strategies have to be 
developed for crop improvement. 

RNA interference (RNAi) technology is 
proven to be a potential alternative for crop 
improvement, probably with less biosafety issues 
as no transgene protein is expressed in transgenic 
lines (Rajam 2012). RNAi pathway mainly com- 
prises small-interfering RNAs (siRNAs) and 
microRNAs (miRNAs). Both siRNAs and miR- 
NAs are produced by the cleavage of double- 
stranded (dsRNA) precursors by a member of the 
RNase III family Dicer and Dicer-like enzyme, 
respectively (Bernstein et al. 2001; Hutvagner 
et al. 2001). Finally, the small noncoding RNAs 
(siRNAs and miRNAs) in association with RNA- 
induced silencing complex (RISC), Argonaute 
(AGO), and other effector proteins lead to gene 
silencing. By this means these small RNAs regu- 
late various aspects of plant growth and develop- 
ment. This gave biotechnologists an opportunity 
to use this easy but highly effective way to modu- 
late gene expression to get desired traits (Mittal 
et al. 2011). The present chapter therefore focuses 
on the discovery of RNAi and its mechanism of 
action and applications for crop improvement. 


31.2 RNAi: Discovery 
and Mechanism of Action 


It all began in 1990 when chalcone synthase gene 
(CHS A) encoding a key enzyme in anthocyanin 
biosynthesis pathway, introduced in Petunia hyb- 
rida L. for enhancing anthocyanin pigments, pro- 
duced white or chimeric flowers instead of dark 
purple flowers in transgenic Petunia. This sug- 
gested that the introduced transgene was func- 
tionally inactive and also’ suppressed the 
endogenous gene expression and the phenome- 
non came to be known as “cosuppression” 
(Napoli et al. 1990; Hannon 2002; Campbell and 
Choy 2005). A similar phenomenon named 
“quelling” was then discovered in the fungus 
Neurospora crassa (Romano and Macino 1992; 
Cogoni et al. 1996). Later on, the term RNAi 
came into picture for the first time when it was 


S. Yogindran and M.V. Rajam 


elucidated in the nematode, Caenorhabditis 
elegans (Fire et al. 1998). 

The process of RNAi in plants is initiated by 
21- to 24-nucleotide-long (nt), small interfering 
RNAs (siRNAs), which are generated from long 
endogenous or exogenous dsRNA molecules 
through the cleavage by a ribonuclease III-type 
enzyme called Dicer (Hamilton and Baulcombe 
1999; Zamore et al. 2000). These siRNAs (21- 
24 nt) are then incorporated into a multiprotein 
complex called RISC which contains AGO pro- 
teins (Baumberger and Baulcombe 2005; 
Vaucheret 2008). The ATP-activated RISC 
unwinds the double-stranded siRNA. The sense 
strand of the siRNA duplex is degraded by RNA 
helicase activity and the antisense strand of 
siRNA molecule is retained in the RISC complex 
(Kusaba 2004). RISC with antisense siRNA then 
targets the homologous mRNA by complemen- 
tary base-pairing and cleaves the mRNA leading 
to inhibition of protein synthesis (Bartel 2004) 
(Fig. 31.1). 

siRNAs also regulate the gene expression at 
transcriptional level by regulating the chromatin 
modification. siRNA recruits several DNA- and 
histone-modifying proteins including the cytosine 
methyltransferase CHROMOMETHYLASE3 
(CMT3) and maintains the chromatin in a minimal 
transcriptional activity state, leading to transcrip- 
tional gene silencing (TGS) (Ossowski et al. 2008; 
Fagegaltier et al. 2009; Burkhart et al. 2011). 


31.3. Applications of RNAi 
Technology in Agriculture 


Increasing world population is facing threat due 
to huge crop losses because of several factors that 
affect the proper production and distribution of 
crops. There is an urgent need to address this 
problem which can be achieved by amalgamation 
of conventional and cutting edge technologies. 
RNAi technology has emerged as one of the most 
potential and promising strategies for producing 
improved quality plants. This biological phenom- 
enon has been assessed in a number of plant 
systems and has been successfully used to silence 
the genes to get better traits. The examples listed 
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Fig.31.1 Biogenesis and mechanism of action of siRNAs 


below illustrate the possibilities of this intrinsic 
biological mechanism for commercial 
exploitation. 


31.3.1 Biotic Stress Tolerance 


Total agricultural production is greatly affected 
by biotic stresses, which include viral, bacterial, 
and fungal pathogens, insect pests, and nematode 
parasites. There is a need to address this problem 
more efficiently. We have discussed the impor- 
tance of RNAi as a promising solution to biotic 
Stress management. 


31.3.1.1 Viral Diseases 

Viruses cause major loss of plant productivity 
and transmit the disease either directly from par- 
ent to progeny or indirectly through insect vec- 
tors and hence their control becomes very 
difficult. “Pathogen-derived resistance” (PDR), 
where resistance to a determined pathogen could 
be obtained from its own genetic material, has 
been used to develop disease-resistant plants. 


This approach comprises (1) the expression of 
viral coat protein (CP) and replication-associated 
proteins (Reps) and (2) gene silencing by anti- 
sense and hpRNA (Shepherd et al. 2009). 

RNAi has revealed a way for obtaining virus- 
resistant trait in many crop plants. It was first 
reported in potato, where simultaneous expres- 
sion of both sense and antisense transcripts of the 
helper-component proteinase (HC-Pro) gene 
showed complete resistance to potato virus Y 
(PVY) (Waterhouse et al. 1998). Silencing of 
viral coat protein through RNAi is an effective 
method for generation of resistant plant that has 
been successfully reported in potato and medici- 
nally important papaya plants (Missiou et al. 
2004; Kertbundit et al. 2007). 

An important class of viruses includes gemini- 
viruses, the DNA viruses, which are responsible 
for a significant amount of crop damage. 
Expression of hairpin construct of noncoding 
intergenic region of mungbean yellow mosaic 
India virus (MYMIV) under the control of 35S 
promoter in MY MIV-infected black gram plants 
resulted in recovery from the infection which 
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lasted till senescence (Pooggin et al. 2003). Apart 
from that, when siRNAs designed to the replicase 
(Rep)-coding sequence of African cassava 
mosaic virus (ACMV) were cotransfected with 
protoplasts, it showed 99 % reduction of Rep 
transcripts and 66 % reduction of viral DNA 
(Vanitharani et al. 2003). In another report, trans- 
genic cassava lines with high levels of ACI- 
homologous small RNAs showed resistance to 
ACMYV (Vanderschuren et al. 2009). 

RNAi-mediated resistance to cassava brown 
streak disease (CBSD) in Cassava was first dem- 
onstrated by Patil et al. (2011). They observed 
resistance against very distant isolates of caus- 
ative organism cassava brown streak virus 
(CBSV) and cassava brown streak Uganda virus 
(CBSUV). 


31.3.1.2 Bacterial Diseases 

Although considered structurally simple, bacteria 
are extremely diverse from a metabolic stand- 
point and are found almost everywhere on earth 
in vast numbers—from those living in jet fuel and 
on the rims of volcanoes to those thriving in 
hydrothermal vents deep on the ocean floor. The 
first bacterial disease ever discovered was anthrax 
(caused by Bacillus anthracis) of cattle and sheep 
in 1876. The discovery of anthrax in cattle was 
immediately followed by the discovery of fire 
blight of pear and apple (caused by Erwinia amy- 
lovora) by T. J. Burrill from the University of 
Illinois (1877-1885). Escobar et al. (2001) 
showed the striking example of bacterial disease 
management where RNAi showed a remarkable 
type of gene regulation. They developed a crown 
gall disease management strategy that targets the 
process of tumorigenesis (gall formation) by in1- 
tiating RNAi of the iaaM and ipt oncogenes. 
Expression of these genes is a prerequisite for 
wild-type tumor formation. RNAi constructs, tar- 
geting iaaM and ipt gene(s) in Arabidopsis thali- 
ana and Lycopersicon esculentum, showed 
resistance to crown gall disease (Dunoyer et al. 
2007). 

Infection by Pseudomonas syringae pv. 
tomato in Arabidopsis induced the production of 
natsiRNA (nat-siRNAATGB2) which downregu- 
lates a PPRL gene that encodes a negative regula- 
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tor of the RPS2 disease-resistance pathway. As a 
result, the induction of nat-siRNAATGB2 
increases the RPS2-mediated race-specific resis- 
tance against P. syringae pv. tomato in 
Arabidopsis (Katiyar-Agarwal et al. 2007). 


31.3.1.3 Fungal Diseases 

Homology-based gene silencing induced by 
transgenes (cosuppression), antisense, or dsRNA 
has been successfully demonstrated in many 
plant pathogenic fungi, including Cladosporium 
fulvum (Hamada and Spanu 1998), Magnaporthe 
oryzae (Kadotani et al. 2003), Venturia inaequa- 
lis (Fitzgerald et al. 2004), Neurospora crassa 
(Goldoni et al. 2004), Aspergillus nidulans 
(Hammond and Keller 2005; Khatri and Rajam 
2007), Fusarium oxysporum (Singh 2011), and 
Fusarium graminearum (Nakayashiki 2005). 
The efficient method for rapid characterization of 
fungal genes using diced siRNAs has been 
reported in model filamentous fungus A. nidulans 
(Natarajaswamy et al. 2013). 

RNAi-mediated downregulation of GST (glu- 
tathione S-transferases) enzyme, which catalyzes 
a variety of reactions, resulted in significant 
Increase in resistance in Nicotiana tabacum 
against Phytophthora parasitica var. nicotianae 
(Hernandez et al. 2009). The transgenic tobacco 
plants expressing heterologous (mouse) ornithine 
decarboxylase (ODC) antisense RNA have been 
shown to have higher resistance to Verticillium 
wilt as compared to wild-type plants (Kumria 
2000). The expression of dsRNA and antisense 
transcripts specific to powdery mildew fungi 
Blumeria graminis in wheat and barley affected 
the growth of the fungus (Nowara et al. 2010). 
Virus-induced gene silencing (VIGS) has been 
used to introduce the gene fragments from the 
rust fungi Puccinia striiformis f. sp. tritici or P. 
graminis f. sp. tritici to plant cells leading to 
reduced expression of the corresponding genes in 
the rust fungus (Yin et al. 2011). 

In cereals, barley stripe mosaic virus (BSMV)- 
induced RNAi has emerged as a useful tool for 
loss of function studies. Three genes, a MAP 
kinase (PtMAPK1), a cyclophilin (PtCYC1), and 
calcineurin B (PtCNB), predicted to be involved 
in pathogenicity, have been targeted by 
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BSMV-mediated host-induced gene silencing 
(HIGS) in the wheat leaf rust fungus Puccinia tri- 
ticina (Pt). BSMV RNAi constructs were then 
inoculated in the wheat plant leaves. Subsequent 
Pt inoculation resulted in a suppressed disease 
phenotype and reduced endogenous transcript 
levels of the targeted fungal genes indicating 
translocation of siRNA molecules from host to 
fungal cells (Panwar et al. 2013b). The same 
group has shown the use of Agrobacterium tume- 
faciens-mediated in planta induced transient 
gene silencing (PITGS) assay for use in Triticum 
spp. (wheat). Agroinfiltration effectively deliv- 
ered hairpin silencing constructs of the three 
genes mentioned above in wheat, leading to the 
generation of fungal gene-specific siRNA mole- 
cules in infiltrated leaves and resulting in up to 
70 % reduction in transcription of the endoge- 
nous target genes (Panwar et al. 2013a). 

Fusarium wilt, caused by Fusarium oxyspo- 
rum f. sp. cubense (Foc), is among the most 
destructive diseases of banana (Musa _ spp.). 
RNAi-mediated knockdown of vital genes of 
fungus (velvet and Fusarium transcription factor 
1) has shown effective resistance against Foc. 
Transformed banana lines were found to be free 
of external and internal symptoms of Foc after 
6-week-long greenhouse bioassays. The five 
selected transgenic lines for each construct 
showed resistance to Foc for 8 months postinocu- 
lation (Ghag et al. 2014). 


31.3.1.4 Insect Attack 

Insects can cause considerable damage to crop 
plants. By defoliating plants or sucking out their 
sap, insects slow down their growth by weaken- 
ing and sometimes killing them. The use of RNA1 
for the insect control seems to be the consequence 
of success of Cry toxins from Bt as an insecti- 
cide. The sensation of RNAi in insect control 
began when two exciting reports came into scien- 
tific community. Mao et al. (2007) targeted cot- 
ton bollworm gut-specific cytochrome P450 gene 
CYP6AE1/4, which confers resistance to gossy- 
pol, a polyphenol compound of cotton plants. 
Cotton bollworm larvae when fed on transgenic 
tobacco and Arabidopsis plants that expressed 
the CYP6AE/4-specific dsRNA — showed 
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sensitivity to gossypol in artificial diets. Whereas, 
Baum et al. (2007) fed Western corn rootworm 
larvae (WCR, Diabrotica virgifera) 290 different 
dsRNAs and observed that 14 of them caused sig- 
nificant mortalities at doses <5.2 ng/cm”. 
Transgenic corn expressing dsRNA specific to 
the gene encoding the A subunit of the V-type 
ATPase proton pump showed significant reduc- 
tion in WCR-feeding damage. Both reports 
showed that RNAi pathway can be exploited to 
control insect pests via in planta expression of 
dsRNA against well-chosen target genes of 
insects (Baum et al. 2007; Mao et al. 2007). 
v-ATPaseA gene has also been shown to be a 
potent target to control the whitefly population. 
Plant-mediated pest resistance was achieved 
against whiteflies by genetic transformation of 
tobacco expressing siRNAs against the whitefly 
v-ATPaseA gene. The transcript level of v-ATPa- 
seA in whiteflies was reduced up to 62 % after 
feeding on the transgenic plants, leading to their 
mortality (Thakur et al. 2014). 

Thereafter, a number of reports suggested the 
success of this technology for the control of 
insect pests. Transgenic tobacco plants express- 
ing dsRNA against EcR-USP (ecdysone receptor- 
ultraspiracle particle) (Zhu et al. 2012), AChE 
(acetylcholinesterase) (Kumar 2011), and HR3 
(Xiong et al. 2013) involved in regulation of 
molting and development in H. armigera resulted 
into resistant plants, with larvae fed upon them 
showing developmental deformities and lethality. 
3-Hydroxy-3-methylglutaryl coenzyme A reduc- 
tase (HMG-CoA reductase, HMGR) gene, a key 
enzyme in the mevalonate pathway in insects, has 
also been shown to be a potential target for insect 
control using RNAi (Wang et al. 2013). 

Apart from its application in generating resis- 
tant plants, RNAi has been extensively used for 
functional studies in insects. The feeding of 
dsRNA and siRNA solutions for knockdown of 
target pest genes has been successfully shown in 
Apolygus lucorum (Zhou et al. 2014), Nilaparvata 
lugens (Chen et al. 2010), Bemisia tabaci 
(Upadhyay et al. 2011), and Helicoverpa armig- 
era (Kumar et al. 2009) which led to a strong 
decline in the expression of the target gene and 
can be used to explore gene functions. Recently, 
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oral delivery of dsRNA molecules to Spodoptera 
littoralis against a gene highly similar to P/02 of 
Heliothis virescens strongly suppressed the 
encapsulation and melanization response, sug- 
gesting that the protein is functionally conserved 
and plays role in insect immunity (Lelio et al. 
2014). 


31.3.1.5 Nematode Attack 

A number of genera and species of nematodes are 
highly damaging to a great range of hosts, includ- 
ing foliage plants, agronomic and vegetable 
crops, fruit and nut trees, turf grass, and forest 
trees. Some of the most damaging nematodes are 
root knot (Meloidogyne spp.), cyst (Heterodera 
and Globodera spp.), root lesion (Pratylenchus 
spp.), spiral (Helicotylenchus spp.), burrowing 
(Radopholus similis), bulb and stem (Ditylenchus 
dipsaci), reniform (Rotylenchulus reniformis), 
dagger (Xiphinema spp.), and bud and _ leaf 
(Aphelenchoides spp.) (Tamilarasan and Rajam 
2013). Oral delivery of dsRNA was first demon- 
strated in C. elegans (Fire et al. 1998). 

Plant-mediated RNAi in plant parasitic nema- 
todes through dsRNA targeting has been shown 
in RKN which resulted in effective resistance 
(Yadav et al. 2006; Huang et al. 2006). The host- 
induced RNAi in Arabidopsis thaliana, which is 
a host for the sugar beet cyst nematode Heterodera 
schachtii, led to reduction in the number of 
mature nematode females. Although no complete 
resistance was observed, the reduction of devel- 
oping females ranged from 23 to 64 % in differ- 
ent RNAi lines (Sindhu et al. 2009). Fewer 
females (reduced by 81—93 %) were observed on 
the transgenic roots obtained by hairy root cul- 
tures engineered to silence either two of ribo- 
somal proteins, a spliceosomal protein or 
synaptobrevin, of H. glycines by RNAi (Klink 
et al. 2009). Similarly, high reduction in egg pro- 
duction was achieved by targeting mRNA splic- 
ing factor prp-17 or an uncharacterized gene 
cpn-1 (Li et al. 2010). 

The success reports of RNAi for controlling 
the nematode infection also include the following 
research outcomes. Targeted silencing of con- 
served region of M. incognita gene acetylcholin- 
esterase (AChE) involved in neurotransmission 
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and also in many cellular processes, through 
host-derived RNAj, resulted in reduced fecun- 
dity. The enhanced resistance to nematode infec- 
tion displayed by different lines strongly suggests 
their utilization in nematode control (Tamilarasan 
2013). A recent study also demonstrated that 
MiASB (M. incognita mitochondrial ATP syn- 
thase b subunit) silencing had a positive effect on 
the control of root-knot nematodes, and the gene 
may be associated with the formation of galls 
caused by the nematode (Huang et al. 2013). 


31.3.2 Abiotic Stress Tolerance 


Plants are constantly affected by abiotic factors 
such as high salinity, flood, drought, heavy 
metal, and variable temperatures which consid- 
erably reduce the productivity. Functional 
genomics studies have come up with novel genes 
involved in stress adaptation in plants, which can 
be manipulated to get tolerance (Pardo 2010). 
Poly (ADP-ribose) polymerase (PARP) is 
induced by stress in animals and is responsible 
for energy depletion. The enzyme PARP! and to 
a lesser extent PARP2 are primarily responsi- 
ble for stress-induced poly (ADP-ribosyl)ation 
activity. Upon induction, polymers of ADP- 
ribose are synthesized by a range of nuclear 
enzymes using NAD* as substrate, while over- 
production of PARP leads to a rapid breakdown 
of the NAD* pool. ATP molecules are required 
for resynthesis of NAD*, and as a consequence 
the cellular ATP is depleted which leads to 
necrotic cell death (De Block et al. 2005). Since 
PARPI1 and PARP2 homologues are found in 
plants, PARP was targeted by RNAi for abiotic 
stress tolerance in plants (De Block et al. 2005; 
Vanderauwera et al. 2007). 

RNAi-mediated downregulation of RACK], 
receptor for activated C-kinase 1, which is a 
highly conserved scaffold protein with flexible 
functions, plays important roles in plant growth 
and development indicating its possible role in 
drought stress response in rice compared to non- 
transgenic plants (Da-Hong et al. 2009). 

Similarly, disruption of a rice farnesyltransfer- 
ase/squalene synthase (SQS) by maize squalene 
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synthase through RNAi improved drought 
tolerance at both the vegetative and reproductive 
stages (Manavalan et al. 2012). Expression of 
OsTZF1, a member of the CCCH-type zinc fin- 
ger gene family in rice, was induced by drought, 
high-salt stress, and hydrogen peroxide. OsTZF1- 
RNAi plants were susceptible to abiotic stress 
demonstrating that OsTZF1 positively regulates 
high-salt and drought stress tolerance in rice 
plants (Jan et al. 2013). 


31.3.3 Development of Male Sterile 
Plants 


A hybrid production system is based on the 
mechanism for inducing male sterility in one of 
the parental lines so as to ensure purity of the 
resultant hybrid seed. Male sterility trait has 
been an important aspect in agriculture to 
improve the crop productivity by the hybridiza- 
tion process (Duvick 1999). Moreover, its value 
in gene containment of the genetically modified 
crops has increased its importance (Moon et al. 
2010). Natural male sterile mutants, cytoplas- 
mic male sterile (CMS) mutants, and nuclear 
male sterile (NMS) mutants have been used for 
the hybrid seed production (Duvick 2005; Wang 
et al. 2005). 

TA29, an anther-specific gene, is expressed 
exclusively in anthers at the time of microspore 
development. Downregulation of TA29 of 
tobacco (N. tabacum cy. Samsun) by RNAi pro- 
duced male sterile lines (Nawaz-ul-Rehman et al. 
2007). Nucleases are enzymes playing vital role 
in nucleic acid metabolism. Rice transgenic 
plants expressing hairpin RNA for OsGEN-L 
(OsGEN-like) gene, a new member of the RAD2/ 
XPG nuclease family, exhibited low fertility and 
were male sterile (Moritoh et al. 2005). Silencing 
a male-specific gene, Bcp/, in the model host A. 
thaliana resulted in male sterile lines. Bcp/, an 
anther-specific gene, is active in both diploid 
tapetum and haploid microspores. Transgenic 
plants were phenotypically indistinguishable 
from nontransgenic plants, and by crossing with 
nontransgenic fertile pollens, successful seed set 
was observed (Tehseen et al. 2010). 
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S-Adenosylmethionine decarboxylase 
(SAMDC), a key gene involved in polyamine bio- 
synthesis, when targeted in tapetal tissue of 
tomato under the control of tapetal-specific A9 
promoter using RNAi, resulted in male sterile 
lines. These transgenics had sterile pollen and 
failed to set fruits, but female fertility was unaf- 
fected as cross-pollination resulted in fruit setting 
(Sinha and Rajam 2013). 


31.3.4 Nutritional Improvement 


Plants provide most of the nutrients required in 
the human diet, although the major staple crops 
are often deficient in some of these nutrients. 
RNAi technology has also been used in several 
plants to improve their nutritional quality. A 
dominant high-lysine maize variant was _ pro- 
duced by knocking out the expression of the 
22-kDa maize zein storage protein, a protein that 
is poor in lysine content (Segal et al. 2003). A 
recessive lysine-rich mutant called opaque 2 (O2) 
has been obtained by traditional breeding. The 
O2 gene encodes a maize basic leucine zipper 
transcriptional factor that controls the expression 
of a subset of storage proteins, including the 
22-kDa zein storage protein. Opaque 2 mutant 
was lysine-rich but showed poor seed quality and 
yield. Downregulation of lysine-poor zein gene 
via RNAi generated normal and quality seeds 
with high levels of lysine without altering the 
general functions of O2 (Angaji et al. 2010). 
Fatty acid composition of cotton seed oil was 
manipulated by hbpRNA-mediated gene silencing 
of two fatty acid desaturase genes, stearoyl-acyl- 
carrier protein D9-desaturase and oleoylphos- 
phatidylcholine u6-desaturase. Downregulation 
of one gene substantially elevated stearic acid 
level from 2 to 3 % up to as high as 40 %, and 
silencing of the other gene enhanced oleic acid 
content, up to 77 % compared with about 15 % in 
seeds of untransformed plants (Liu et al. 2002). 
RNAi technology was used to enhance 
B-carotene content in potato by silencing the 
B-carotene hydroxylase gene (BCH), which con- 
verts B-carotene to zeaxanthin. RNAi constructs 
having the tuber-specific granule-bound starch 
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synthase (GBSS) promoter and the other 
containing the strong constitutive cauliflower 
mosaic virus 35S (CaMV 35S) promoter were 
introduced into potato by Agrobacterium- 
mediated transformation. The transformants 
derived from the GBSS construct contained more 
B-carotene than CaMV 35S transformants. These 
results showed that BCH silencing can increase 
the content of carotenoids, B-carotene, and lutein 
in potato which will provide a tool for combating 
the incidence of vitamin A deficiency in popula- 
tions (Eck et al. 2007). Silencing of 
DE-ETIOLATED1 (DET!) in Brassica napus 
resulted in seeds with increased levels of lutein, 
B-carotene, and zeaxanthin relative to nontrans- 
genic seeds (Wei et al. 2009). DET/ suppression 
also led to reduced levels of sinapate esters 
responsible for bitter taste, poor meal palatabil- 
ity, and unpleasant flavor to the meat and milk of 
animals fed on a B. napus seed meal diet. 

Tomatoes are a principal dietary source of 
carotenoids and flavonoids, both of which are 
highly beneficial for human health. 
Overexpression of genes encoding biosynthetic 
enzymes or transcription factors has resulted in 
tomatoes with improved carotenoid or flavonoid 
content, but never with both. Increased nutri- 
tional value was obtained by suppressing an 
endogenous photomorphogenesis regulatory 
gene, DET], using fruit-specific promoters com- 
bined with RNA interference (RNAj1) technology. 
Both carotenoid and flavonoid contents were 
increased significantly, whereas other parameters 
of fruit quality were largely unchanged (Davuluri 
et al. 2005). 

Starch, a major plant carbohydrate, is com- 
posed of amylase and amylopectin. Amylose 
molecules tend to efficiently form digestion- 
resistant complexes when the cooked food under- 
goes the process of cooling (Crowe et al. 2000). 
To increase the amylase content in wheat, RNAi 
constructs designed to silence the genes encoding 
the two starch-branching isozymes of amylopec- 
tin synthesis were expressed under a seed-specific 
promoter which resulted in increased grain amy- 
lase content to over 70 % of total starch content 
(Regina et al. 2006; Tang et al. 2007). RNAj con- 
structs have been used in Zea mays and A. thaliana 
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to modify the levels of phosphate metabolism 
involved in leaf starch degradation. Phosphate 
manipulation led to increase in starch content 
(Weise et al. 2012). 


31.3.5 Flower Color Modification 


Floriculture, or flower farming, is concerned with 
the cultivation of flowering and ornamental plants 
for gardens and for floristry. Flower color modifi- 
cation is one of the most desirable traits in floral 
industry. RNAi can be used as a tool to silence 
the pigment synthesis genes, which can lead to 
different flower color patterns. CHI (chalcone 
isomerase) gene silencing in tobacco by RNAI 
showed decreased pigmentation and change of 
flavonoid components in flower petals. Plants 
showed yellow coloration due to accumulation of 
high levels of chalcone in pollens (Nishihara 
et al. 2005). 

Nakatsuka et al. (2008) performed RNAI- 
mediated suppression of three anthocyanin bio- 
synthetic genes—chalcone synthase (CHS), 
anthocyanidin synthase (ANS), and flavonoid 
3'5’'-hydroxylase (F3'5'H)—in gentian plant. In 
CHS suppressed transgenics, petals exhibited 
pure white to pale-blue color, whereas in ANS 
suppressed transgenics, petals were only pale- 
blue. Suppression of the F'3’5'H gene decreased 
delphinidin derivatives and increased cyanidin 
derivatives and led to magenta flower colors. The 
same group demonstrated that RNAi-mediated 
downregulation of anthocyanin 5, 3’-aromatic 
acyltransferase (5/3'AT) and _ flavonoid 
3’,5'-hydroxylase (F3’5’H) activities in gentian 
plant produced modified flower color (Nakatsuka 
et al. 2010). 

The flower color of Torenia hybrid, an impor- 
tant garden plant, was successfully modulated by 
RNAi. Downregulation of chalcone synthase (CHS) 
gene by using each of the coding region and the 
3’-untranslated region of the CHS mRNA as an 
RNAi target led to modulation of flower color 
from blue to white and pale (Fukusaki et al. 2004). 

Roses are the most important cut flower 
commercially and have played a major role in 
human culture from ancient time. RNAi-mediated 
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silencing of the cyanidin genes in rose and 
introduction of delphinidin genes produced flowers 
that accumulated delphinidin-based anthocyanins 
exclusively with a concomitant color change toward 
blue (Tanaka et al. 2009; Katsumoto et al. 2007). 


31.3.6 Secondary Metabolite 
Manipulation 


Plant secondary metabolites are economically 
important as drugs, fragrances, pigments, food 
additives, and pesticides. Secondary metabolite 
production, however, sometimes is blocked by 
undesirable compounds, which can be suppressed 
by RNAi. The versatility of RNAi for controlling 
multigenes responsible for metabolite production 
has been well recognized as an effective strategy 
(Borgio 2009). 

The first commercially useful cultivar pro- 
duced by RNAi was the rice mutant line LGC-1 
(low glutelin content-1), thus making it useful for 
patients who must restrict protein intake such as 
kidney disease patients (Mochizuki and Hara 
2000). LGC-1 and some cultivars developed 
using LGC-1 as across parent are beginning to be 
used for this type of diet therapy (Kusaba et al. 
2003). This dominant mutation produced hpRNA 
from an inverted repeat for glutelin, the gene for 
the major storage protein glutelin, leading to 
lower glutelin content in the rice through RNAI. 
Gil-Humanes et al. (2008) used RNAi technol- 
ogy to silence the expression of specific y-gliadins 
and demonstrated the feasibility of systemati- 
cally silencing specific groups of gluten proteins 
without affecting fertility, grain morphology, and 
seed weight when compared to the control lines. 

RNA-mediated suppression of tryptamine 
biosynthesis in Catharanthus roseus during hairy 
root culture eliminated the production of mono- 
terpene indole alkaloids (a class of natural prod- 
ucts derived from two starting substrates), 
tryptamine and secologanin. To utilize this chem- 
ically silent background, they introduced an 
unnatural tryptamine analog into the production 
media and demonstrated that the silenced C. 
roseus culture could produce a variety of novel 
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products derived from this unnatural starting 
substrate (Runguphan et al. 2009). 

Transformation of Papaver somniferum with 
RNAi construct designed to reduce the levels of 
the gene encoding the morphine biosynthetic 
enzyme  salutaridinol 7-O-acetyltransferase 
(SalAT) led to the accumulation of the intermedi- 
ate compounds, salutaridine and salutaridinol, in 
a ratio ranging from 2:1 to 56:1 (Kempe et al. 
2009). California poppy (Eschscholzia califor- 
nica) cells were transformed with RNAi con- 
struct harboring berberine bridge enzyme (BBE) 
gene to suppress the activity of the enzyme and 
resulted into reticuline accumulation at a maxi- 
mum level (Fujii et al. 2007). The artemisinin 
content of transgenic Artemisia annua L plants 
was significantly increased by 3.14-fold as com- 
pared to untransformed control plants by sup- 
pressing the expression of SQS (squalene 
synthase), a key enzyme of sterol pathway, by 
means of a hairpin RNA (Zhang et al. 2009). 

Over 10 % of the coffee on the world market 
is shared by decaffeinate coffee (DECAF). 
Caffeine is a stimulant of the central nervous sys- 
tem, the heart muscle, and the respiratory system 
and has a diuretic effect. Its adverse side effects 
include insomnia, restlessness, and palpitations. 
Modulation of caffeine biosynthesis in planta 
was done by suppression of CaMXMTI 
(7-N-methylxanthine methyltransferase or theo- 
bromine synthase) by the double-stranded RNA 
method. The caffeine content of transgenic plants 
was reduced by up to 70 %, indicating that it is 
possible to produce decaffeinated coffee beans 
using RNAi (Ogita et al. 2003, 2004). 

Cotton is a major cash crop which produces 
fibers and oil. The cotton seeds that remain after 
fiber extraction could be extensively used as 
sources of protein and calories, but they are 
largely underutilized because they contain a toxic 
gossypol terpenoid. Gossypol is also produced in 
vegetative cotton tissues where it protects cotton 
plants from insects and other pathogens. 
Transgenic cotton plants expressing RNAi con- 
struct of the d-cadinene synthase gene of gossy- 
pol synthesis fused to a seed-specific promoter 
caused seed-specific reduction of this metabolite, 
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while its content in nonseed tissues was 
comparable to the control plants (Sunilkumar 
et al. 2006). 

Cassava is a major staple food in tropical 
countries but contains unnecessary glucosides. 
Jgrgensen et al. (2005) used RNAi to prevent pro- 
duction of the cytochrome P450 enzyme that 
makes the first committed step in the biosynthesis 
of linamarin and lotaustralin, and generated 
transgenic cassava (Manihot esculenta) plants 
with elimination of cyanogenic glucosides in the 
leaves (<1 % of nontransgenic amounts) and a 
92 % reduction of cyanogenic glucoside amount 
in tubers. 


31.3.7 Enhanced Fruit Shelf Life 


Fruit ripening has received considerable attention 
because of the dramatic changes in the metabolic 
processes that take place before and after this 
event, as well as due to its commercial impor- 
tance. Fruits are an important dietary supplement. 
The quality of fruit is determined by a wide range 
of desirable characteristics such as nutritional 
value, flavor, processing qualities, and shelf life. 
The massive losses accrue during transportation 
and post-harvest handling of the fruit which run 
into billions of dollars worldwide. Therefore, 
there is a need to increase the shelf life of fruits 
so as to minimize the agronomic loss. 

Ethylene, unlike the rest of the plant hormone 
compounds, is a gaseous hormone inducing sev- 
eral responses during ripening through a signal- 
ing cascade (Crocker et al. 1935). The shelf life 
of tomato has been increased by targeting the 
genes coding for ethylene biosynthesis pathway. 
The dsRNA of tomato ACC oxidase expression 
cassette was successfully introduced into tomato 
cultivar Hezuo 906 under the control of cauli- 
flower mosaic virus 35S promoter by A. 
tumefaciens-mediated transformation. 
Transgenic plants produced had fruits having 
traces of ethylene and had a prolonged shelf life 
of more than 120 days with similar levels of total 
soluble sugar, titratable acid, amino acids, and 
total soluble solids as the control plants (Xiong 
et al. 2005). Similarly, delayed ripening tomatoes 
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were generated by silencing three homologues of 
1-aminocyclopropane-1-carboxylate (ACC) syn- 
thase (ACS) gene, catalyzing the rate-limiting 
step in ethylene biosynthesis during the course of 
ripening, using RNAi technology. The chimeric 
RNAi-ACS construct designed to target ACS 
homologues effectively repressed the ethylene 
production in tomato fruits. Fruits from such 
lines exhibited delayed ripening and extended 
shelf life for 45 days, with improved juice quality 
(Gupta et al. 2013). 

Recently, SISGR1 (encoding a STAYGREEN 
protein that plays a critical role in the regulation 
of chlorophyll degradation in tomato leaves and 
fruits)-repressed lines reduced H,O, levels and 
inhibited ethylene signal transduction during 
fruit ripening, promoting the retention of firm- 
ness and sustained cell membrane integrity and 
resulting in delayed fruit senescence during stor- 
age and an enhanced shelf life from 25 to 
45-58 days when harvested at the breaker (Br) 
stage and stored at room temperature (Luo et al. 
2013). 


31.3.8 Pros and Cons of RNAi 
Technology 


RNA silencing has emerged as an area of thor- 
ough investigations leading to new discoveries. 
RNAi-mediated gene silencing is a valuable tech- 
nology for the development of transgenic crop 
plants, with a focus on nutritional enrichment and 
plant protection from bacteria, nematodes, fungi, 
and insects pests, which are two major hurdles in 
production and productivity of agriculture crops. 
RNAi strategy has certain advantages over other 
approaches. For instance, the silencing is 
sequence-specific and more than one gene can be 
targeted. Additionally, the extent of the gene 
silencing can be controlled, so that the essential 
genes will only be silenced at desired stage and 
tissue. As there is no transgene protein expres- 
sion in RNAi approach, there would not be any 
extra metabolic load on the transgenic plants. 
Further, in the absence of transgene protein, 
there is less likelihood of development of resis- 
tance by the target pest or pathogen, and RNAi 
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plants would pose minimal biosafety issues 
(Rajam 2011). 

However, there are also some limitations to 
RNAi technology. Although it is a method of 
sequence-specific targeting, there may be issues 
of off-target effects leading to undesirable traits. 
There does exist a concern that inadvertent sec- 
ondary effects could be generated by using non- 
coding small RNA-mediated gene silencing, 
especially when this approach is used to engineer 
broad spectrum resistance into plants against 
pathogens/pests. Delivery methods for the 
dsRNA are a limiting step for a number of spe- 
cies for which RNAi-based approaches cannot be 
used easily. There still remain many significant 
challenges in development and commercializa- 
tion of GM crops utilizing RNAji-based technol- 
ogy. Not only tremendous efforts are required for 
achieving scientific breakthroughs but also pro- 
motion of public acceptance of GM crops among 
other complicated ethical issues has to be taken 
care of. 

We, hence, conclude that agricultural biotech- 
nology, including RNAi technology, would serve 
as one of the most important measures for crop 
improvement, which will contribute to agricul- 
ture productivity to a great extent. 
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Abstract 


Historical Perspective 


MicroRNAs (miRNAs) are small 21-nt-long noncoding RNAs, involved 
in posttranscriptional regulation of mRNA either by direct cleavage or 
translational repression. They are mainly transcribed as independent tran- 
scriptional units and their expression differs during different stages of 
plant development. Plant miRNAs have been well studied during develop- 
mental stages and different environmental stress conditions, but their func- 
tions and regulatory networks have yet to be understood. Understanding 
their regulatory roles may open new avenues for crop improvement which 
is necessary for increased food production to feed the growing population. 
MicroRNA interference (miRNAj1) technology provides an efficient plat- 
form for functional studies and agricultural applications. 
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MicroRNA-induced gene silencing (MIGS) * miRNA interference 
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enzymatic functions but also regulates gene 
expression. This transformation has been carried 


During the last decade, the “RNA world” has 
been transformed into an “RNA _ molecular 
world” where RNA is not only restricted to the 
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out along with many fronts, but one of the most 
significant advances has been the discovery of 
small noncoding RNAs. Small interfering RNA 
(sIRNA) and microRNAs (miRNAs/miRs) are 
the two major classes of plant small noncoding 
RNAs. siRNAs are formed from dsRNA while 
miRNAs originate from single-stranded RNAs 
transcribed from miRNA loci (Voinnet 2009). 
Plants have highly conserved and more recently 
evolved species-specific miRNAs to control a 
vast array of biological processes. 
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The first miRNA /in-4 was discovered in 1993 
by Victor Ambros and his colleagues in 
Caenorhabditis elegans that has been shown to 
repress the protein levels of lin-14, a gene that 
functions in the same developmental pathway. 
The lin-4 gene does not encode any protein but 
instead makes a pair of small RNAs. One RNA is 
about 21-nt long and the other is approximately 
60-nt long. The longer RNA which folds into a 
stem-loop structure was predicted to be the pre- 
cursor of the 21-mer RNA. This shorter lin-4 
RNA was actually the founding member of the 
abundantly available class of small regulatory 
RNAs known today as miRNA. The Jin-4 RNA 
bound with partial antisense complementary 
sequences found in the 3’-untranslated region 
(3'-UTR) of lin-14 mRNA and repressed its 
translation without reducing the level of lin-/4 
transcript (Lee et al. 1993). After 7 years of dis- 
covery of lin-4, another miRNA called let-7 was 
discovered in 2000 in the same organism 
(Reinhart et al. 2000). miRNA let-7 regulates the 
expression of gene /in-4] which is required for 
the transition from late larval to adult cell fate. 
The /et-7 miRNA was found in almost all animal 
phyla, establishing biological importance of 
miRNAs. 

Later on, several investigators have cloned 
miRNAs from various organisms. Cloning of 
small RNAs also revealed their ubiquitous pres- 
ence in plants. The first plant miRNA was discov- 
ered in Arabidopsis thaliana in 2002. Llave et al. 
(2002) cloned a large set of miRNAs of predomi- 
nantly 21- to 24-nt length from Arabidopsis. 
Among the 125 sequences identified, about 90 % 
originated from the intergenic regions and a few 
others from the protein-coding genes. The accu- 
mulation of some of these miRNAs was develop- 
mentally regulated. A set of miRNAs surprisingly 
mapped in the transposable elements of the 
Arabidopsis genome (Mette et al. 2002). 
Differential expression pattern of 16 Arabidopsis 
miRNAs was observed, of which 8 were con- 
served in the rice genome. CARPEL FACTORY, 
an Arabidopsis Dicer (RNase III family enzyme) 
homologue, when mutated, prevented the miRNA 
accumulation. This observation led to the conclu- 
sion that a similar mechanism also exists in plants 
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that direct miRNA processing as in animals (Park 
et al. 2002; Reinhart et al. 2002). 

Despite some similarities between the plant 
and animal miRNAs, the divergences between 
these two groups were more noticeable. It 
includes the sizes of the mature miRNAs, gene 
structure, 5’ nucleotide preferences, and most 
significantly absence of any genetic conservation 
between the two groups. As the mechanistic stud- 
ies emerged strongly, it became clear that these 
two groups differ significantly in the modes of 
their biogenesis and functions. Taken all together, 
it appears that these two groups had dual origins 
in their evolutionary lineages. 


32.2 Biogenesis and Activity 
of MicroRNAs 


miRNAs are transcribed from noncoding nuclear 
genes, a majority of which are located in inter- 
genic and intragenic (intron) regions, and very 
few of them are located in 5’-UTR or 3’-UTR 
regions. Plant miRNAs are generally transcribed 
by RNA Pol II which produces capped and poly- 
adenylated long primary miRNA transcripts (pri- 
miRNAs), which are then processed into 
precursor miRNA (pre-miRNA) of about 80—500 
nucleotides in length, by an enzyme, DICER- 
LIKE 1 (DCL1), with RNase III activity (Kurihara 
et al. 2006). A part of the pri-miRNA folds to 
form a stem-loop structure which is cropped by 
the DCL1 complexes. A general feature of the 
pri-miRNA is shown in Fig. 32.1. DCLI- 
mediated processing occurs mostly at the lower 
stem or base, while in a few cases, the terminal 
loop controls the processing. With long-stem or 
terminal loop-branched pri-miRNAs, the pro- 
cessing could be bidirectional, 1.e., from base to 
loop and loop to base, resulting in productive and 
abortive processing of miRNAs, respectively. 
DCL1 complexes generally cut pri-miRNAs at a 
distance of 16-17 bp from the single-strand— 
double-strand junction (Zhu et al. 2013). Pri- to 
pre-miRNA conversion also requires the double- 
stranded RNA  (dsRNA)-binding _ protein 
HYPONASTIC LEAVES! (HYLI1) and _ the 
C2H2 zinc finger protein SERRATE (SE) along 
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16-17 bp 





Base 


Fig.32.1 General features of primary miRNA (pri-miRNA) 


with other factors like CBC, DDL, and TOUGH 
(Yang et al. 2006a; Parry et al. 2007; Yu et al. 
2008; Ren et al. 2012). These proteins are located 
in nuclear processing centers called D-bodies or 
SmD3-/SmB-bodies (Kurihara et al. 2006; Fang 
and Spector 2007). 

Pre-miRNA is unstable in nucleus and pro- 
cessed into miRNA—miRNA* duplex by DCLI 
and stabilized by S-adenosyl methionine- 
dependent methyltransferase HUA ENHANCER 
1 (HEN1), which methylates all plant-silencing 
small RNAs. Methyl groups deposited on the 3’ 
terminal nucleotides of each strand prevent their 
uridylation and subsequent degradation by the 
SMALL RNA DEGRADING NUCLEASE 
(SDN) class of exonucleases (Li et al. 2005; Yang 
et al. 2006b). The modified miRNA—miRNA* is 
exported into cytoplasm by HASTY (an ortholog 
of exportin-5 protein) and processed into mature 
miRNA (Park et al. 2005). The miRNA- 
programmed silencing complex is often referred 
to as RNA-induced silencing complex (RISC) 
(Hammond et al. 2000; Llave et al. 2002; Tang 
et al. 2003) (Fig. 32.2). Cloning and expression 
data indicate that the miRNA strand of this 
duplex, also known as guide strand, accumulates 
to much higher levels in vivo than does the 
miRNA* or the passenger strand (Reinhart et al. 
2002; Lim et al. 2003). This asymmetric accumu- 
lation is accomplished by preferential loading of 
the miRNA strand into the silencing complex, 
where it is protected from degradation, whereas 
the miRNA* strand is preferentially excluded 
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from the silencing complex and is degraded. 
Bioinformatics and biochemical studies of func- 
tional siRNA duplexes gave insights into the 
asymmetry of RISC loading. The siRNA duplex 
strand with less stable 5’ end pairing is selec- 
tively loaded into RISC, where it guides silenc- 
ing, whereas the strand with the more stable 5’ 
end pairing is excluded from RISC. Most 
miRNA—miRNA* duplexes also appear to have 
energetic asymmetry; the 5’ ends of most miR- 
NAs are less stably paired than are the 5’ ends of 
the corresponding miRNA*s (Khvorova et al. 
2003; Schwarz et al. 2003). Hence, the final prod- 
uct of the miRNA biogenesis pathway is a small 
single-stranded RNA incorporated into a silenc- 
ing complex (Fig. 32.2). 

A central component of the silencing complex 
is a member of the Argonaute protein family. 
Argonaute proteins contain two _ conserved 
regions, the PAZ and PIWI domains (Carmell 
et al. 2002). The PAZ domain appears to be an 
RNA-binding domain (Lingel et al. 2003; Song 
et al. 2003; Yan et al. 2003), and the PIWI 
domain is structurally and functionally similar to 
RNase H enzymes (Liu et al. 2004; Song et al. 
2004). Many organisms contain multiple mem- 
bers of the Argonaute family; in some cases, 
there is evidence for functional diversification 
of the different Argonautes. AGOI 1s the only 
Argonaute gene known to be required for 
miRNA function in Arabidopsis. Arabidopsis 
AGO1 binds miRNAs and catalyzes target cleav- 
age in vitro (Baumberger and Baulcombe 2005; 
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Fig.32.2 Biogenesis of 
plant miRNA: the primary 
miRNA (pri-miRNA) 
transcript is transcribed 
from the microRNA gene 
by RNA polymerase II (Pol 
II). Pri- to pre-miRNA 
conversion requires many 
proteins like DCL1, HYLI, 
SE, HEN1, and others 
(TOUGH, DDL, and 
CBC). HASTY is involved 
in miRNA transport from 
nucleus to cytoplasm. SDN 
degrades the passenger 


miRNA strand and AGO1 
is responsible for mRNA 
cleavage 
NUCLEUS 
CYTOPLASM 
SDN J 


Qi et al. 2005), and ago/ mutants have elevated 
levels of miRNA targets in vivo (Vaucheret 
et al. 2004). 

The cleavage of target mRNA occurs at the 
10th or 11th nucleotide from the 5’ end of the 
miRNA, regardless of the miRNA length, and 
requires the base pairing of the 5’ end of the 
miRNA with the target mRNA (Palatnik et al. 
2003; Xie et al. 2003; Floyd and Bowman 2004; 
Jones-Rhoades and Bartel 2004; Mallory et al. 
2004). The resulting cleavage products have 3’ 
hydroxyl and 5’ phosphate groups, similar to the 
products of other enzymes with “slicer” activity, 
such as RNase H (Martinez and Tuschl 2004; 
Schwarz et al. 2004; Song et al. 2004). 

The plant miRNAs are mostly rich in U at the 
5’ end and are consequently sorted in the AGOI- 
mediated RISC complexes (Voinnet 2009). A few 
Arabidopsis miRNAs have U to A mutation at the 
5’ end which are sorted in AGO2 complexes 
in vivo leading to loss of their silencing potential 
(Mi et al. 2008; Vazquez et al. 2008). However, a 
bioinformatics analysis of the next-generation 
sequencing (NGS) data revealed that there is a 
wide range of variations in the occurrence of U at 
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the 5’ position in monocots (45-85 %) and dicots 
(45-70 %). In potato, U and A dominate almost 
equally at the 5’ position. The effect of differen- 
tial sorting in silencing has not been widely 
reported in plants other than Arabidopsis and 
rice. The 5’ A-variant of 24-mer miRNAs are 
expected to sort in the AGO4 complex which 
causes heterochromatinization (Takeda et al. 
2008). 

The NGS data on plant miRNAs also show 
that a small percent of many miRNAs undergo 
modifications both at the 5’ and 3’ ends with 3’ 
end modifications being more common (Ameres 
and Zamore 2013). These modifications are 
either additions or deletions but its biochemical 
causes are still uncertain. These modifications 
arise either due to faulty processing of DCL1 or 
could be the result of post-maturation processes 
of miRNAs (Voinnet 2009). More than 80 % of 
the modified Arabidopsis and rice miRNAs are 
found in the AGOI and AGO4 complexes, indi- 
cating that the modified miRNAs have a func- 
tional role. 

Small interfering RNAs (siRNAs) are cell 
autonomous and migrate from their place of 
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origin to distant locations to cause repression of 
the target genes. However, plant miRNAs are 
generally considered as sluggish and do not tend 
to move out from their place of origin as revealed 
by the in situ hybridization data (Valoczi et al. 
2006) or cell-specific expression of artificial 
miRNA (Tretter et al. 2008). Though, recent 
reports indicate that a few miRNAs are really 
mobile. The tomato miR172, produced either in 
the floral or leaf tissues, migrates down to the 
roots of the plant and helps to acquire resistance 
against the nematode-induced damage. Few other 
miRNAs produced in leaf tissues are also found 
in phloem and roots which enable the plants to 
defend against the stress responses. Sensing low 
levels of nutrients like sulfur and phosphate, the 
miR395 and miR399 are known to translocate 
from the leaves to the roots to help ameliorate the 
nutrient deficiency (Voinnet 2009). 


32.3. miRNA Families, 
Conservation, and Evolution 


Since the advent of NGS techniques and advanced 
computational facilities, the small RNA database 
is flooded with plant miRNA sequences. The 
miRBase v21 = (http://www.mirbase.org/index. 
shtml) reports the presence of 7,057 miRNAs in 
73 different plant species. The miRNAs of same 
or nearly identical sequences can be produced by 
many pre-miRNAs having otherwise very differ- 
ent base sequences and can be clubbed together 
as members of a family. The miR169 family of 
Arabidopsis has 14 members. A recent study has 
computationally identified over 2,000 miRNAs 
from six Solanaceae plants based on the genomic 
sequences and PUT (PlantGDB-assembled 
unique transcripts, EST Assemblies). The 982 
potato miRNAs have been classified in 71 
miRNA families, and the miR1128 family 1s pre- 
dicted to have 141 members and thus is consid- 
ered to be the largest known family in the plant 
kingdom (Gu et al. 2014). Sometimes the unique- 
ness of the plant species can also be discerned by 
the presence of distinct miRNA family(ies). 
Initially, miRNAs discovered in A. thaliana 
and rice (Oryza sativa) were from families that 
are conserved between the two species (Llave 
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et al. 2002; Park et al. 2002; Jones-Rhoades and 
Bartel 2004; Sunkar and Zhu 2004). 
Computational prediction and sequencing tech- 
niques resulted in the identification of miRNAs 
from ancient families (Axtell and Bartel 2005). 
miR396, miR397, miR398, miR828, miR2111, 
and miR403 are few miRNA families which are 
conserved in plants ranging from vascular plants 
to seed plants (Cuperus et al. 2011). miR156, 
involved in the transition from the juvenile to the 
adult phase of shoot development in plants, has 
been found to be much conserved from angio- 
sperm to lower plants. miR166 and miR171 have 
also been found to be conserved among different 
plant species. However, there are miRNAs such 
as miR415 and miR417 which have been reported 
only in angiosperms and hence are not conserved. 
Interestingly, in single-cell alga Chlamydomonas 
reinhardtii, no miRNAs have been reported 
which are otherwise much conserved across the 
plant species suggesting its early evolutionary 
divergence from higher plants (Table 32.1). 
miRNA families also show cross-kingdom con- 
servation. Two miRNA annotated families 
(miR854 and miR855) were reported as con- 
served between the plant and animal lineages 
(Arteaga-Vazquez et al. 2006). However, loci 
from these families lie in retrotransposons. 

Apart from conserved miRNAs, species-specific 
or non-conserved miRNAs have also been identi- 
fied. Deep sequencing of A. thaliana small RNA 
populations suggested that, in spite of conserved 
miRNAs, there are miRNAs restricted to the family 
Brassicaceae (Lu et al. 2006; Rajagopalan et al. 
2006; Fahlgren et al. 2007, 2010; Ma et al. 2010). 
A large number of species-specific miRNA genes 
were observed in Physcomitrella patens and 
Selaginella moellendorffii (Axtell et al. 2007), rice 
(Heisel et al. 2008; Lu et al. 2008a; Sunkar et al. 
2008; Zhu et al. 2008), Medicago truncatula 
(Szittya et al. 2008; Lelandais-Brie re et al. 2009), 
and Glycine max (Subramanian et al. 2008). The 
presence of a large number of non-conserved miR- 
NAs in plants also suggests that plants carry 
recently evolved miRNA loci. 

The miRNA region can be divided into two 
regions, evolutionary stable and variable, which 
are the outcomes of evolution over a long period. 
The region including the mature miRNA and 


644 M. Pareek et al. 


Table 32.1 Representative plant miRNAs and their conservation across the major plant phyla (based on miRBase v21) 


Angiosperms Gymnosperms Bryophyte Alga 
miRNAs Ath Sly Gma_ Bna Osa Tae Hvu Zma Sbi Pta Pab Clin Ppa Cre 
miR156 + + - — + = 
miR159 
miR 160 
miR162 
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miR397 
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miR416 
miR417 
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miR419 
miR420 
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miR477 + - - + 
miR479  — _ _ — + — — — _ = = = - = = 
miR482 + + - + 
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miR535  —- — — — _ 
miR536 —- — _ _ = = = - = - = = _ 
miR783 — — — — _ ~ - = o = = - - © _ 
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Ath Arabidopsis thaliana, Sly Solanum lycopersicum, Gma Glycine max, Bna Brassica napus, Nta Nicotiana tabacum, 
Osa Oryza sativa, Tae Triticum aestivum, Hvu Hordeum vulgare, Zma Zea mays, Sbi Sorghum bicolor, Pta Pinus taeda, 
Pab Picea abies, Cln Cunninghamia lanceolata, Ppa Physcomitrella patens, Cre Chlamydomonas reinhardtii 
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miRNA* is stable, whereas the other region, 
which represents most of the miRNA genes 
including the loop and other sequences, belong to 
the variable region (Bullini and Coluzzi 1972). 
Over the geological periods, plant species have 
evolved in their lifestyle and behavior. Since the 
miRNAs govern the plant behavior, it is no wonder 
that the plant miRNAs have also evolved accord- 
ingly. The rate of miRNA evolution is determined 
by several factors including the time intervening 
the emergence and continuance of a given plant 
species. For example, the moss Physcomitrella 
patens and the eudicot A. thaliana are widely dif- 
ferent in their characteristics and have diverged 
about half a billion years ago. However, both the 
lineages have a comparable number of miRNA 
families, 67 and 66 in moss and A. thaliana, 
respectively (Taylor et al. 2014). Recent analysis 
reveals that the evolution of the novel miRNA 
family did not occur at a continuous speed in the 
geological scale of time, but it resulted due to sud- 
den bursts of phenotypic evolution such as the 
establishment of plant life on lands. Indeed such 
an expansion in the miRNA families within the 
angiosperm lineages has been speculated from the 
birth of flowering plants (Nozawa et al. 2012). 
Among other interesting features of miRNA 
evolution, the most notable one would possibly 
be the origin of the miRNA gene itself. A closer 
look at the pre-miRNA sequences outside the 
mature miRNA region of a given pre-miRNA 
reveals a good amount of their similarity with the 
miRNA target gene (protein-coding) sequence. 
Based on this observation, the evolution of 
miRNA genes with the inverted duplication of 
miRNA target gene or its fragments has been pro- 
posed (Voinnet 2009; Cuperus et al. 2011). In 
fact three possible pathways of formation of 
miRNA genes are known till now. (a) The theory 
of formation of plant miRNA genes based on the 
inverted duplication of protein-coding genes 
have been advanced by a number of investigators 
(Rajagopalan et al. 2006; Fahlgren et al. 2007, 
2010). This theory is attractive as it readily 
explains the target selection by miRNAs and the 
consequent silencing. The processing aspects 
have also undergone evolution from the DCL4- 
mediated processing to the DCL1-centered activ- 
ity. (b) Miniature inverted-repeat transposable 
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elements (MITEs) could have given rise to 
miRNA genes as they harbor potential hairpin 
structures. Many Arabidopsis and rice miRNAs 
might have evolved through MITEs (Piriyapongsa 
and Jordan 2008). (c) Duplication of preexisting 
miRNA genes with accumulation of mutations 
might have brought in new miRNA genes. As the 
plant miRNAs are often clubbed as members of 
the miRNA family, consisting of several miRNA 
genes, this theory has received particular atten- 
tion (Li and Mao 2007). All of these factors con- 
tribute to different extents in the evolution of 
plant miRNAs (Nozawa et al. 2012). 

Irrespective of pathways of evolution, a few 
miRNA genes which are highly conserved and 
expressed across the plant kingdom is classified 
as the old class of miRNA genes, although these 
are few in numbers. Most other miRNA genes are 
phylogenetically restricted and have evolved rel- 
atively recently and expressed in either low or 
moderate copies in plants. These latter miRNA 
genes are known as the young ones. A few of 
them might not have their targets at all, and these 
have evolved most recently in the time scale. If 
they cannot acquire their target genes, they might 
eventually dissipate (Cuperus et al. 2011). 


32.4 Role of MicroRNAs in Gene 
Regulation 


miRNAs regulate the target gene expression 
either by mRNA cleavage or by translational 
repression. Choice of the posttranscriptional 
mechanisms is determined by the identity of the 
target. Once incorporated into a cytoplasmic 
RISC, the miRNA will specify cleavage if the 
mRNA has sufficient complementarity to the 
miRNA, or it will repress the translation if the 
mRNA does not have sufficient complementarity 
to be cleaved but does have a suitable clustering 
of miRNA-binding sites. 


32.4.1 miRNA-Directed Target mRNA 
Cleavage/Degradation 


Plant miRNAs recognize full or near comple- 
mentary mRNA targets and direct endonucleolytic 
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mRNA cleavage (Llave et al. 2002; Rhoades 
et al. 2002; Tang et al. 2003; Voinnet 2009). 
Cleavage occurs between nucleotides in the 10th 
and 11th nucleotide position, opposite the 
miRNA strand, and is catalyzed by the AGOs. 
The resulting 5’ and 3’ mRNA fragments are 
degraded from the newly generated 3’ and 5’ 
ends, by the activity of nonspecific nucleases 
(Souret et al. 2004). 

The register of cleavage does not change when 
the miRNA is not perfectly paired to the target at 
its 5’ terminus (Kasschau et al. 2003; Palatnik 
et al. 2003). Therefore, the cut site appears to be 
determined relative to miRNA residues, not 
miRNA: target base pairs. After cleavage of the 
mRNA, the miRNA remains intact and can guide 
the cleavage of additional mRNAs (Hutvagner 
and Zamore 2002; Tang et al. 2003). These stud- 
ies indicated that endonucleolytic cleavage of 
highly complementary targets is a prominent 
mechanism of miRNA-based regulation in plants. 


32.4.2 miRNA-Directed Translational 
Repression of Target Genes 


miR156/miR157, which targets SBP-box gene 
SPL3, and miR172, which targets APETALA 2, 
have been reported to repress translation in plants 
(Aukerman and Sakai 2003; Chen 2004; 
Gandikota et al. 2007). A. thaliana mutants 
defective in silencing of a GFP reporter contain- 
ing miR171-binding site in the 3’-UTR showed 
higher expression of GFP protein, while the 
mRNA levels were not restored. This suggested 
that the mutations suppressed miRNA-mediated 
translational repression but not mRNA cleavage. 
Nevertheless, the observations suggest that plant 
miRNAs can repress translation in the absence of 
target degradation (Brodersen et al. 2008). 
Enhancer of decapping 4 (eDC4; also known 
as Gel and HeDIS), which is known as vARI- 
COSe in A. thaliana, is required for miRNA- 
mediated mRNA decay but not translational 
repression in animals (Eulalio et al. 2007). In 
Drosophila melanogaster cells, eDC4 is known 
as a Suppressor of miRNA-mediated gene silenc- 
ing by depleting eDC4-suppressed mRNA 
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degradation mediated by miRNAs, restoring 
transcript levels (Eulalio et al. 2007), whereas, in 
plants, vARICOSe mutants showed increased 
levels of protein expression but no increase in 
mRNA levels (Brodersen et al. 2008). 


32.4.3 miRNA-Directed DNA 
Methylation 


miRNAs are also involved in DNA methylation, a 
well-known epigenetic regulation, and change 
the expression pattern of genes. There are miRNA 
variants which associate with AGO proteins 
involved in RNA-directed DNA methylation 
(Law and Jacobsen 2010). AGO4, AGO6, and 
AGO%9 have been reported in A. thaliana to direct 
methylation of target DNA by binding with 24-nt 
siRNAs having 5’ adenosine generated by DCL3 
from dsRNAs produced through the activities of 
RDR2- and DNA-dependent RNA polymerase 
IV (Qi et al. 2006; Mi et al. 2008; Havecker et al. 
2010; Law and Jacobsen 2010). In rice, DCL3a 
has been shown to process multiple miRNA fold- 
backs, yielding 24-nt miRNA-like siRNAs which 
preferentially associate with rice AGO4a and 
AGO4b, and like 24-nt siRNAs, they can guide 
the methylation of target (Wu et al. 2010). In 
moss Physcomitrella patens dcllb mutants, there 
was accumulation of miRNA-target RNA 
duplexes and hypermethylation of the genes 
encoding target RNAs, leading to gene silencing 
(Khraiwesh et al. 2010). Therefore, it can be con- 
cluded that apart from the function of mRNA 
repression, miRNA and miRNA-—AGO com- 
plexes can function at transcriptional silencing 
level. 


32.5 Regulatory Role of miRNAs 
in Plant Growth 
and Development 


Plant development is a highly regulated process 
and controlled at many levels. The discovery of 
miRNAs and their roles in plant development has 
opened a new frontier to further understand the 
mechanisms underlying this regulation. miRNA 
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target genes are mainly transcription factors and 
F-box proteins, involved in major plant 
developmental pathways such as leaf develop- 
ment, patterning and polarity, floral identity and 
flower development, flowering time, 
developmental-phase transition, shoot and root 
development, hormone signaling, and _ stress 
responses (Jones-Rhoades et al. 2006). 

Developmental defects of the A. thaliana 
mutants ago/l, dcll, hen1, hyll, and hst provided 
clues for the key roles of miRNAs in plant devel- 
opment (Telfer and Poethig 1998; Bohmert et al. 
1998; Jacobsen et al. 1999; Lu and Fedoroff 
2000; Chen et al. 2002). Reduced level of miRNA 
and many developmental abnormalities were 
observed in loss-of-function dcl/ gene (Park 
et al. 2002; Reinhart et al. 2002; Dugas and Bartel 
2004). HASTY, a protein required for transport- 
ing miRNA—miRNA* duplex from the nucleus to 
the cytoplasm, when downregulated, caused 
many pleiotropic phenotypes including disrupted 
leaf shape and flower morphology, reduced fertil- 
ity, accelerated vegetative phase change, and dis- 
rupted phyllotaxis of the inflorescence (Bollman 
et al. 2003). These findings clearly indicate that 
miRNAs are involved in a variety of developmen- 
tal processes in plants. 


32.5.1 Leaf Development 
and Patterning 


Plant leaves exhibit adaxial/abaxial (upper/lower) 
axis pattern which is controlled by the expression 
of class II homeodomain-leucine zipper 
(HD-ZIP) (Juarez et al. 2004). PHABULOSA 
(PHB), PHAVOLUTA (PHV), and REVOLUTA 
(REV) are three closely related Arabidopsis 
HD-ZIP transcription factors. Using computa- 
tional approaches, miR165/miR166 has been 
implicated in the functioning of these transcrip- 
tion factors. In phb-d, phv-d, and rev-d mutants, 
the mutations are found to be in the region of 
miR165/miR166 complementary sites which 
decrease the efficiency of miRNA-directed cleav- 
age in vitro (Tang et al. 2003). Another member 
of class IIIf HD-ZIP transcription factors, 
INCURVATA4/ATHB15/CORONA, was found 
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to participate in the control of leaf polarity, as 
well as in shoot and root apical meristem pattern- 
ing and stem vascular differentiation (Prigge 
et al. 2005; Williams et al. 2005; Ochando et al. 
2006). Gain-of-function icu4 mutants bear the 
same nucleotide substitution similar to phabu- 
losa, phavoluta, and revoluta mutants. This muta- 
tion affects the miR165/miR 166 complementarity 
in target mRNAs, resulting in impaired cleavage 
of the mutant transcripts. Study also revealed that 
miR164 is another regulator of leaf patterning 
(Nikovics et al. 2006). miR164a and the tran- 
scription factor CUP-SHAPED COTYLEDON2 
(CUC2) gene controls leaf margin serration in 
Arabidopsis. 

In addition to miR166 and miR165, miR159/ 
miRJAW also regulates leaf development by tar- 
geting a subset of TCP transcription factor genes. 
Overexpression of miRJAW resulted in low lev- 
els of all tested TCP mRNAs and caused jaw-D 
phenotypes, including uneven leaf shape and cur- 
vature. In contrast, overexpression of miRJAW- 
resistant TCP mutants indicated that 
miRJAW-guided mRNA cleavage was sufficient 
to restrict TCP function (Palatnik et al. 2003). 
Lately, it has been shown that miR396 which 
regulates growth-regulating factors (GRFs tran- 
scription factors), when overexpressed, affected 
the cell proliferation in the meristem and devel- 
oping leaves and showed reduced leaf size 
(Mecchia et al. 2013). 


32.5.2 Flower Development 


Flower development is one of the most important 
stages of plant development, especially for flow- 
ering plants. Mature flowers consist of carpels, 
stamens, petals, and sepals. The ABC model best 
explains the origin of flower (Lohmann and 
Weigel 2002; Jack 2004). According to this 
model, four flower organs are controlled by the 
combinatorial actions of three classes (A, B, and 
C) of transcription factors (Lohmann and Weigel 
2002). APETALA 2 (AP2) is one of the class A 
genes that play an important role in flowering 
time and flower morphology. It has been demon- 
strated that AP2 is one of the targets of miR172 
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(Aukerman and Sakai 2003; Chen 2004). 
Overexpression of miR1I72 inhibited the 
translation of the AP2 gene and AP2-like genes 
(target of eat 1, TOE1) and resulted in early flow- 
ering and disrupting the specification of floral 
organ identity similar to loss-of-function ap2 
gene mutants (Aukerman and Sakai 2003; Chen 
2004; Luo et al. 2013). Another important feature 
of flower development is phase change, 1.e., the 
change from vegetative growth to reproductive 
growth. This transition (including the transition 
from a juvenile to adult leaf) is also regulated by 
the AP2-like gene glossyI5 (gl15) in maize. 
Overexpression of miR172 resulted in decreased 
gl15 expression through gl15 mRNA cleavage 
and resulted in delayed phase change from vege- 
tative to reproductive (Lauter et al. 2005). 

Formation of extra petals in early-arising 
flowers was observed in Arabidopsis eep/ mutant 
which is the target of miR164c, one of the three 
known members of the miR164 family (Baker 
et al. 2005). The other two members of the 
miR164 family are miR164a and miR164b, 
which negatively regulate several members of the 
NAC family of transcription factors that are 
required for plant organ formation and boundary 
establishment (Reinhart et al. 2002; Rhoades 
et al. 2002; Jones-Rhoades and Bartel 2004). 

GAMYB, gibberellin-modulated MYB._ tran- 
scriptional activator and a positive regulator of 
LEAFY, has been reported to play an important 
role in flower development (Achard et al. 2004; 
Kaneko et al. 2004; Millar and Gubler 2005). 
GAM YB is one target of miR159. Overexpression 
of miR159 resulted in a reduction of LEAFY tran- 
script levels and further perturbed anther develop- 
ment and delayed flowering in_ short-day 
photoperiods (Achard et al. 2004; Schwab et al. 
2005). miR156 is also involved in floral develop- 
ment and phase change by targeting members of 
squamosa promoter binding protein-like (SPL) 
plant-specific transcription factors. Overexpression 
of miR156 affects phase transition from vegetative 
growth to reproductive growth, including early ini- 
tiation of rosette leaves, a severe decrease in apical 
dominance, and a moderate delay in flowering 
(Schwab et al. 2005). 
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Cardamine flexuosa, an herbaceous biennial 
plant, requires an age-dependent vernalization. 
miR156 and miR172 have been shown to control 
the timing of sensitivity to vernalization by regu- 
lating the MADS-box gene CfSOC1 and hence 
regulate flowering. The transgenic plants with the 
highest miR172 level flowered without exposure 
to cold temperature (Zhou et al. 2013a). 


32.5.3 Organ Development 


Plant shoots and roots develop from the shoot 
apical meristems and the root apical meristems, 
respectively. cuc genes encode the transcription 
factors of the NAC family known to be involved 
in shoot apical meristem formation and cotyle- 
don separation during embryogenesis in 
Arabidopsis. Members (e.g., CUC1, CUC2, and 
NACI) of the NAC family are targets of the 
miR164 family (Laufs et al. 2004; Mallory et al. 
2004; Guo et al. 2005; Schwab et al. 2005). Plants 
overexpressing miR164 show a range of develop- 
mental abnormalities similar to nac gene family 
mutants. In contrast, loss-of-function miR164 
mutants accumulate higher levels of nac mRNA, 
resulting in more lateral root formation (Guo 
et al. 2005). Thus, in the Arabidopsis meristem, 
miR164 restricts the boundary expansion by 
degrading cucl and cuc2 mRNAs during organ 
initiation. 


32.6 Role of miRNA in Plant 
Pathogen/Pest Interaction 


Plants in their native environment interact with 
many microorganisms like virus, bacteria, fungi, 
insects, nematodes, etc. Interacting microorgan- 
isms may be beneficial or pathogenic to the 
plants. This brings about many changes in the 
expression patterns of genes involved in plant 
pathogen interaction. miRNAs have been shown 
to play an important role in regulating the genes 
during stress responses in plants. This section 
provides a brief review of miRNAs associated 
with plant pathogen and pest attack. 
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32.6.1 miRNAs During Viral Infection 


MicroRNAs responsive to plant infection by 
viruses were identified in many plant species, e.g., 
Brassica rapa (He et al. 2008), rice (Du et al. 
2011), Arabidopsis (Bazzini et al. 2009; Blevins 
et al. 2011; Hu et al. 2011), and tomato (Lang et al. 
2011). Turnip mosaic virus (TuMV)-encoded RNA 
silencing suppressor, P1/HC-Pro, was reported to 
interfere with miR171, which targets Scarecrow- 
like transcriptional factors and inhibits the miR171- 
guided target mRNA cleavage. Eight out of ten 
miR171 targets showed elevated accumulation in 
the presence of P1/HC-Pro, but the expression of 
miR171 was not affected due to the suppressor 
impacting steps downstream the maturation of 
miR171 (Kasschau et al. 2003). In Arabidopsis, 
miR156 and miR164 were induced by infection 
with the virus TuMV and were induced in trans- 
genic Arabidopsis plants expressing the viral 
silencing suppressor P1/HC-Pro (Kasschau et al. 
2003). Upon virus infection, new miRNAs 
(miR1I58 and miR1I885) were identified in 
Brassica; miR158 and miR1885 were specifically 
induced by TuMV but not by cucumber mosaic 
virus or tobacco mosaic virus (He et al. 2008). 


32.6.2 miRNAs During Bacterial 
Infection 


Differential miRNA expression pattern during 
bacterial infection was first observed in tumor 
induced by Agrobacterium tumefaciens infection, 
in which At-miR393 and At-miR167 were down- 
regulated. The predicted targets of these miRNAs 
are ARF8 and TIR1 proteins which are known to 
positively regulate auxin response in plants 
(Dunoyer et al. 2006). Concurrent report has 
showed that miRNA393 expression was increased 
in flg22-elicited (PAMP associated with 
Pseudomonas syringae) Arabidopsis seedlings. 
At-miR393 targets the auxin receptor TIR1 and 
AFB transcripts leading to repression of the 
auxin-signaling pathway. Further overexpression 
of At-miR393 resulted in phenotypes similar to 
auxin-signaling mutants and showed increased 
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resistance against bacterial infection (Navarro 
et al. 2006). Using direct sequencing approach, 
expression profile of small RNA of Arabidopsis 
leaves treated with P. syringae pv. tomato (Pst) 
identified three miRNAs (miR160, miR167, and 
miR393) that were highly induced and one that 
was downregulated (miR825) after infection. 
miR167 and miR160 target the transcripts encod- 
ing transcription factors involved in auxin- 
signaling pathways (Fahlgren et al. 2007). These 
reports indicate that miRNA-mediated downreg- 
ulation of auxin signaling is involved in plant 
immunity against bacterial infections. 

agol and dcll mutants are compromised in 
flg22-mediated bacterial resistance, providing 
their role in PTI response. RNA sequencing asso- 
ciated with AGOI shows accumulation of 
miR160, miR167, miR393, miR396, and miR824 
and reduction of miR398 and miR/773. 
Overexpression of miR160, miR398, and miR773 
shows that miR160 positively regulates while 
miR160 and miR398 negatively regulate PTI 
response (Li et al. 2010). 

Moreover, it has been also reported that 
miR393* accumulates during Pst infection and is 
highly enriched in AGO2 protein in Arabidopsis 
seedlings. The miR393*—-AGO2 complex targets 
a SNARE protein MEMB12 which regulates 
secretion and accumulation of PRI protein gov- 
erning the bacterial resistance to plants. Thus, 
miR393-—m1R393* pair is an amusing example 
that contributes to antibacterial resistance using 
different AGO proteins (AGOI/AGO2) and 
targeting different pathway genes (Zhang et al. 
201 1b). 

Global miRNA expression profiling using 
small RNA libraries of the Arabidopsis plant 
treated with the different strains of Pst shows the 
involvement of miRNA-mediated regulation of 
different plant signaling pathways. Among them, 
miR160, miR167, miR390, and miR393 regulate 
auxin signaling, and miR319 interfere in jasmo- 
nate pathway genes. Moreover, miRNA159 sup- 
presses abscisic acid signaling and activates 
salicylic acid signaling pathway genes via target- 
ing MYB33 and MYB/0/ transcripts (Zhang et al. 
201 1a). 
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32.6.3 miRNAs During Fungal 
Infection 


Various miRNA families were shown to differen- 
tially express in response to infection by the rust 
fungus Cronartium quercuum f. sp. fusiforme, 
which causes fusiform rust disease 1n pines; 10 of 
11 miRNA families were downregulated, includ- 
ing seven that were pine specific. For example, 
miR156 and miR160 were repressed in pine 
stems infected with the rust fungus (Lu et al. 
2007). Moreover, miRNAs may affect the 
response of wheat (T7riticum aestivum L.) to pow- 
dery mildew infection; some of the conserved 
miRNAs showed differential expression patterns 
in response to powdery mildew infection: 
miR156, miR159, miR164, miR171, and miR396 
were downregulated, and miR393, miR444, and 
miR827 were upregulated, respectively (Xin 
et al. 2010). 

Next-generation sequencing approach was 
used to identify the novel and conserved miRNA 
in soybean during different stress conditions. The 
study revealed the downregulation of miR166, 
miR169, miR397, and MIR-Seq13 genes in the 
susceptible genotype during infection with 
Phakopsora pachyrhizi (Kulcheski et al. 2011). 
Three different cultivars of soybean were ana- 
lyzed for the miRNA expression using microar- 
ray technology. Analysis showed that miR1507, 
miR1508, miR1510, miR159, miR319, miR396, 
and miR482 are negatively regulated, whereas 
families of miR156, miR166, and miR171 were 
positively regulated by Phytophthora sojae infec- 
tion (Guo et al. 2011). 

Tomato miRNAs regulated by Botrytis cine- 
rea have been identified through microarray- 
based analysis. miR160 and miR1I7la were 
downregulated and miR169 was upregulated dur- 
ing Botrytis infection (Jin et al. 2011). Microarray 
analysis of Populus trichocarpa plantlets infected 
with the poplar stem canker pathogen, 
Botryosphaeria dothidea, revealed upregulation 
of 12 miRNAs, namely, miR156, miR159, 
miR160, miR164, miR166, miR168, miR172, 
miR319, miR398, miR408, miR1448, and 
miR1450, but no downregulation of any miRNA 
was observed (Zhao et al. 2012). 
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Verticillium wilt, caused by the fungal patho- 
gen Verticillium dahliae, is a vascular disease 
which threats many economically important crops 
including cotton and eggplant. Small RNA librar- 
les were constructed by cotton root tissue inocu- 
lated with the fungal spores; on comparison with 
the control and other cultivar tissues, 215 miRNA 
families were identified including the 14 novel 
miRNA families. During the fungal infection, 
more than 65 miRNA families showed change in 
their expression among the libraries. Similarly, in 
eggplant during infection by V. dahlia, mock con- 
trol and infected seedling were used for deep 
sequencing of small RNA. Sequencing results 
showed the presence of 99 known miRNA fami- 
lies and two putative miRNAs in eggplant. 
However, six miRNAs changed their expression 
during the fungal infection in eggplant. During 
the Verticillium infection, the small RNA pull is 
shifted from 21-nt to 24-nt reads in comparing 
with mock to infected samples. The same phe- 
nomenon was observed in both cotton and egg- 
plant, representing the involvement of different 
small RNA silencing pathways during fungal 
infection (Yin et al. 2012; Yang et al. 2013b). A 
recent study on miRNA involvement in plant 
immunity was conducted in rice to reveal the role 
of miRNAs in plant response to the blast fungus 
Magnaporthe oryzae. A new rice miRNA, Osa- 
miR7695, was discovered and found to negatively 
regulate a natural resistance-associated macro- 
phage protein 6 (OsNramp6). Overexpression of 
Osa-miR7696 in transgenic rice plants conferred 
enhanced resistance to rice blast infection. The 
study also highlights a miRNA-mediated regula- 
tion of OsNramp6 in disease resistance while 
illustrating the existence of a novel regulatory net- 
work that integrates miRNA function and mRNA 
processing in plant immunity (Campo et al. 2013). 


32.6.4 miRNAs During Insect Attack 


Very few reports are available for miRNA expres- 
sion profiling after insect attack. Pandey et al. 
(2007) reported herbivory-induced change in 
small RNA transcriptome of plants. Myzus 
persicae (green peach aphid) produced signifi- 
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cantly less progeny on Arabidopsis miRNA 
pathway mutants, suggesting an important role of 
miRNAs during phloem-feeding insect, the green 
peach aphid (Myzus persicae), attack. Plants 
unable to process miRNAs respond to aphid 
infestation with increased induction’ of 
PHY TOALEXIN DEFICIENT3 (PAD3) and pro- 
duction of camalexin. Aphids ingest camalexin 
when feeding on Arabidopsis and were more suc- 
cessful on pad3 and cyp79b2/cyp79b3 mutants 
defective in camalexin production. Aphids pro- 
duced less progeny on artificial diets containing 
camalexin. miRNAs important for the growth and 
development of pests can be a potential target for 
raising resistant cultivars. This work also high- 
lights the extensive role of the miRNA-mediated 
regulation of secondary metabolic defense path- 
ways with relevance to resistance against a hemip- 
teran pest (Kettles et al. 2013). 


32.6.5 miRNAs During Nematode 
Infestation 


miRNAs were also suggested to be involved in 
plant—nematode interactions. miR161, miR164, 
miR167a, miRI72c, miR396c, miR396a, 
miR396b, and miR398a were downregulated in 
Arabidopsis in response to infection by the nema- 
tode Heterodera schachtii (Hewezi et al. 2008; 
Khraiwesh et al. 2012). Comparative analysis of 
miRNA profiling in soybean indicated that 101 
miRNAs belonging to 40 families were respon- 
sive to the infection of the soybean cyst nematode 
(SCN; Heterodera glycines), the most devastat- 
ing pathogen in soybean. Twenty miRNAs were 
differentially expressed between SCN-resistant 
and SCN-susceptible soybean cultivars (Li et al. 
2012). 


32.7. Role of miRNAs During 
Abiotic Stress Responses 


Abiotic stresses are the major constraint for the 
agricultural productivity. Also, to combat the 
oversized population and exploitation of fertile 
land, we need to use the unusable land for farming 
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in geographically different areas. Many abiotic 
stresses cause decrease in plant productivity 
among which drought, salinity, cold, and heat 
stress are common. miRNAs are involved in abi- 
otic stress responses in plants and are well stud- 
ied. The following part provides some examples 
of miRNAs and their role during abiotic stresses. 


32.7.1 Drought and Salinity Stress 


Drought and salinity are two major constraints to 
agricultural productivity worldwide. In addition 
to protein-coding genes, the expression of miR- 
NAs in plants is altered during stress conditions 
(Sunkar and Zhu 2004; Zhao et al. 2007; Trindade 
et al. 2010; Kulcheski et al. 2011; Sunkar et al. 
2012). The miRNAs that have been identified as 
stress responsive in diverse plant species are too 
numerous. We have listed below the recently 
identified miRNAs which have been reported to 
be involved in stress response regulation. 

Nuclear factor Y (NF-Y) is a heterotrimeric 
transcription factor composed of NF-YA, NF-YB, 
and NF-YC proteins. GmNFYA3 encodes the 
NF-YA subunit of the NF-Y complex in soybeans 
(Glycine max L.). Coexpression in Nicotiana ben- 
thamiana and 5’ RACE assays indicated that 
miR169 directs GmNFYA3 mRNA cleavage 
in vivo. miR169 was demonstrated to be a posi- 
tive modulator of drought stress tolerance in 
transgenic Arabidopsis plants (Ni et al. 2013). In 
tomato, miR169 was induced by drought stress, 
and constitutive overexpression of tomato 
miR169c led to reduced stomatal openings and 
transpiration rate and leaf water loss, thus enhanc- 
ing drought tolerance in transgenic plants com- 
pared with wild-type controls. The transgenic 
tomato plants did not show any morphological 
and developmental variations under field condi- 
tions, demonstrating that miR169 or its targets 
could be potential candidate genes for genetic 
engineering to achieve enhanced abiotic stress 
tolerance in plants (Zhang et al. 20I1Ic). 
Contrasting roles of miR169, 1.e., positive regula- 
tor in tomato and negative regulator in Arabidopsis, 
suggest that the function of conserved miRNAs 
may vary from species to species. 


652 


miR319 is one of the first characterized and 
conserved miRNA families in plants and has 
been demonstrated to target TCP (for TEOSINTE 
BRANCHED/CY CLOIDEA/PROLIFERATING 
CELL FACTORS [PCF]) gene-encoding plant- 
specific transcription factors. The role of miR319 
during abiotic stress was demonstrated using 
transgenic creeping bent grass (Agrostis stolon- 
ifera) overexpressing a rice (Oryza sativa) 
miR319 gene, Osa-miR319a. Transgenic plants 
overexpressing Osa-miR319a displayed morpho- 
logical changes and exhibited enhanced drought 
and salt tolerance. Gene expression analysis indi- 
cated that at least four putative miR319 target 
genes, AsPCF5, AsPCF6, AsPCF8, and AsTCP14, 
and a homologue of the rice NAC domain gene 
AsNAC60 were downregulated in transgenic 
plants. The results clearly demonstrate that 
miR319 controls plant responses to drought and 
salinity stress. The enhanced abiotic stress toler- 
ance in transgenic plants can be related to signifi- 
cant downregulation of miR319 target genes, 
implying their potential for use in the develop- 
ment of novel molecular strategies to genetically 
engineer crop species for enhanced resistance to 
environmental stress (Zhou et al. 2013b). 


32.7.2 Cold and Heat Stress 


Temperature fluctuations over the course of each 
day and during changes in the seasons are com- 
mon, and plants have to reprogram their gene 
expression to adjust to such dramatic shifts in 
temperature. miRNAs have been identified in 
plants recently in response to temperature stress. 
Cold-responsive gene regulatory networks are 
regulated by a number of miRNAs. Comparative 
profiling of miRNA expression among different 
species [Arabidopsis, Brachypodium, and poplar 
(Populus trichocarpa)| during cold _ stress 
revealed similarities as well as differences in 
miRNA regulation. miR397 and miR169 were 
upregulated in all the three, while miR172 upreg- 
ulation was observed in Arabidopsis and 
Brachypodium but not in poplar (Zhang et al. 
2009). Opposing patterns of miRNA expression 
during cold stress have also been reported in 
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other plant species. During cold stress, miR168 
levels were upregulated in poplar and Arabidopsis 
(Lu et al. 2008b; Liu et al. 2008) but downregu- 
lated in rice (Lv et al. 2010). Similarly, miR171 
was downregulated in rice (Lv et al. 2010) but 
upregulated in Arabidopsis (Liu et al. 2008) in 
response to cold stress. Some of these conflicting 
reports could be genuine responses that may be 
attributed to the differential responses of unre- 
lated plant species. 

The miR319 family is one of the conserved 
miRNA families among diverse plant species 
which is reported to regulate plant development 
in dicotyledons, but little is known at present 
about its functions in monocotyledons. In rice, 
the miR3/9 gene family comprises two mem- 
bers, Osa-miR319a and  Osa-miR319b. 
Overexpression of Osa-miR319a and Osa- 
miR319b in rice both resulted in wider leaf 
blades. Leaves of Osa-miR319 overexpression 
transgenic plants showed an increased number of 
longitudinal small veins, which probably 
accounted for the increased leaf blade width. In 
addition, overexpressing Osa-miR319 led to 
enhanced cold tolerance (4 °C) after chilling 
acclimation (12 °C) in transgenic rice seedlings. 
Notably, under both 4 and 12 °C low tempera- 
tures, Osa-miR319a and Osa-miR319b were 
downregulated, while the expression of miR319- 
targeted genes was induced. Furthermore, down- 
regulating the expression of OsPCF5 and 
OsPCFS (target genes of miR319) in RNA inter- 
ference (RNAjt) plants also resulted in enhanced 
cold tolerance after chilling acclimation. The 
study demonstrated that miR319 plays important 
roles in leaf morphogenesis and cold tolerance in 
rice (Yang et al. 2013a). 

Powdery mildew infection and high tempera- 
ture are important limiting factors for yield and 
grain quality in wheat production. Long non- 
protein-coding RNAs (npcRNAs) are 
developmentally regulated and play roles in 
development and stress responses of plants. 
Several putative wheat stress-responsive long 
npcRNAs (approx. 125) were computationally 
identified which were not conserved among plant 
species. Among them, some were precursors of 
small RNAs such as miRNAs and siRNAs, two 
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long npcRNAs were identified as signal recogni- 
tion particle (SRP) 7S RNA variants, and three 
were characterized as U3 snoRNAs. Wheat long 
npcRNAs showed tissue-dependent expression 
patterns and were responsive to powdery mildew 
infection and heat stress (Xin et al. 2011). 

Heat stress rapidly induces miR398 and 
reduces transcripts of its target genes CSD1, 
CSD2 (copper/zinc superoxide dismutase), and 
CCS (a gene encoding a copper chaperone for 
both CSDI1 and CSD2). Transgenic plants 
expressing miR398-resistant forms of CSD1, 
CSD2, and CCS under the control of their native 
promoters were more sensitive to heat stress (as 
indicated by increased damage at the whole-plant 
level and to flowers) than transgenic plants 
expressing normal coding sequences of CSD1, 
CSD2, or CCS under the control of their native 
promoters. In contrast, csd/, csd2, and ccs mutant 
plants were more heat tolerant (as indicated by 
less damage to flowers) than the wild type. 
Expression of genes encoding heat stress tran- 
scription factors (HSF genes) and heat shock pro- 
teins (HSP genes) was reduced in heat-sensitive 
transgenic plants expressing miR398-resistant 
forms of CSD1, CSD2, or CCS but is enhanced in 
the heat-tolerant csd/, csd2, and ccs plants. The 
study indicated that miR398 triggers a regulatory 
loop that is critical for thermotolerance in 
Arabidopsis (Guan et al. 2013). 


32.8 miRNA Interference: 
Applications in Agriculture 


Food security is a major problem across the globe 
to fulfill the needs of a growing population. The 
limitation of fertile land and use of harmful 
chemicals for agriculture practices also pose a 
major constraint to the environment. To over- 
come these problems, resistant and high-yielding 
cultivars are in demand. The transgenic crops 
using novel and advanced technologies ensure us 
to give high-yielding crop and utilization of 
unfertile land with minimum _ environmental 
harms. miRNA interference (miRNA1) is a recent 
and novel approach (Wang 2009) which can be 
used to overcome agricultural harms by manipu- 
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lating expression of miRNA genes or the endog- 
enous genes (Rajam 2012). 


32.8.1 Deregulation of miRNAs 
(Overexpression 
and Downregulation) 


miRNAs can be studied by changing their expres- 
sion levels either by overexpression or downreg- 
ulation of miRNA genes which ultimately 
regulate target transcript levels and result in par- 
ticular phenotype. miRNA genes are present in 
genome as independent transcriptional units hav- 
ing upstream promoter and cis-acting elements. 
Overexpression studies can be done by using 
binary vectors or any plant transformation vector 
having the functional miRNA gene with upstream 
native or constitutive promoter. Downregulation 
of miRNA can be possible either by expressing 
antisense miRNA (antimiR) or by overexpressing 
the miRNA-resistant target transcript in plants. 
antimiR functions by simply binding with mature 
miRNAs, whereas miRNA-resistant target tran- 
script does not cleave because of nucleotide sub- 
stitution at miRNA-binding site (Rajam 2012; 
Guo et al. 2005). 

Most of the miRNA overexpression studies 
were carried out in A. thaliana as it is a model 
plant system for molecular studies. Various kinds 
of overexpression studies were carried out to 
functionally characterize miRNA genes. For 
instance report, inducible expression of 
miR164 in wild-type plants led to reduction in 
NACI mRNA levels and reduced lateral root 
emergence, while miR164 mutant showed low 
level of miR164 and increased NACI transcript 
resulting in more lateral roots (Guo et al. 2005). 
Similarly, stomatal density on epidermis in 
Arabidopsis is regulated by Agamous-likel6 
(AGL16) gene, a member of the MADS-box pro- 
tein family, which is in turn regulated by 
miRNA824. Overexpression of miR160 resulted 
in agravitropic roots and increased the number of 
lateral roots in Arabidopsis by manipulating 
auxin-signaling genes (Wang et al. 2005). In 
another study, overexpression of miR824-resis- 
tant AGLI6 shows increased incidence of sto- 
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mata, while overexpression of miR824 resulted 
in decreased incidence of stomata in higher-order 
complexes (Kutter et al. 2007). At-miR159 is 
involved in the regulation of short-day photope- 
riod flowering time and anther development. 
Overexpression of At-miR159 resulted in delayed 
flowering in short-day photoperiods and enhanced 
male sterility (Achard et al. 2004). 

Rice 1s a major food crop worldwide and can 
be a good model crop to study novel molecular 
practices for crop improvement. Overexpression 
of OsamiR397 resulted in enlarged grain size and 
promoted panicle branching. It has been shown 
that OsamiR397 regulates the brassinosteroid 
signaling which resulted in increased grain yield 
(Zhang et al. 2013). In another report, overex- 
pression of miR319a and miR319b resulted in 
wider leaf blades and enhanced cold tolerance in 
rice seedlings (Yang et al. 2013a). 

Sly-miR169 accumulates during drought stress 
while their targets’ nuclear factor Y subunit genes 
(SINF-YA1/2/3) and one multidrug resistance- 
associated protein gene (SIMRPI1) levels were 
reduced significantly in tomato. Overexpression 
of Sly-miR169 in tomato plants resulted in 
reduced stomatal opening, decreased transpira- 
tion rate, lowered leaf water loss, and enhanced 
drought tolerance (Zhang et al. 201 1c). 

These reports on different plant systems 
showed that miRNA overexpression may lead to 
some agriculturally important phenotypes that 
can be useful to develop high-yielding crops. 
Similarly, the miRNA genes of the target patho- 
gens or pests essential for their growth and devel- 
opment can be suppressed by plant expression of 
an antisense RNA specific to their miRNA 
sequences. Downregulation or inhibition of 
miRNA gene expression can also provide a novel 
tool for agricultural application but has not been 
explored in plants (Rajam 2012). 


32.8.2 Artificial MicroRNAs 
(amiRNAs) 


The artificial microRNA (amiRNA) technology 
exploits endogenous miRNA precursors to gener- 
ate small RNAs (sRNAs) that direct gene 
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silencing in either plants or animals (Alvarez 
et al. 2006; Niu et al. 2006; Schwab et al. 2006). 
miRNA precursors preferentially produce the 
miRNA-—miRNA* duplex. When both sequences 
are altered without changing structural features 
such as mismatches or bulges, it will lead to the 
accumulation of miRNA of desired sequence. 
amiRNAs were first generated and used in human 
cell lines (Zeng et al. 2002) and later in 
Arabidopsis (Parizotto et al. 2004), where they 
were shown to effectively interfere with reporter 
gene expression. Later on, it was demonstrated 
that not only reporter genes but also endogenous 
genes can be targeted with amiRNAs and also 
seem to work with similar efficiency in other 
plant species (Alvarez et al. 2006; Schwab et al. 
2006). 

amiRNA sequences are designed according to 
the determinants of plant miRNAs so that the 
selected 21 mer specifically silences its intended 
target genes. There are three major criteria which 
have to be taken care while designing the amiR- 
NAs. Firstly, they should start with a U (found in 
most plant and animal miRNAs); secondly, they 
should display 5’ instability relative to their 
amiRNA* (the nucleotides pairing to the miRNA 
in the precursor); and thirdly, their tenth nucleo- 
tide should be either an A (Reynolds et al. 2004) 
for artificial siRNAs and overrepresented in plant 
miRNAs (Mallory et al. 2004) or a U. 

So far, most of the amiRNAs in plants are 
based on the natural precursor structures of Ath- 
miR1IS59a, Ath-miR164b, Ath-miR172a, Ath- 
miR319a, and Osa-miR528. Liu et al. (2010) 
developed a simple amiRNA vector (pAmiR 169d) 
based on the structure of Arabidopsis miR169d 
precursor (pre-miR169d). amiRNAs are effective 
when expressed from either constitutive or tissue- 
specific promoters. 

amiRNAs have shown its potential in the 
development of virus-resistant plants. Cotton leaf 
curl disease, caused by single-stranded DNA 
viruses of the genus Begomovirus (family 
Geminiviridae), 18 a major constraint to cotton 
cultivation. Two amiRNA constructs, (P1C) hav- 
ing the miR169a sequence with the exception of 
the replaced 21 nt of the V2 gene sequence of 
Cotton leaf curl Burewala virus (CLCuBuV) and 
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(PID) in which the sequence of the miRNA169a 
backbone was altered to restore some of the 
hydrogen bonding of the mature miRNA duplex, 
were used to transform the Nicotiana benthami- 
ana. PIC transgenic plants showed good resis- 
tance against CLCuBuV, whereas PID plants 
showed overall poorer resistance to challenge 
with all viruses tested (Ali et al. 2013). 

amiRNAs also offer a promising strategy to 
control the multiplication and spread of gemini- 
viruses in host plants. amiRNAs target the middle 
region of the AVI (coat protein) transcript 
(amiR-AV 1-3) and the overlapping region of the 
AV1 and AV2 (pre-coat protein) transcripts 
(amiR-AV 1-1). Our analyses demonstrate that 
transgenic tomato plants expressing amiR-AV1- 
1, targeting the coat protein of geminivirus, 
Tomato leaf curl virus (ToLCV), were highly tol- 
erant to Jomato leaf curl New Delhi virus 
(ToLCNDV) (Vu et al. 2013). 

The results show that the amiRNA approach 
can deliver efficient resistance in plants against 
viral infection and that this has the potential to 
provide broad-spectrum protection from a num- 
ber of viruses. 


32.8.3 miRNA-Induced Gene 
Silencing (MIGS) 


Plants have mainly two kinds of small RNA, siR- 
NAs and miRNAs; besides these, there is another 
class of siRNA which is generated by the path- 
way other than the previously known siRNA for- 
mation mechanism. The special class of siRNA is 
known as tasiRNAs, as they are generated by the 
miRNA-directed cleavage of TAS transcripts in 
Arabidopsis plant. The generation of tasiRNA 
required component of microRNA (miRNA) 
pathway (AGO1, DCL1, HEN1, and HYL1) and 
component of cosuppression pathway (RDR6 
and SGS3) (Vazquez et al. 2004; Chapman and 
Carrington 2007). 

Arabidopsis has two genes TASI and TAS2 
which are the target of miR173 and produced 21 
nt which targets the genes having partial similar- 
ity with TAS transcript. tasiRNAs regulate gene 
expression by the activity of SGS3, RDR6, and 
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DCL4 (Vazquez et al. 2004; Yoshikawa et al. 
2005). In similar fashion, miRNA390 cleaves 
TAS3 transcript and produces tasiRNAs which 
have their target site in auxin-responsive factors. 
miRNA390 generated tasiRNAs which repressed 
the expression of the ARF2, ARF3, and ARF4, 
thus releasing repression of lateral root growth 
(Marin et al. 2010). 

miRNA-directed cleavage of the transcript 
having TAS site upstream to the gene of interest 
can be exploited to target the endogenous gene 
for the functional study and for crop improve- 
ment; the strategy is now popularly known as 
miRNA-induced gene _ silencing (MIGS). 
Felippes and Weigel (2009) reported that a frag- 
ment of CHLORATA42 (CH42) linked to a 
miRNA173 site lead to silencing of the endoge- 
nous CH42 gene. It shows that only the miRNA 
target site of JAS transcript is sufficient for the 
production of tasiRNAs and silencing of the 
endogenous transcripts. This hypothesis was fur- 
ther supported by generating the chimeric con- 
structs of genes AGAMOUS (AG), EARLY 
FLOWERING 3 (ELF3), FLOWERING LOCUS 
(FT), and LEAFY (LFY) with upstream 
miRNA173 target site (22 nt). The Arabidopsis 
plant expressing chimeric RNA of AG, ELF3, 
FT, and LFY shows the silencing of the endoge- 
nous copy of genes and shows phenotype similar 
to loss-of-function mutants. The miRNA173- 
directed cleavage of the chimeric mRNA pro- 
duces specific tasiRNAs that silence the 
endogenous gene (Felippes et al. 2012). 

MIGS has many advantages over the hpDRNA 
and amiRNA silencing strategy. The first advan- 
tage of MIGS is silencing construct preparation 
is very easy by only placing the miRNA173 tar- 
get site (22 nt) on upstream of the partial or full 
region of desired gene. The chimeric construct 
produces specific tasiRNA and_ effectively 
silences the gene of interest. The second advan- 
tage is that MIGS provides an easy way to silence 
more than one diverse gene by simple prepara- 
tion of a chimeric gene placed behind the miRNA 
target site. However, miR173 is only reported in 
the Arabidopsis plant; it limits the use of miR173- 
based MIGS in other crop plants. This problem 
can be overcome by overexpressing the 
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Ath-miR1I73 and then expressing chimeric tran- 
script for silencing of desired genes in other sys- 
tems. Ath-miR173 is not well conserved and its 
overexpression may not harm other plant 
systems. 


32.9 Conclusions and Future 
Perspectives 


MicroRNAs are a class of small noncoding RNAs 
implicated in a wide range of biological pro- 
cesses by means of regulation of gene expres- 
sion. The majority of plant miRNA targets 
comprise the transcription factors and other regu- 
latory proteins, indicating that miRNAs are major 
regulators of gene expression. miRNAs provide a 
promising tool for functional genomics to better 
understand the basic mechanisms underlying 
various plant developmental events. Mutants of 
miRNA biogenesis genes have shown severe 
pleiotropic abnormalities. Overexpression of 
miRNAs and miRNA-resistant versions of par- 
ticular miRNA targets shows a range of unusual 
phenotypes (Jones-Rhoades et al. 2006). 

miRNAs are not only reported as crucial regu- 
lator of developmental processes, but they are 
also involved in plant abiotic and biotic stress 
responses. Many miRNA families are known to 
play important roles during abiotic stress such as 
cold, heat, drought, salt, etc. (Sunkar et al. 2012), 
and also during biotic stress like viral, bacterial, 
fungal, insect, and nematode attack (Balmer and 
Mauch-Mani 2013). miRNAs regulating stress- 
responsive genes can help to understand the way 
plants work under extreme environmental condi- 
tions and may be used for fine tuning the gene 
expression to make plants more stress tolerant. 
Over the years, miRNAs have emerged as an 
important tool for crop improvement. Indeed, 
work has already been successfully undertaken in 
this direction in plants like Arabidopsis and 
tomato. This may help to reduce food insecurity 
by breeding improved crop cultivars with 
enhanced agronomic traits. 

Although miRNAs are potential tools for crop 
improvement, undesirable phenotypic changes in 
transgenic plants can be problematic. Thus, it is 
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mandatory to study the underlying mechanisms 
of miRNA-mediated regulation in plants for 
proper execution of transgenic strategies so as to 
obtain desired traits with minimal off-target 
effects. 

Hence, we can conclude that agricultural bio- 
technology along with miRNA-based techniques 
would serve as an important measure for improve- 
ment of crop plants. 
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Abstract 

The most important component for subsistence of both biotic and abiotic 
components on earth is the environment. The survival and well-being of 
both these components depend upon a healthy physical environment. The 
solution for today’s deteriorating environment is not only removal of 
the contaminants but also recycling and reuse of hazardous substances by 
converting waste to wealth in an aesthetic and eco-friendly manner. At 
present, an emerging way for the management of wastes is the surfacing 
of environmental biotechnology. In simple words, environmental biotech- 
nology can be defined as “the integration of natural sciences and engineer- 
ing in order to achieve the application of organisms, cells, parts thereof 
and molecular analogues for the protection and restoration of the quality 
of the environment’. It can be used as a tool to maintain healthy equilib- 
rium between environment and its living and nonliving components. This 
chapter deals with different domains of environmental biotechnology 
related to decontamination of polluted environment that serve as sustain- 
able solutions for many associated environmental problems. 


Keywords 
Contamination ¢ Environmental biotechnology ¢ Restoration ¢ Sustainable 
solutions 


33.1 Introduction 


There has been a tremendous increase in the 
amount of stress created on environment due to the 
en er ee eee (2) unstoppable and unavoidable increase in industri- 
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magnification, emission of carbon dioxide leading 
to acidification of oceans, global warming as a 
result of release of greenhouse gases, introduction 
of foreign plant species (invasive species) leading 
to disturbed biodiversity, etc. Production of smog 
due to ever increasing air pollution has reduced the 
amount of sunlight reaching earth affecting the 
photosynthesis process in plants. The environment 
has gone through undesirable changes due to 
man-made activities like agricultural, industrial, 
constructional, exploration and recreational activi- 
ties. Industrial activities comprise of oil refineries 
that are the biggest pollution generators in present 
era which process crude oil to obtain valuable 
products like LPG, kerosene, petrol, jet fuel, diesel 
oil, etc., resulting in generation of hazardous waste 
as an end product called oil sludge. This oil sludge 
iS a major waste generated by oil refineries. Also, 
contamination occurs during its transport, storage 
and disposal. This sludge contains persistent com- 
pounds like PAH and aliphatic hydrocarbons 
that remain in the environment over a period of 
time and cause harmful effects on humans. They 
are known to impose carcinogenic, mutagenic and 
teratogenic properties on human health (Bamforth 
and Singleton 2005). Oil spills also have an 
adverse effect on plants and animals as well. 
Lungs, liver, kidney and intestines of aquatic birds 
and animals may get damaged due to exposure to 
crude oil. Furthermore, plants covered with oil are 
unable to photosynthesise (Mandal et al. 2011). 

Mining activities especially opencast mining 
significantly devastate the land ecosystem, in 
which the land gets degraded and contaminated 
which subsequently results in pollution of nearby 
waterbodies, streams and groundwater leading to 
frightening environmental problems. Opencast 
mining activities involve the removal of large 
quantities of overburden, dumping and backfill- 
ing of the excavated area (Juwarkar and 
Jambhulkar 2008). Metalliferous mining and 
processing usually produce severe pollution of 
heavy metals. These processes leave the land 
infertile or barren which is more prone to erosion 
and leaching which further distributes pollutants 
in the nearby environment and, thus, disturbs the 
surrounding ecological balance. 
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The agricultural activities encompass the use 
of pesticides and chemical fertilisers for pest 
control and increased crop production which has 
led to land contamination rendering the soil to be 
infertile. The run-off of excess pesticides in local 
waterbodies has manifested several health prob- 
lems such as headache, body weakness, blurred 
vision, vomiting, irritability, impaired concentra- 
tion and abdominal pain. Other severe effects 
caused due to pesticides include suppressed 
immune system, asthma, reduced sperm concen- 
tration and vigour, liver diseases and nerve 
damage. The pesticide methyl bromide has been 
identified to have ozone-depleting properties 
(Ibitayo 2006). 

Furthermore, one of the greatest challenges 
for the scientific community for the past few 
decades has been the environmental contamina- 
tion caused due to wastewater. The wastewater 
generated by industries or during domestic and 
agricultural purposes has been reported to con- 
tain many harmful and toxic compounds which, 
upon unscientific discharge/disposal, leads to the 
contamination of nearby environment. 

Owing to these detrimental effects of different 
contaminants, it is necessary to sustain the envi- 
ronment. Environmental biotechnology can serve 
as a solution to this nuisance. Environmental 
biotechnology is defined as “the integration of 
natural sciences and engineering in order to 
achieve the application of organisms, cells, 
parts thereof and molecular analogues for the 
protection and restoration of the quality of our 
environment” (Ezeonu et al. 2012) . This chapter 
concentrates on different technologies developed 
till date to sustain a healthy environment by 
remediating and reclaiming polluted environ- 
ment by the use of plants and microorganisms 
as acting agents. 


33.2 Acting Domains 
of Environmental 
Biotechnology 


Figure 33.1 below depicts the domains, followed 
by a detailed description. 
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Fig.33.1 Acting domains of environmental biotechnology 


33.2.1 Restoration of Environmental 
Quality 


The quality of environment is deteriorating due 
to attack from different hazardous chemicals and 
their components. The land resources were 
thought to be abundant in early times; however, 
our carelessness and negligence in using them 
have resulted in rendering these lands as barren 
and degraded. These lands are an effect of 
increased industrial activities in past few decades 
and disposal of hazardous substances. In response 
to the adverse health effects on humans as well as 
other living organisms created by contaminated 
sites, worldwide efforts are being undertaken 
by the scientific and regulatory community. 
The physico-chemical methods are not feasible 
in giving root solution to the problem of land 
contamination as it transfers the contaminant to 
other places and leads to formation of toxic 
breakdown products. Keeping in mind the need 
of present era, productive and clean land, an eco- 
friendly and sustainable technology needs to be 
introduced. Bioremediation technology can be 
an answer to the questions regarding the contam- 
inated land-related questions. Bioremediation 
can be defined as “the use of living organisms to 
degrade the environmental contaminants into less 
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toxic forms” (Vidali 2001). The living organisms 
are generally plants and microorganisms. By 
using bioremediation technology, the contami- 
nated land can be redeveloped for use in an 
economic and environment-favouring approach. 
Bioremediation can be divided into different 
branches such as biostimulation, bioaugmenta- 
tion, biodegradation, phytoremediation, biostabi- 
lisation, etc. Bioremediation can be in situ or ex 
situ. Ex situ remediation involves excavation of 
contaminated soil and treating it away from 
the site of contamination. In situ techniques 
are mostly favoured as it reduces the chances of 
further contamination during transport and also 
cuts the cost of carrying the contaminant to a 
desired place. In situ techniques generally reme- 
diate the contaminant in the same place either by 
addition of microorganisms (bioaugmentation) or 
by adding nutrients (biostimulation). Apart from 
these two techniques, plants can also be used to 
further reduce the contamination resulting in 
complete remediation of the site which is called 
phytoremediation (Juwarkar et al. 2010). 
Bioremediation processes generally make use of 
microorganism that uses the organic contaminates 
as their carbon source. Microorganisms during 
their co-metabolism process break down the 
contaminant to form carbon dioxide and water 
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resulting in its complete mineralisation. However, 
sometimes complete mineralisation may not 
occur, but the product formed may be either non- 
toxic or less toxic as compared to the parent 
contaminant (Suthersan 2010). These reactions 
are generally oxidation reactions. Microbes 
synthesise certain enzymes that catalyse these 
reactions during which the carbon double bond 
of organic compound is broken and the adjacent 
hydrogen atom is transferred to hydrogen recep- 
tor (Nanekar et al. 2015). Similarly, plants are 
also known to degrade and hold on to toxic com- 
ponents and help in remediation. Plants also 
secrete certain nutrient-rich exudates which con- 
tain enzymes, amino acids, organic acids, sugars 
and phenolics (Bias et al. 2006). These act as food 
for the organisms that thrive in the rhizosphere, 
fastening the degradation process. Also, they have 
the potential to accumulate metals and other 
organic compounds in the tissues and roots. Many 
contaminants with industrial and mining origin 
have been remediated using bioremediation tech- 
nology. Juwarkar and Singh (2010) have success- 
fully remediated zinc mine spoil dumps using this 
integrated biotechnological approach that has 
restored the productivity and fertility of the spoiled 
land. Also, organisms are being modified geneti- 
cally to increase their degradation efficiency 
towards different contaminants. The bottlenecking 
in remediation of persistent contaminants can be 
circumvented by combination of different meta- 
bolic activities of different organisms in single 
organism or by enhancing the expression of gene 
responsible for degradation. Table 33.1 represents 
different bacteria and plants used for remediation 
purposes of different contaminants. 
Biotechnological tools are already being used 
for wastewater treatment because these treat- 
ments can manage a broad range of effluents 
effectively as compared to chemical or physical 
methods. It is usually suitable for the treatment of 
wastewater having common organic pollutants. 
The use of microorganisms for wastewater 
treatment mainly depends on the source and 
characteristics of wastewater. Three types of 
wastewater usually categorised on the basis of 
their source are domestic/municipal wastewater, 
industrial/commercial wastewater and agricultural 
wastewater. The components in the effluent may 
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be of physical, chemical or biological nature 
which can pose certain hazardous impacts on the 
environment which include changes in habitat 
of nearby vicinity especially aquatic habitat, 
changes in biodiversity and water quality, etc. 
Owing to very diverse parameters, it 1s very 
difficult to form the most effective microbial 
consortia and treatment strategy for wastewaters. 
Usually, these wastewaters require pretreatment 
as many of the components in wastewater are 
toxic to the microorganisms. The unstable organic 
matter of wastewater is converted into stale prod- 
ucts such as CO, CO,, CHy, NH3, etc. Molecular 
techniques can be used for biotreatment of waste- 
water and are a quite new field. The major appli- 
cation of these techniques may include enhanced 
degradation of xenobiotic contaminants and 
detection of pathogens and parasites using 
nucleic acid probes. The process of wastewater 
treatment process can be enhanced by selection 
of novel organisms so as to perform a certain 
action (Gavrilescu 2010). 


33.2.2 Recovery and Waste 
Utilisation 


Environmental biotechnology is creating new revo- 
lution by introducing genetically engineered organ- 
isms to extract their potential in production of 
valuable products from wastes. Also, waste can be 
utilised for synthesis of products like biosurfactants 
for use in remediation. Dubey and Juwarkar (2001) 
have studied biosurfactant production using distill- 
ery waste and whey waste by Pseudomonas aeru- 
ginosa species isolated from oil sludge. According 
to their observations, the culture produced 0.91 and 
0.92 g/l biosurfactant using distillery and whey 
wastes for its growth, respectively. 

Crop waste can be used as raw material for 
ethanol production using genetically modified 
organisms that form sugars and ultimately etha- 
nol by synthesising increased amounts of cellu- 
losic enzymes. This bioethanol may reduce the 
use of fossil fuels. Also, as it is derived from 
wastes, bioethanol can be used for carbon dioxide 
recycling in a closed loop, i.e. from the atmo- 
sphere to plant and from plant back to ethanol 
(Kim and Dale 2004). 
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Table 33.1 Genetically modified plants/bacteria and their role 


Naphthalene degradation 
Chlorinated biphenyl! degradation 


Petroleum degradation 
Bioaccumulation of mercury 
Remediation of arsenic-contaminated 


In situ bioremediation of 
Hg-contaminated sites 


Cd (naturally polluted soils) 
Remediation of Ni 
Remediation of Cr 


Enhanced provitamin A content 
Stress tolerance 


Chloroacetanilide herbicide 
phytoremediation 


Remediation of phenol 
Remediation of pentachlorophenol 
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Reference 


Sayler and Ripp (2000) 
Boldt et al. (2004) 


Jussila et al. (2007) 
Deng and Wilson (2001) 
Kostal et al. (2004) 


Ng et al. (2009) 


Ivask et al. (2011) 
Lopez et al. (2002) 
Ackerley et al. (2004) 


Aluru et al. (2008) 
James (2006) 
Macek et al. (2007) 


Oller et al. (2005) 
Limura et al. (2002) 


Remediation of bisphenol A and 


Sonoki et al. (2005) 


pentachlorophenol 


Sr.no. Species (plant/microorganism) Role 
Genetically modified bacteria 
1 Pseudomonas fluorescens HK44 
Z Pseudomonas fluorescens 
F113rifpcbrrnBP1::gfpmut3 
3 Pseudomonas putida PaW85 
4 Escherichia coli 
5 Escherichia coli strain 
water 
6 Achromobacter sp. AO22 
7 Bacillus subtilis BRIS I (pTOO24) 
8 Pseudomonas fluorescens 4F39 
9 Pseudomonas putida 
Transgenic plants 
10 Transgenic maize 
11 Transgenic maize 
12 GM tobacco 
13 Lycopersicon esculentum 
14 Nicotiana tabacum 
15 N. tabacum 
16 N. tabacum and Arabidopsis thaliana 


Many bacteria and fungi are efficient in solu- 
bilising metals that can be extracted from metal-rich 
soils under appropriate environmental conditions 
(Krebs et al. 1997). The principles behind the 
ability of microbes to leach and mobilise metal 
are redox reactions, formation of organic or inor- 
ganic acids and excretion of complexing agents. 
Metals are extracted by the principle of either 
electron transfer from metal to the microorgan- 
ism or on bacterial oxidation of metal ion (Torma 
1991). Abidin et al. (2011) in their studies con- 
cluded that Durio zibethinus husk 1s an efficient 
adsorbent for gold (III) and can be used as a low- 
cost alternative to adsorb gold (III) in wastewater 
treatment. 


33.2.3 Substitution of Non- 
renewable Base to Renewable 


The endless need of fossil fuels as energy 
resources has created two problems, 1.e. scarcity 
of available resources and pollution created by its 
burning. Also as it is a non-renewable source of 


Tolerance to herbicide 


Didierjean et al. (2002) 


energy, it will end at some time or other. Hence, 
there is a need to find a substitute for these 
energy-rich fuels. Biogas can serve as a substitute 
for conventional fuels which is also considered as 
economic over LPG. Ananthakrishnan and his 
co-workers (2013) have installed a kitchen waste 
biodigester and have found it economical and 
technically viable over LPG. 

The importance of biologically produced fuels 
has increased as a result of dramatic variations in 
oil prices and global apprehension for climatic 
changes. Biofuels are being encouraged as the 
most capable substitutes by deducing the depen- 
dence on fossils fuels, supporting local economy 
and reducing CO2 emissions. Production of bio- 
fuels from renewable resources can be seen as a 
desirable means for meeting the needs of aviation 
and other transport facilities. Immense interest is 
concentrated on biofuels as they are produced 
from photosynthetic plants which store lipids in 
the form of triacylglycerols (TAGs). These TAGs 
can be used for synthesis of biodiesel through a 
transesterification process (Smith et al. 2009). An 
igniting interest 1s developing in algaculture for 
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making biodiesel, biogasoline, biomethanol and 
many other biofuels owing to the energy and food 
crisis worldwide making use of land that is either 
not under agricultural use or is not suitable for it. 
The algal fuels are gaining interest due to attrac- 
tive characteristics that they can be grown using 
ocean and wastewater with minimal impact on 
freshwater and are relatively harmless to the 
environment if spilled owing to their biodegrad- 
able nature. 

Potts and his coresearchers (2011) have 
successfully synthesised biobutanol from mac- 
roalgae growing in Jamaica Bay, New York City. 
The algae growing in the Bay containing nitrates, 
phosphates and carbon dioxide coming from the 
atmosphere were dried, grinded and subjected to 
hydrolysis to extract carbohydrate ultimately 
forming algal sugar solution. The butanol formed 
was removed by distillation process. Bacterial 
strains Clostridium beijerinckii and C. saccha- 
roperbutylacetonicum were also used in the 
process to reach a final concentration of 0.29 g 
butanol/g sugar. 

Chisti (2007) in his work mentioned that 
microalgae can be used for production of bio- 
diesel. Microalgae are cell factories that use sun- 
light to convert carbon dioxide to biofuels, foods 
and high-value bioactivities. Making use of this 
quality, it can be used to produce a variety of bio- 
fuels like methane by anaerobic digestion of algal 
biomass biodiesel derived from microalgal oil 
and photobiologically produced biohydrogen. 


33.3 Detection and Monitoring 
33.3.1 For Pollution/Pollutants 


Detection and monitoring of pollution is equally 
important as treatment or remediation. The direct 
means to detect pollution are keeping a record of 
the amount of pollutant, whereas the indirect 
ones include recording of changes in biotic and 
abiotic factors. For example, the estimation of 
biochemical oxygen demand (BOD) can give a 
direct estimation of pollution levels of a water- 
body as the greater the amount of pollutant 
organic matter present in the waterbody, the 
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greater the amount of microorganisms to break it 
and greater the amount of dissolved oxygen is 
required. The indirect means include biotic index 
which is a scale that gives measure of an ecosys- 
tem quality by presence, absence or abundance of 
a particular species. Organisms surviving in the 
environment are exposed to many chemicals and 
tend to accumulate these chemicals in varying 
concentrations. Gerdol et al. (2000) performed a 
survey for deposition of heavy metal in mountain 
territories of northern Italy. The moss samples 
(Hylocomium splendens) were collected from 
dense network sites, and the data was correlated 
to metal concentration and climatic factors. 

A variety of biological methods are available 
for monitoring of pollutants and detection of pol- 
lution at different places. Many bacteria and 
viruses are used to monitor aquatic environment 
where water is used for drinking. Escherichia 
coli is an indicator species for contamination of 
water by domestic and industrial sewage dis- 
charges (Borrego et al. 1987). The use of living 
agents as monitoring tools is known as biomoni- 
toring. The recent advancement in the field of 
monitoring includes methods like biosensors and 
immunoassays. 

Biosensors are analytical application of bio- 
catalysts such as isolated enzymes, tissues or 
organelles or whole cells. These biocatalysts are 
immobilised on physico-chemical device that 
monitors the activity by chemical transformation 
of the substance/substrate (Neujahr 1984). 
Biosensors are analytical tools and have advan- 
tages of being highly sensitive, rapid, reproducible 
and easy to use. Owing to their poor stability in 
solutions, immobilisation plays an important role 
to develop a stable biocomponent with transduc- 
ers for integration (Kumar and D’Souza 2012). 
An optical microbial biosensor was described by 
Kumar et al. (2006) for the detection of methyl 
parathion. The biocomponent consisted of whole 
cells of Flavobacterium sp. immobilised by 
trapping in glass fibre filter. The immobilised 
species produced organophosphorus hydrolase 
that hydrolyses methyl parathion into detectable 
product p-nitrophenol. For this reason, the use of 
disposable microbial biocomponent for the detec- 
tion of methyl parathion with optical biosensor 
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was a simple, single-step and direct measurement 
of very low quantity of the sample also. 


33.3.2 For Bioremediation 


Bioremediation, as mentioned above, is intended 
to improve the quality of contaminated environ- 
ment. For an effective bioremediation to take 
place, survival of the augmented organisms is 
essential as they are the degrading army. For this, 
several monitoring assays have been listed till 
date. Luciferase markers are useful in monitoring 
inoculated consortiums. The bacteria tagged with 
bacterial luciferase genes (/uxAB) can easily be 
detected and counted with their luminescent 
colonies on agar plates (Jansson et al. 2000). 

The green fluorescent protein (GFP) which is 
coded by gfp gene from a jellyfish Aequorea vic- 
toria 1s also used as a marker to monitor bacterial 
cells in the environment. Southern blot techniques 
have been used to monitor naphthalene degraders 
in soil mesocosms where probes specific for the 
nah operon were used to determine naphthalene 
degradation potential of the microbes. Dot-blot 
hybridisations have been used to measure the 
level of PCB-degrading organisms in soil envi- 
ronments with isolated polychlorinated biphenyl 
(PCB) catabolic genes (Widada et al. 2002). 

Fazi et al. (2008) in their study have mentioned 
that PCR-based methods are used for monitoring 
the presence of Dehalococcoides sp. which 1s 
capable of completely dechlorinating chloroeth- 
enes to harmless ethene. Also at field level, real- 
time quantitative PCR (qPCR) for the 16S rRNA 
gene is the most widely used technique. Another 
powerful tool for the detection and quantification 
of key degradative microorganisms is fluores- 
cence in situ hybridisation (FISH). 


33.4 Present and Future 
of Environmental 
Biotechnology 


The key service that environmental biotechnology 
provides to the developing era is detoxification 
of different contaminated matrices to retrieve lost 
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assets and converting them to forms that can be 
used by the society with ease. The emergence of 
environmental biotechnology has created a robust 
paradigm when it comes to the fields like reme- 
diation and treatment of contaminated sites, 
production of wealth from waste, improvisations 
in technologies to monitor both pollutant and 
pollution, etc. However, there is still a wide scope 
to intertwine molecular biology and microbiology 
as the key science lying underneath environmental 
biotechnology is microbial ecology. The aware- 
ness and management of different fundamental 
biological processes can be achieved by probing 
the environment and using different methods on 
genomic, proteomic and metabolomic levels. To 
ensure that the microbial communities provide 
the desired services, they have to be managed. 
Genetic modification of organisms and develop- 
ing different monitoring tools are the most 
anticipated areas of work in the field of environ- 
mental biotechnology. Introducing these newly 
developed techniques at field level will help to 
lessen the problems created due to contaminants 
and cure the environment. 


33.5 Conclusions 


Biotechnology has established itself in many 
areas and upcoming developments assure to widen 
its scope. With new developments in treating the 
contaminated environment, the emerging list of 
what can be practised or remediated by simple 
and feasible means is increasing. As we move on 
to the next millennium, the problems will grow as 
urbanisation and industrialisation continue to 
mount. With this increasing load of pollution 
and contamination, the role of environmental 
biotechnology as a tool also grows for remedia- 
tion and environmentally sensitive industry. 
Environmental biotechnology, so far, has proven 
its potential in many areas like wastewater 
treatment and contaminated land reclamation, 
production of economically important products 
by genetic modifications, monitoring the contam- 
inants making use of biosensors, etc. Therefore, 
environmental biotechnology is well positioned 
to add to the development of a more sustainable 
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environment. The quest continues as time runs 
introducing new problems to deteriorate the 
quality of environment giving an opportunity for 
environmental biotechnology to discover new 
horizons. 
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Abstract 

Developmental activities have resulted in contamination of the environment 
and thus disturbed the ecosystem. One of the main agenda for scientific 
community has been remediation of contaminated sites which has 
been expensive and intrusive to the ecosystem. Phytoremediation is a set 
of different processes/techniques which uses plants for containment, 
destruction, or extraction of contaminants. Various processes involved in 
phytoremediation are phytodegradation, phytoextraction, rhizofiltration, 
phytostabilization, phytovolatilization, and hydraulic control. This chapter 
highlights various processes involved in phytoremediation along with the 
essential components of phytoremediation. Emphasis is also given on 
recent advancements in phytoremediation in which microbe-assisted 
phytoremediation and use of transgenic plants have been discussed. This 
technology has been receiving attention because of its cost-effectiveness 
and environment-friendly approach. Therefore, phytoremediation may 
Open a new avenue for a broader and more efficient and sustainable 
solution for the remediation of contaminated sites. 


Keywords 
Contamination « Phytoremediation * Heavy metals * Microbe-assisted 
phytoremediation * Transgenic plants 


34.1. Introduction 


Ever since man has started developmental activi- 
ties, it has made his life more and more convenient. 
J.K. Sharma * A.A. Juwarkar (24) However, these activities are also having adverse 
Eco-System Division, CSIR-National Environmental impact on the environment owing to exploitation of 
Engineering Research Institute (CSIR-NEERI), | natural resources, destruction of biodiversity, and 
Diener pe dee cate aa as use of chemicals. This has led to the disturbance in 


e-mail: aa_juwarkar@ neeri.res.in; — 
juwarkar @ gmail.com the ecosystem. The developmental activities such 


Bir Bahadur et al. (eds.), Plant Biology and Biotechnology: Volume II; Plant Genomics 673 
and Biotechnology, DOI 10.1007/978-81-322-2283-5_34, © Springer India 2015 


674 


as manufacturing, transportation, and construction 
not only exhaust the natural resources but also lead 
to production and release of huge quantity of waste 
material which in turn leads to environmental pol- 
lution. The dispersion of various contaminants gen- 
erated by various industrial activities has led to 
contamination of soil and water. Environmental 
pollution is not only a health issue but also an 
important social issue in which pollution has the 
capabilities to devastate homes and communities. 
Untreated or inappropriately treated waste is a 
main reason of pollution and environmental degra- 
dation which ultimately causes ill effects on human 
and environmental health. 

A range of organic and inorganic chemicals 
are responsible for environmental contamination. 
These include heavy metals, hazardous wastes, 
explosives, pesticides, petroleum products, etc. 
(Ghosh and Singh 2005). Major constituent of 
inorganic pollutants is heavy metals which present 
various problems to the environment, while 
organic contaminants pose different kinds of 
problems to the environment. While many metals 
are essential, they are toxic at higher concentra- 
tions as they lead to oxidative stress because of 
the formation of free radicals. Although numer- 
ous regulatory measures have been enforced to 
reduce or limit the discharge of pollutants in soil, 
they are not adequate for checking the pollution. 

Therefore, the need for the management and 
remediation of contaminated sites has been an 
issue of global concern, and researchers all over 
the globe are working extensively on formulating 
cost-effective and environmentally sound tech- 
nologies for decontamination of these sites. 
Several methods are developed to manage or 
remediate these pollutants which include physi- 
cal, chemical, and biological methods. Biological 
methods have an advantage over physical 
and chemical methods being cost incurred and 
environment friendly but have limitations with 
respect to efficiency (Juwarkar et al. 2014; 
Sharma et al. 2014). Soil microflora is able to 
degrade most of the organic contaminants, 
whereas metals need immobilization or physical 
removal. All of us should be aware of the cost 
that has to be paid to remediate the mess that has 
resulted from our careless practices. However, it 
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is also obvious that extensive research 1s essential 
to develop cost-effective and more efficient meth- 
ods for decontamination of contaminated soils. 
Bioremediation is one such technology that is 
gaining global attention which uses biological 
organisms such as microbes or plants for cleanup 
of contaminated sites (Juwarkar et al. 2014). 

Microbes offer practical solutions to the 
environmental contamination, whereas plants 
of many different diversities present different 
solutions. The utility of plants seems to be eternal. 
They provide oxygen to breath; they are the basis 
of agriculture, medicines, fuel, fiber, oils, and 
industries. Besides this, plants can also serve as a 
tool for cleanup of contaminated environment 
as well as CO, sink. The use of plants for reme- 
diation of environmental contamination is termed 
as phytoremediation. Phytoremediation has 
also been expressed as green remediation, 
agroremediation, botano-remediation, and vege- 
tative remediation. Phytoremediation implies the 
use of natural or genetically engineered/modified 
plants, many a times, together with their associ- 
ated rhizospheric microbes which encourage 
plant growth and decontaminate the contami- 
nated soil in combination with the plants. Plants 
extract heavy metals, naturally occurring aromatic 
and hydrocarbon compounds, and anthropogenic 
chemicals (pesticides, herbicides, fungicides, 
industrial chemicals, etc.). This approach has 
been established to be relatively economical, 
efficient, and environment friendly. 

Plants survive by utilizing the resources available 
in their surrounding environment as they struggle 
for light, nutrients, and water. Phytoremediation 
involves understanding and exploiting specific 
plants in a best way to clean up the chemical imbal- 
ance created in the environment. The understand- 
ing of site’s physiochemical properties, climatic 
conditions, type of contamination, site-specific 
plants and associated microbes, etc., helps to 
develop an effective phytoremediation system. 
Phytoremediation has been successfully adopted 
for remediation of several contaminants around 
the globe. Phytoremediation covers a variety of 
methods/processes which can lead to degrada- 
tion, removal (through accumulation), or immo- 
bilization of the contaminant. 
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Fig.34.1 Various processes of phytoremediation 


Figure 34.1 depicts various processes involved 
in phytoremediation. The process of phytoreme- 
diation depends mainly on the type of plant and 
nature of the contaminant. Different processes 
involved in phytoremediation will be discussed 
individually along with their application in detail 
in this chapter. With each phytoremediation 
process/method, it is essential that the undesired 
transfer of contaminant to other media should be 
avoided. Phytoremediation is efficient in the 
removal or cleanup of different types of contami- 
nants such as petroleum hydrocarbons, heavy 
metals, pentachlorophenol (PCP), polychlorinated 
biphenyls (PCBs), polycyclic aromatic hydro- 
carbons (PAHs), etc. (Pivetz 2001). Simple 
plantation and/or vegetation is not sufficient for 
phytoremediation. Selection of appropriate plant 
Species, its resistance to the contaminant, and 
role of different amendments in plant growth 
play a key role in effective implementation of 
phytoremediation. 

Although thousands of plants have been studied 
by various researchers for their phytoremediation 
potential of various contaminants, a limited 
number of plants have shown significant results 
which can be applied on the field. For successful 
phytoremediation to occur, screening studies are 
an important aspect in selecting a suitable plant 
species (Vishnoi and Srivastava 2008). The results 
can be extrapolated from the data of hydroponic 
or greenhouse studies which can be then transferred 
to the field. Verification of applicability and 


Phytovolatalization 


Immobilization 






Hydraulic control 
Phytostabilization 





efficacy of different methods of phytoremediation 
is needed on site-specific manner which is required 
to establish a solid technology for remediation. 
Nevertheless, phytoremediation necessitates 
dedication of resources and time but has shown 
the possibility to present a cost-effective, envi- 
ronmentally sound alternative to conventional 
remediation technologies at suitable sites. 


34.2 Essential Plant’s Metabolic 
Processes 
for Phytoremediation 


Phytoremediation takes benefit of inherent pro- 
cesses of plants. These processes include uptake 
of water and chemicals, metabolism inside the 
plant, release of exudates into the soil, and bio- 
chemical and physical impacts of plant roots. 
Growth of plants mainly depends on photosyn- 
thesis, in which water and carbon dioxide are 
transformed into carbohydrates and oxygen by 
means of the energy from sunlight. The roots of 
plants are effective in extracting water held in 
soil and are also capable of water uptake even 
from negative osmotic potential through xylem. 
Transpiration is another process which occurs 
primarily at the stomata in which there is loss of 
water from plant into the atmosphere in the form 
of vapor. Carbon dioxide uptake from atmo- 
sphere takes place via stomata, along with release 
of oxygen. Respiration carbohydrates formed 
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through photosynthesis and production of ATP 
are essential for the active transport of nutrients 
and require oxygen. Plants need macronutrients 
(N, P, K, Ca, Mg, S) and micronutrients (B, Cl, 
Cu, Fe, Mn, Mo, Zn, Co, Ni, Se, Si, V) for optimum 
growth (Vishnoi and Srivastava 2008). Nutrient 
uptake pathways can extract contaminants which 
are similar in chemical behavior or form to the 
required nutrients. The uptake of contaminant by 
plants largely depends on the hydrophobicity 
(Vangronsveld et al. 2009), polarity, type of plant, 
age of contaminant, and physicochemical proper- 
ties of soil. Also, exudates released by plant roots 
along with the root-associated bacteria affect the 
uptake of various contaminants by the plant (Sun 
et al. 2010). 


34.3 Different Processes/ 
Techniques 
in Phytoremediation 


There are a number of different processes which 
will be discussed in this chapter along with their 
application. Table 34.1 briefs various processes/ 
techniques of phytoremediation along with the 
mechanisms involved and contaminants treated. 
Defining these processes is useful to elucidate 
and understand different mechanisms that may 
occur due to vegetation. Fate of the contaminant 
and necessary steps to be taken for effective 
phytoremediation must be defined. Different 


Table 34.1 Overview of phytoremediation 


Technique Mechanism 

Phyto/rhizodegradation Enzyme-catalyzed destruction 
of contaminants 

Phytoextraction Uptake and accumulation of 
contaminants in the plant 

Rhizofiltration Uptake/extraction of 
contaminants into the plant 
roots 

Phytostabilization Containment of contaminant 

Phytovolatilization Removal of contaminants by 


evapotranspiration 
Hydraulic control Degradation and/or 


containment of contaminant 
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types of contaminants and contaminated media 
may require different types of phytoremediation 
processes specific to the type of contamination. 
Table 34.2 highlights various plants used for 
phytoremediation of various contaminants by 
various researchers over the period of time. 
Various processes involved in phytoremediation 
are described below. 


34.3.1 Degradation 


Degradation is usually accounted for organic 
contaminants. It is the breakdown or transforma- 
tion of complex organic compound to simple and 
less toxic compounds. In phytoremediation, two 
types of degradation may occur depending upon 
the type of contaminant and plant: phytodegrada- 
tion and rhizodegradation. 


34.3.2 Phytodegradation 


The process of phytodegradation usually implies 
uptake, metabolism, and degradation of contami- 
nants inside the plant (Wenzel 2009). It may also 
happen in the soil, sediments, sludges, surface 
water, or groundwater by means of enzymes 
released by plant. Phytodegradation is a contami- 
nant destruction process which is also known 
as phytotransformation. It is reported that the 
degradation of organic pollutants is driven by 


Media Contaminant 
Soil, sediment, sludges, | Organic compounds: PAHs, 
groundwater pesticides, PCBs, chlorinated 


solvents, etc. 

Metals: As, Cr, Cd, Cu, Hg, 
Pd, Mn, Ni, etc. 

Metals: Cr, Cd, As, etc. 


Soil, sediment, sludges 


Groundwater, surface 
water 


Metals: As, Cr, Cd, Cu, Hg, 
Pd, Mn, Ni, Zn, etc. 
Chlorinated solvents and 
inorganics such as Se, Hg, etc. 


Soil, sediment, sludges, 
mine tailing 

Soil, sediment, sludges, 
groundwater 
Water-soluble organics and 
Inorganics 


Groundwater, surface 
water 
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Table 34.2 Plants used for phytoremediation of different contaminants 


Genus/species 
Eucalyptus urophylla X E. grandis Pb 
Medicago sativa 
Nephrolepis exaltata Hg 
Pteridium esculentum and Silene vulgaris 


Pteris vittata 


Raphanus sativus Cu 

Thlaspi caerulescens Cd and Zn 
Brassica Juncea Phorate 
Panicum virgatum and Populus deltoides PCBs 

S. plumbizincicola, E. splendens, PCBs, Cd, Cu 
and M. sativa 

Jatropha curcas As, Cr, Zn 
Medicago sativa L, Festuca arundinacea PAHs 


Schreb., and Vetiveria zizanioides 


metabolic processes of plants (Prasad and Freitas 
2003). Contaminants such as chlorinated solvents, 
herbicides, pesticides, and other organic contam- 
inants may be eliminated or reduced by employing 
phytodegradation process. 

In this process, plant metabolism contributes 
to the reduction of contaminant by means of different 
metabolic processes such as_ transformation, 
breakdown, or volatilization, and the simpler 
breakdown products then are incorporated into 
plant tissues. Certain enzymes in plants such as 
dehalogenases, oxygenases, and reductases are 
responsible for the breakdown of various pollutants 
into simpler forms (Ghosh and Singh 2005). 


34.3.3 Rhizodegradation 


Rhizodegradation is the breakdown of contami- 
nants through the influence of plant roots which 
ideally results in destruction of contaminant. 
Naturally occurring microorganisms such as bac- 
teria, fungi, and actinomycetes can breakdown 
the organic contaminants in soil into less toxic 
products or sometimes completely mineralize to 
carbon dioxide and water. Plant roots have an 
ability to enhance the number and variety of 
microbial populations in the rhizosphere which 
ultimately leads to enhanced degradation of con- 
taminant (Rani and Juwarkar 2012). Plant roots 


Contaminant 


Benzopyrene, PAEs, and PAHs 


As, Zn, and Cd 
As, Hg, Cs, and Sr 


Reference 

Peng et al. (2012) 

Sun et al. (201 1a), Fu et al. (2012) 
Chen et al. (2009) 

Robinson et al. (2006) 

Chen et al. (2009), Wang et al. (2002) 
Choudhary et al. (2009) 

McCutcheon and Schnoor (2003) 


Rani and Juwarkar (2012), Rani et al. 
(2012) 


Meggo et al. (2013) 
Wu et al. (2012) 


Kumar et al. (2008) 
Sun et al. (2011b), Nanekar et al. (2013) 


release exudates in the form of sugars, amino 
acids, organic acids, fatty acids, sterols, growth 
factors, nucleotides, flavanones, enzymes, etc., 
which increase the activity of the rhizospheric 
microflora subsequently resulting in increased 
contaminant biodegradation. Rhizodegradation 
primarily occurs in soil. Alteration in soil 
geochemical conditions such as pH takes place 
because of the release of exudates that may result 
in alteration in the transportation of contaminants 
(Pivetz 2001). The plant roots also positively 
affect the soil properties such as water content, 
water and nutrient transport, aeration, structure, 
temperature, etc., which facilitate more favorable 
conditions for soil microflora, besides the release 
of exudates. 


34.3.4 Phytoextraction 


Phytoextraction is one of the best processes for 
the removal of contaminants from soil which 
implies uptake of contaminant by plant roots with 
subsequent accumulation of the contaminant in 
the plant’s aboveground portion, which is usually 
followed by harvest and eventual disposal of 
plant biomass (Wenzel 2009). It is also called as 
phytoaccumulation. It is a process of contaminant 
removal. Phytoextraction is a process which 
generally applies to metals (e.g., Ag, Cd, Co, Cr, 
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Cu, Hg, Mn, Mo, Ni, Pb, Zn), metalloids (e.g., 
As, Se), radionuclides (e.g., 90 Sr, 137 Cs, 234 U, 
238 U), and nonmetals (e.g., B) (Salt et al. 1995; 
Pivetz 2001), because these contaminants in 
general are not further degraded within the 
plant. Phytoextraction/phytoaccumulation has 
also been referred as phytomining or biomining. 
A brief definition of phytomining is the utiliza- 
tion of plants to achieve an economic return from 
metals extracted by a plant. 

Phytoextraction usually has not been consid- 
ered for uptake of organic contaminants as these 
can be metabolized or volatilized by the plant and 
thus prevents the accumulation of contaminant 
within the plant. The uptake of a specific com- 
pound is determined by its lipophilicity described 
by its octanol water partition coefficient (K,,). 
Therefore, hydrophobicity of a contaminant 
increases with increase in K,,, and it has been 
studied that the compounds with K,,, value >3 are 
usually adsorbed by the roots and not subjected 
to uptake (Gaskin 2008; Imfeld et al. 2009). 
However, there have been few studies reporting 
the phytoaccumulation of organic contaminants. 
Researchers have shown interest in studying 
the metal-accumulating plants termed as hyper- 
accumulators which can accumulate a higher 
degree of metals for phytoremediation studies. 
The target medium in this process is usually soil; 
however, sediments, sludges, and groundwater 
can also be subjected to phytoextraction in 
conjunction with rhizofiltration. Zhang et al. 
(2009) reported reduction in exchangeable Cd in 
the planted soil after phytoextraction experiment 
using maize. Similarly, the decreased level of Cd 
in soil planted with maize was also documented 
(Mojiri 2011). 

In order to craft this process economically and 
environmentally feasible, the plants must accu- 
mulate large concentrations of contaminants 
(e.g., heavy metals) into their roots with its 
subsequent translocation to surface biomass 
followed by generation of large quantity of plant 
biomass. Factors such as growth rate, disease 
resistance, element selectivity, etc., are also 
imperative in phytoextraction (Ghosh and Singh 
2005). 


J.K. Sharma and A.A. Juwarkar 


34.3.5 Rhizofiltration 


Rhizofiltration is the process which implies the 
removal of contaminants by plant roots by pre- 
cipitating, concentrating, and absorbing the con- 
taminant (Juwarkar et al. 2010). This process 
usually applies to surface water, wastewater, or 
extracted groundwater either through adsorption 
or on the roots or through absorption into the 
roots. The roots of plants release secondary 
metabolites called as root exudates which may 
lead to the biogeochemical conditions resulting 
in precipitation of contaminants into the water 
body or on the roots. Depending on the type of 
contaminant, its concentration, and plant species, 
the contaminant may remain adsorbed on the 
root, absorbed into the roots, or translocated 
into the aboveground portion of the plant. 
Rhizofiltration can be carried out in situ to reme- 
diate contaminated surface water bodies or ex 
situ, in which the engineered system of tanks is 
used to hold the externally supplied contaminated 
water and the plants. Both the systems necessi- 
tate an understanding of the contaminant specia- 
tion and their interactions within the system. 
Rhizofiltration and phytoextraction are having 
similarity for accumulation of contaminants on 
or in the plant. While rhizofiltration accumulates 
contaminants in or on the roots with a portion of 
contaminant subsequent translocation into other 
parts of the plants, phytoextraction occurs above 
the ground and not in the roots. Additionally, 
rhizofiltration differs from phytoextraction as 
the contaminant is primarily in water, rather 
than in soil. In a hydroponic experiment carried 
out by Abhilash et al. (2009b), the potential of 
Limnocharis flava (L.) for phytofiltration of Cd in 
contaminated water with low concentrations of 
Cd was investigated. 


34.3.6 Phytostabilization 


Phytostabilization is the use of plants to restrain 
soil contaminants in situ, through alteration of 
the chemical, biological, and physical conditions 
in the soil. Phytostabilization is also known as 
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phytoimmobilization (Wenzel 2009). 
Phytostabilization can take place through 
sorption, precipitation, complexation, or metal 
valence reduction (Ghosh and Singh 2005). The 
plants’ key purpose is to reduce the amount of 
water percolating in the soil environment, which 
may lead to the formation of hazardous leachate 
and prevent erosion of soil to reduce the chances 
of distribution of hazardous contaminants to 
nearby area. The transport of contaminant in soil, 
sediments, or sludges can be reduced through 
absorption and accumulation in roots, adsorption 
on the roots, and complexation, precipitation, or 
metal valence reduction in soil. Further, the 
plantation reduces wind and water erosion of 
soil, thereby reduces spreading of the contami- 
nant because of runoff, and may also prevent 
leachate generation. Phytostabilization research 
has generally focused on metal contamination 
such as lead, chromium, and mercury which are 
being identified among top potential candidates 
for phytostabilization (Pivetz 2001). 

Alvarenga et al. (2009) studied the effect of 
the sewage sludge, organic residues, garden waste 
compost, and municipal solid waste compost on 
the phytostabilization of a metal-contaminated 
soil. Similarly, municipal solid waste was utilized 
for reclamation of coal mine spoil dump 
(Juwarkar and Singh 2007). 


34.3.7 Phytovolatilization 


Phytovolatilization employs an uptake of a 
contaminant by the plant and subsequent release 
of a volatile contaminant which is a volatile 
degradation product of original contaminant or a 
volatile form of a primarily nonvolatile contami- 
nant (Wenzel 2009; Juwarkar et al. 2010). 
Phytovolatilization takes place as growing trees 
or plants absorb water along with organic and 
inorganic contaminants. Phytovolatilization is 
mainly a contaminant removal process, in which 
the transfer of the contaminant takes place 
from the original matrix, 1.e., soil or groundwa- 
ter, to the atmosphere. The metabolic processes 
occurring in the plant are responsible for altera- 
tion of contaminant to less toxic form. In some 
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cases, the transfer of contaminant to the 
atmosphere enables much more efficient and 
effective natural degradation to occur, for example, 
photodegradation of the volatilized contami- 
nants. While phytovolatilization leads to transfer 
of contaminants into the atmosphere, a risk anal- 
ysis of this process on the ecosystem including 
humans becomes essential. Phytovolatilization 
has been mainly employed for the elimination 
of mercury, in which there is a transformation of 
mercuric ion into less toxic elemental mercury 
that takes place (Henry 2000). This process also 
depends on the type of contaminant, climatic 
condition, type of plant, and the soil type. 


34.3.8 Hydraulic Control 


Hydraulic control (or hydraulic plume control) is 
the use of plants to control the movement of 
groundwater and/or soil water, because of uptake 
and consumption of large quantity of water. 
Hydraulic control may influence and potentially 
hold the movement of a groundwater plume, 
lessen or prevent infiltration and leaching, and 
encourage upward flow of water from the water 
table (Pivetz 2001). Other phytoremediation 
processes, such as phytodegradation, rhizodegra- 
dation, and phytovolatilization, may take place 
as the contaminated water is brought to and 
into the plant. Water uptake by vegetation and 
transpiration rates is vital for hydraulic control 
and remediation of groundwater. This water 
uptake and the transpiration rate depend upon 
the species, age, mass, size, leaf surface area, 
and growth stage of the vegetation. They are also 
affected by various climatic factors, such as tem- 
perature, humidity, precipitation, etc. 


34.4 Things to Be Considered 
for Successful 
Phytoremediation 
Implementation 


Prior to initiating the phytoremediation for a 
contaminated site, it is essential to evaluate 
whether natural attenuation processes such as 
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biodegradation, photodegradation, dispersion, 
sorption, or volatilization would attain site- 
specific remediation in a particular time frame. If 
site-specific natural attenuation processes are 
capable of reducing the contaminant concentra- 
tion, then that site may not necessarily be the 
candidate for phytoremediation. 

Successful plant growth strongly depends on 
climatic conditions. Proper timing of precipita- 
tion, optimum amount of sunlight, shade, wind, 
proper air temperature, and growing season length 
are required to ensure optimum growth of plants. 
The study of site-specific plants to select a proper 
species can prove to be very beneficial. Soil 
amendment is a key requirement for the success- 
ful implementation of phytoremediation on con- 
taminated sites. The maintenance of proper pH, 
nutrients, and soil water content is an essential 
factor for plant growth. Amendments such as fly 
ash, pig manure, sewage sludge, etc., are effective 
in lowering the toxicity of contaminant in soil and 
offer a slow release of nutrients such as N, P, and 
K to support growth of plants. However, potential 
adverse effects of their addition must be consid- 
ered before they are added (Juwarkar and Singh 
2010). Also, the type and concentration of 
contaminant is an imperative aspect to be studied 
before implementation of phytoremediation. 


34.5 Advantages and Limitations 
of Phytoremediation 


Phytoremediation on the one hand has advantages 
over conventional remediation technologies, 
while on the other hand, it has few disadvantages. 
The advantages and disadvantages of phytoreme- 
diation are described below. 


34.5.1 Advantages 


e Phytoremediation is an in situ technique 
which limits environmental disturbance and 
reduces cost. 

¢ [tis particularly well studied for the treatment of 
large area of surface contamination especially 
where other methods may not be cost-effective. 
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e Phytoremediation involves exploiting the nat- 
ural ability of the environment to restore itself; 
therefore, it is a widely acceptable method for 
remediation of contaminants. 

e Plant sample can be used as indicators to know 
the extent of remediation or conversely 
contamination. 

e Plants help to contain the region of contamina- 
tion by removing water from soil, thereby 
keeping the contaminants from spreading or 
confining them within or near the root system. 

e Some wetland plants can transport oxygen to 
the rhizosphere that is available to microor- 
ganisms that allows aerobic degradation of a 
wide variety of organic contaminants. 

e Phytoremediation leads to the improvement of 
soil quality by improving soil structure. 

e Phytoremediation increases the porosity and 
therefore water infiltration, providing nutri- 
ents (nitrogen-fixing legumes) and accelerat- 
ing nutrient cycling, and increases soil organic 
carbon. 

e The use of plant in remediation efforts stabi- 
lizes the soil, thus preventing the soil erosion. 

e Phytoremediation eliminates secondary air 
or waterborne waste; an example is the 
accumulation of PAHs from the atmosphere. 

e It has the potential as a sink to CO, a known 
greenhouse gas that causes global warming. 

e Apart from above benefits, it helps in green- 
belt formation, thus improving the floral 
diversity. 


34.5.2 Limitations 


e Phytoremediation is slower than other ex situ 
remediation methods for the removal of 
contaminants from the environment. 

e Phytoremediation requires long period and 
several seasons for site cleanup. 

e Because it is slow, phytoremediation is not an 
appropriate method where the target contami- 
nant poses an immediate threat to human 
health or the environment. 

e Ifthe contaminant is bound tightly on the soil 
particles or organic matter, 1t may not be avail- 
able to plants for degradation. 
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e Environmental conditions like soil texture, 
pH, salinity, oxygen availability, temperature, 
and level of nonhydrocarbon contaminants 
such as metals must be within limits tolerable 
to plants. 


Recent Advancements 
in Phytoremediation 


34.6 


Researchers working on biological remediation 
of toxic contaminants have shown their faith 
in phytoremediation owing to its additional 
advantages such as enhanced efficiency and CO, 
sequestration capability. In recent times, consid- 
erable work has been going on throughout 
the globe to enhance the phytoremediation effi- 
ciency. Genetic engineering has been employed 
to produce transgenic plants with enhanced 
efficiency for phytoremediation since past few 
years. Microbe-assisted phytoremediation (MAP), 
in which the use of selected plant species and 
site-specific bacteria 1s employed for enhanced 
remediation potential, has also been proved to 
produce significant results. 


34.6.1 Microbe-Assisted 
Phytoremediation (MAP) 


The limitations of phytoremediation, predomi- 
nantly those associated with low efficiency and 
restrictions to the use of contaminant-containing 
biomass, could be dealt through novel biotechno- 
logical approaches that exploit recent advances in 
our understanding of interactions between plants 


Fig.34.2 Schematic 
representation of microbe- ( Plants 
assisted phytoremediation ae a 


Soil 


a 





and rhizospheric microorganisms and within 
plant tissues. Many plants employ interactions 
with rhizospheric microbes to survive in toxic 
and nutrient-restricted environments (Weyens 
et al. 2009; Abhilash et al. 2012). Plants secrete a 
wide range of chemicals in root exudates which 
usually help in the enhanced growth of rhizo- 
spheric microbes. These root exudates include 
organic acids, phenolic compounds, amino acids, 
etc., which play an important role in plant- 
microbe interactions (Tanimoto 2005; Crowley 
and Kraemer 2007). This process has been termed 
as microbe-assisted phytoremediation (MAP) 
technology which is a modern, very efficient, and 
successful phytoremediation technology for 
the remediation of degraded lands (Juwarkar 
and Singh 2010). Figure 34.2 shows schematic 
representation of MAP. This process mainly 
involves bioaugmentation with — site-specific 
microorganisms to promote faster degradation of 
the contaminant along with plant. It can be best 
applied at sites with comparatively shallow con- 
tamination of pollutants that are amenable to the 
processes such as phytotransformation, rhizosphere 
bioremediation, phytostabilization, phytoextraction, 
and/or rhizofiltration. MAP has been successfully 
implemented for restoration of different types of 
contaminated lands (Juwarkar 2012). 


34.6.2 Transgenic Plants 
in Phytoremediation 


Plant breeds having superior potential for phy- 
toremediation with elevated biomass production 
can be an option to improve phytoremediation. 


Chances of translocation of 
compounds to the plant or 
atmosphere are less 


Mineralization of the 
contaminant 
may occur 
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The productivity of plant is controlled by several 
genes. The development of proteomics, genomics, 
and metabolomics has contributed greatly to boost 
or manipulate the plant metabolism of several 
xenobiotic pollutants. Genetic engineering has 
potential to amplify the efficiency and use of 
phytoremediation, in which plants can be geneti- 
cally modified using specific bacterial, fungal, or 
plant genes which possess useful properties for 
contaminant degradation, uptake, and transfor- 
mation. Initially, the transgenic plants were used 
to improve crop productivity and reduce the 
use of pesticide. Genetic engineering techniques 
to incorporate more efficient hyperaccumulator 
gene into plants have been recommended by 
many authors (Chaney et al. 2000; Abhilash et al. 
2009a; Juwarkar 2012). Insertion of multiple 
genes such as cytochrome P450s, GSH, and GT 
for complete degradation of xenobiotics in 
plant is among the promising development in 
transgenic technology (Aken 2008; Doty 2008; 
Abhilash et al. 2009a). Another approach which 
can be adopted is the development of genetically 
engineered plants with the ability to release 
degradation-promoting enzymes in the rhizo- 
spheric zone. Genetically modified transgenic 
plants were first developed for phytoremediation 
to remediate heavy metal-contaminated soils, 
for example, Nicotiana tabacum expressing a 
yeast metallothionein gene for higher tolerance 
to cadmium or Arabidopsis thaliana overexpressing 
a mercuric ion reductase gene for higher tolerance 
to mercury (Abhilash et al. 2009a). 


34.7 Future Prospects 
and Conclusions 


Phytoremediation is a quite new and attractive 
technology which has emerged in recent years and 
offers practical advantages over traditional 
remediation technologies. It offers tremendous 
perspectives for the development of cost-effective 
and efficient processes for decontamination of a 
wide range of pollutants. It has proved to have 
high potential for degradation of organic pollutants 
in soil, sediments, and groundwaters. Also the 
capability of plants to hyperaccumulate metals 


J.K. Sharma and A.A. Juwarkar 


can be deployed in heavy metal-contaminated 
sites. As highlighted above, it offers several ways 
to get rid of contaminants in soil, sediments and/ 
or, contaminated water. It is a solar driven, cost- 
effective, and natural cleanup technology which 
works best even in shallow levels of contamina- 
tion. However, there are certain limitations in 
implementation of phytoremediation for which 
more research is required. Scientific community 
now posses a very good understanding of the bio- 
chemical processes involved in the detoxification 
of contaminants within plant cells. One of the 
most important aspects for researchers is to 
improve the efficiency of phytoremediation as it 
is essential to recognize the prices incurred in 
remediation process. After all, money matters! 
There are challenges which are required to be 
addressed especially when specific sites require 
customized approach for decontamination owing 
to the unique complexity of the location. The use 
of nanoparticles also can be proved effective in 
enhancing the efficiency of phytoremediation 
(Abhilash et al. 2012). Researchers have studied 
that CeO, and ZnO nanoparticles showed a positive 
impact on growth of root and shoot in edible 
plants like soybean, wheat, corn, and alfalfa 
(L6pez-Moreno et al. 2010a, b), suggesting that 
nanotechnology can significantly enhance the 
efficiency of phytoremediation. The use of trans- 
genic, MAPT, and nanotechnological approach 
can lead to the development of effective and 
efficient phytoremediation technology which can 
deal with a wide range of contaminants. Future 
interest will require receiving public and regula- 
tory acceptance for successful implementation of 
phytoremediation using transgenic plants. 
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M. Nagarajan, R. Rajesh Kumar, 
K. Meenakshi Sundaram, and M. Sundararaman 


Abstract 

Marine biotechnology explores marine organisms as a prolific source for 
attractive biomolecules with broad biotechnological applications. This 
review dictates the potentials from natural products of both free-living and 
associated forms of marine bacteria, actinobacteria, cyanobacteria, fungi, 
and algae towards pharmacological and industrial importance. In accor- 
dance with the pharmacological perspectives, secondary metabolites, 
explored during the recent decade of 2003-2013, with significant cytotox- 
icity on human and murine cancer cells, antimicrobial and antiprotozoal 
properties, inhibition of metabolic key enzymes/proteins, and anti- 
inflammatory and antioxidant activities were emphasized here. Recent 
studies (2010-2014) towards biofuel and antifouling in the deal of indus- 
trial applications were highlighted. Undoubtedly, marine animals, viz., 
sponges, corals, mollusks, prochordates, etc., are the potential source of 
bioactive natural products with wider applications, but extensive explora- 
tion for their valuable chemicals left them in the dangerous zone. In order 
to save the bioresources and extend the research in marine microbes and 
macroalgae, this review is confined within marine bacterial, fungal, and 
algal applications. But the tool of metagenomics is devoid of such limita- 
tions; metagenome sample from marine source is sufficient to construct 
metagenome library so as to investigate the potentials by employing either 
activity-based screening or sequence-based screening approaches. 
Furthermore, many of marine microbes and algae are actively engaged in 
genetic engineering studies in applied aspects. 
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35.1 Introduction 

Marine environment is a hidden treasure to be 
discovered a lot and biotechnology is one of the 
tools aiding to explore. Marine organisms 
adapted to survive in their environment by syn- 
thesizing unique chemicals in order to compete 
for nutrition, offense, and defense and manage 
varying physical and chemical factors. Such 
products can alternatively be used for our phar- 
macological and industrial needs. Among the 
potential genera identified for various biotechno- 
logical applications, the following marine forms 
are found to be promising candidates: marine 
bacteria, Bacillus, Erythrobacter, Halomonas, 
and Rapidithrix; marine actinobacteria, Nocardia, 
Streptomyces, Salinispora, etc.; marine cyano- 
bacteria, Lyngbya, Leptolyngbya, Oscillatoria, 


Symploca, Rivularia, etc.; marine fungi, 
Acremonium, Acrostalagmus, Aigialus, 
Apiospora, Arthrinium, Aspergillus, 
Chaetomium, Coniothyrium, Diaporthe, 
Emericella, Exserohilum, Fusarium, 
Gliocladium, Gymnascella, Metarhizium, 


Neosartorya, Penicillium, Phoma, Scytalidium, 
Spicaria, Trichoderma, Zygosporium, etc.; and 
marine algae, Ulva, Sargassum, Turbinaria, 
Laminaria, Padina, Rhodomela, etc. Bioactive 
metabolites produced by these organisms are per- 
taining to various chemical groups including 
short peptides (linear peptide, cyclic peptide, and 
cyclopeptide — head to tail linkage of amino 
acids/its derivatives; glycolipopeptide and 
depsipeptide — one or more amide bonds (-CO- 
NH-) are replaced by ester bonds (-CO-O-); pep- 
tolide — cyclodepsipeptide), terpenes (containing 
isoprene units; diterpene contains four isoprene 
units, meroterpene is a partial terpenoid, and ses- 
terterpene contains five isoprene units), terpenoid 
(in addition to isoprene units, it also bears 


other functional groups), azoles (five-membered 
heterocycle containing nitrogen; thiazole/thiazo- 
line are  sulfur-containing azoles), lactones 
(cyclic ester), macrolides (macrocyclic lactones), 
polyketides (containing alternate carbonyl and 
methylene (-CH2-) groups), fatty acid amide, 
quinones (even number of methine (=CH-) group 
in benzene ring is replaced by carbonyl (-C(=O)-) 
group to yield quinone; naphthoquinone, naph- 
thalene containing; anthraquinone, anthracene 
containing), heterocycles (cyclic ring made by at 
least one atom other than carbon), alkaloids 
(basic nitrogenous heterocyclic compounds), 
carboline (pyrido indole), quinoline (nitrogen- 
containing heterocycle), exopolysaccharides 
(polymer of one/more sugar residues), etc. Many 
of such marine metabolites display notable anti- 
neoplastic property by exerting cytotoxicity on 
various human and murine cancer cell lines 
(Pettit et al. 2009; Shao et al. 2010; Hu and 
Macmillan 2011; Balunas et al. 2012 ; Boudreau 
et al. 2012; Eamviarn et al. 2013; Huang et al. 
2012; Shaala et al. 2013; Wang et al. 2012). 
Certain enzymes/proteins are specifically respon- 
sible for pathogenicity and diseases and are tar- 
geted with novel drugs. A few of such drug 
targets including proteases, protein tyrosine 
phosphatases, p53/MDM2 interaction, SrtA, 
HDAC, voltage-gated channels, AchE, P450, 
etc., are effectively inhibited by marine-derived 
compounds (Wen et al. 2008; Pontius et al. 
2008b; Malloy et al. 2012a; Nunnery et al. 2012; 
Julianti et al. 2013; Liu et al. 2013; Salvador 
et al. 2013; Pavlik et al. 2013). 

Tuberculosis, candidiasis, flu, malaria, leish- 
maniasis, etc., are prevalent in India and often 
pose increased risk on human health. Developing 
resistance against the drugs is a characteristic of 
causative agents. This urges to develop new drugs 
for the treatment, and marine microbes provide a 
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plenty of scope as potential resource. These 
marine forms also render anti-inflammatory 
products to mitigate the host inflammatory 
response, since inflammatory disorders result in 
hypersensitive reaction, viz., allergies, autoim- 
mune diseases, etc. (Huang et al. 2007; Kong 
et al. 2010; Hwang et al. 2011; Choi et al. 2012). 
Drugs/toxins and normal cellular respiration will 
induce the production of reactive oxygen species 
(ROS) free radicals that damage cell membrane, 
DNA, and proteins. Antioxidants quench free 
radicals and prevent ROS-mediated cell injury. 
Specifically, marine algae are enriched with anti- 
oxidant metabolites/pigments and thus serve as a 
good biosource of antioxidants (Chen et al. 2011; 
Mohankumar et al. 2012; Wang et al. 2013a; Li 
et al. 2014). 

Large complex biofilm formation (biofouling) 
on the vessels in marine environment is annoying 
ship transport and a causative of huge economic 
loss for marine industries. Inorganic biocides not 
only prevent biofouling but also hurt harmless 
marine creatures. Demand of ecosafe antifou- 
lants can be met with potential marine products. 
Global demand for the fuel is exponentially 
increasing; biotechnology offers fuel production 
from biological sources. Both marine micro- and 
macroalgae are being used for the biofuel pro- 
duction (Qin et al. 2005; Kraan 2013; Merlin 
et al. 2013). 

Marine metagenomics offers the mining of 
novel bioactive products and industrially valu- 
able macromolecules from the uncultivable 
marine microbes and marine animals either 
through functional-based or sequence-based 
approaches (Donia et al. 2011; Felczykowska 
et al. 2012; He et al. 2012b, 2013b). For this pur- 
pose, metagenomic sample is sufficient rather 
than exploiting the biomass of the source organ- 
isms in a large quantity; thereby, bioresources 
will not be depleted. Moreover, genetic engineer- 
ing utilizes the marine microbes and algae for the 
research and industrial values (Filonov et al. 
2011; Montsant et al. 2004; Zaslavskaia et al. 
2001). Hence, this review focuses the products 
from marine bacteria, fungi, and algae with phar- 
macological and industrial values. 
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35.2. Pharmacological Potentials 


Natural products from marine microbes were 
determined to have various pharmacological val- 
ues including anticancer/cytotoxicity and inhibi- 
tion of specific drug target (enzymes/proteins), 
and antimicrobial, anti-inflammatory, and anti- 
oxidant properties are detailed. 


35.2.1 Cytotoxicity/Antineoplastic 
Activity 


Cytotoxicity of the marine microbial metabolites 
was observed on the various human cancer cell 
lines, viz., oral (KB), gastric (HM02, NUGC-3, 
GXF251L), hepatic (HepG2, SMMC-7721), 
breast (MWCF7, MDA-MB-231, MAXF401NL), 
lung (A549, NCI-H460, NCI-H23, A549, H460, 
HuCCA-1), non-small-cell lung cancer (NSCLC) 
(H1325, H2122, HCC44, HCC366, LXF529L), 
pancreas (BXPC-3, PAXFI1657L), CNS (SF- 
268), colon (KM20L2), prostate (DU-145, PC3), 
cervix (HeLa S3 cells, HeLa, Caski), ileocecal 
colorectal (HCTS8), colorectal (HT-29, HCT116), 
ovary (SK-OV, COC1), leukemia (CCRF-CEM), 
lymphoblastic leukemia (MOLI-3), lymphoma 
(U937), melanoma (MEXF462NL), _ renal 
(RXF486L, RXF 393), blood and bone marrow 
(promyelocytic leukemia HL60), etc., and also 
on certain murine cancer cells, viz., mouse neuro- 
2a blastoma cells, murine leukemia (P388), 
mouse melanoma (B16), mouse fibroblast 
(NIH-3T3), mouse lymphocytic leukemia 
(L1210), etc. (Table 35.1). 


35.2.1.1 Marine Bacterial Compounds 

Marine bacterial cytotoxins are mainly falling in 
the groups of peptides, azoles, and terpenoids. 
Some of the compounds with cytotoxic proper- 
ties at low concentration are discussed in detail. 
Pettit et al. (2009) have isolated two cyclodepsip- 
eptides, bacillistatins | and 2, from marine bacte- 
ria Bacillus silvestris, which are obtained from a 
Pacific Ocean (southern Chile) crab and are 
found to be potential antineoplastic agents that 
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strongly inhibit human cancer cell lines including 
BXPC-3 (pancreas), MCF-7 (breast), SF-268 
(CNS), NCI-H460 (lung), KM20L2 (colon), and 
DU-145 (prostate) and one murine cell line, P388 
(lymphocytic leukemia). Bacillistatin 
1(C57Ho6N¢6O01), cyclodepsipeptide, was consti- 
tuted with a head to tail linkage of three repeated 
sequence of R-valine, S-lactic acid, S-valine, and 
2R-hydroxy-3S-methylvaleric acid. Bacillistatin 
2 shares the same molecular weight with bacil- 
listatin 1 but slightly differs in one of its 
components; i.e., of the three 2R-hydroxy-3S- 
methylvaleric acid, one is_ replaced with 
2R-hydroxy-4S-methylvaleric acid. Difference in 
the position of methylation (fourth carbon in one 
of valeric acids in bacillistatin 2 and third carbon 
in compound 1) influenced slightly in the cyto- 
toxicity, 1.e., compound 2 is slightly more active. 
Erythrobacter sp., isolated from the mangrove 
sediment of Trinity Bay, was found to produce an 
unusual natural product with mixed biosynthetic 
origin, erythrazole B. The compound is very rare 
because of the presence of both benzothiazole and 
terpene groups. Its chemical components are gly- 
cine, tetrasubstituted benzothiazole, and 22 car- 
bons containing unusual diterpene terminated with 
carboxylic acid. This compound exerts potent 
cytotoxicity against non-small-cell lung cancer 
(NSCLC) cell lines (H1325, H2122, and HCC366) 
(Hu and Macmillan 2011). Likewise, Hu et al. 
(2012) isolated Erythrobacter sp. from marine 
sediment of Galveston, Texas. Cytotoxic erythro- 
lic acid D was purified from this bactertum, com- 
prised of 4-hydroxybenzoic acid and an unusual 
17 carbon-containing diterpene terminated with 
carboxylic acid. This compound was also found to 
exhibit cytotoxicity against NSCLC cell lines. 


35.2.1.2 Marine Actinobacterial 
Compounds 

The marine actinomycete Nocardia dassonvillei 
which was isolated from the Arctic Ocean with 
extreme variations in pressure, salinity, and tem- 
perature showed cancer cell cytotoxicity and 
antifungal activity (Gao et al. 2012). The novel 
natural product N-(2-hydroxypheny]l)-2- 
phenazinamine (NHP) (Fig. 35.1) isolated from 
N. dassonvillei is the key for cancer cell cytotoxicity 
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against HepG2, A549, HCT-116, and COCI cells 
and also for antifungal activity against Candida 
albicans. The extreme environmental conditions 
in the Arctic such as low temperature, UV radia- 
tion, and variations in sunlight provoke not only 
the habitual adaptation but also the production of 
potential chemical compounds with significant 
biological activities. Earlier studies revealed that 
the NHP molecules exhibit activity by permeat- 
ing the tissues and organs as well as affecting 
multiple pathways and/or multiple targets. 

Asolkar et al. (2006) reported the cytotoxic 
compounds such as daryamides A, B, and C and 
(2E,4E)-7-methylocta-2,4-dienoic acid amide 
from the marine actinomycete Streptomyces 
strain CNQ-085, which was isolated from a 
marine sediment collected at a depth of ca.50 m 
off San Diego, CA, in which daryamide A exhib- 
ited significant cytotoxicity against human colon 
carcinoma cell line (HCT-116). These com- 
pounds are related to the manumycins, a micro- 
bial natural product which was reported for its 
antimicrobial, cytotoxic, and other biological 
activity potentials. Manumycin and its analogs 
inhibit Ras farnesyl transferase and the growth of 
Ki-ras-activated murine fibrosarcoma in mice; 
hence it is anticipated that the daryamides may 
also have such activity. 

Mayamycin is a novel benz[aJanthracene 
derivative (Fig. 35.1) which was derived from 
the marine sponge Halichondria panicea asso- 
ciated with Streptomyces sp. strain HB202 
(Schneemann et al. 2010). The compound may- 
amycin data revealed that the chemical nature 
was an angucycline-type polyketide with a 
C-glycosidically bound amino sugar. Hence, 
the production of aromatic polyketides was 
demonstrated by genetic analysis and it con- 
firms the presence of type II polyketide syn- 
thase. This polyketide-derived  tetracyclic 
benz[a]anthraquinone holds a wide variety of 
biological activities such as_ antibacterial, 
enzyme inhibition, and cytostatic effects. 
Therefore, mayamycin was tested for its bio- 
logical significance, and it exhibits an excellent 
cytotoxicity against different human cell lines 
as well as pathogenic bacterial strains including 
antibiotic-resistant strains. 
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(6S,3S)-6-isobutyl-3-methyl-2,5-diketopiperazine 


(Cho et al., 2012a) 





N-(2-hydroxyphenyl)-2-phenazinamine(NHP) 
(Gao et al., 2012) 


OH 
HOOC O~o 
O CH, 
CH; CH, 
Citrinin (Subramani et al.,2013) 





HN OH 


Mayamycin (Schneemann et al., 2010) 





Heronapyrrole A (1) R= Me 
Heronapyrrole B (2) R=H 


OH 


Heronapyrroles A-C (1-3) 


Fig.35.1 Marine actinobacterial metabolites 


Motohashi et al. (2010) isolated the two new 
anthracyclines, namely, tetracenoquinocin (1) 
and 5-iminoaranciamycin (2), in addition to the 
known compounds aranciamycin (3) and antibi- 
otic SM 173B from the marine sponge Haliclona 


O 


HO 







Heronapyrrole C (3) 


(Raju et al., 2010) 


sp. associated with Streptomyces _ sp. 
Sp080513GE-26. The compounds 1-3 were 
checked for its cytotoxic potential against human 
cervical carcinoma HeLa cells and human 
acute myelogenous leukemia LH-60 cells. 
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Comparatively, compound 3 (aranciamycin) 
exhibits potential inhibition than the compounds 
1 and 2 with ICs, values of 2.7 and 4.1 uM against 
HeLa and HL-60 cells, respectively. The 
functional group ketone at C-5 in the com- 
pounds | and 3 is majorly responsible for 
cytotoxicity. 

Lajollamycin, a nitro-tetraene spiro-/-lactone- 
y-lactam antibiotic, was isolated from the marine 
Streptomyces nodosus (NPSO07994) which is 
collected from Scripps Canyon, La_ Jolla, 
California, by Manam et al. (2005). It inhibits the 
tumor cell B16-F10 and the growth of both drug- 
sensitive and drug-resistant Gram-positive 
bacteria. 


35.2.1.3. Marine Cyanobacterial 
Compounds 

Intriguingly, marine cyanobacteria produce sec- 
ondary metabolites that collectively contain both 
non-ribosomal peptide and polyketide moieties. 
The non-ribosomal peptide part is biosynthesized 
by non-ribosomal peptide synthetase mainly with 
non-proteinogenic and/or modified amino acids, 
while the polyketide part is made by polyketide 
synthase with unusual hydroxylated fatty acid. 
Hence, these metabolites are said to be of mixed 
biosynthetic origin, having pronounced bioactiv- 
ity. Some of such cytotoxins isolated very 
recently during 2011-2013 are alone described 
here. Nagarajan et al. (2012) have already elabo- 
rated about the bioactive natural products iso- 
lated from marine’ cyanobacteria during 
2000-2011. 

Apratoxins are potent cytotoxic cyclodepsip- 
eptides isolated from marine cyanobacteria 
Lyngbya. Apratoxins A—-G were identified during 
the period of 2001-2010 and their structural and 
pharmacological characteristics were described 
by Nagarajan et al. (2012). Recently, Thornburg 
et al. (2013) have isolated apratoxin H and apra- 
toxin A sulfoxide from marine cyanobacterium 
Moorea producens (Lyngbya majuscula), col- 
lected from the Nabq Mangroves in the Gulf of 
Aqaba (Red Sea). Apratoxin H (C4¢H7,NsOxS) is 
made up of head to tail linkage of pipecolic acid, 
N-methyl-isoleucine, N-methyl-alanine, 
O-methyl tyrosine, modified cysteine, and dihy- 
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droxylated fatty acid moiety 3,7-dihydroxy-?2, 
5,8, 8-tetramethylnonanoic acid (Dtena). 
Apratoxin H differs from A sulfoxide by the 
presence of pipecolic acid in place of proline 
residue. Sulfur in the modified cysteine residue 
of apratoxin A undergone oxidation to yield apra- 
toxin A sulfoxide (C4s;H¢9N;OoS). Apratoxin H 
seems more potent than apratoxin A sulfoxide 
against lung cancer cells (NCI-H460) (Fig. 35.2). 

Grassypeptolides D and E (Fig. 35.2) are yet 
other potent cytotoxic cyclic depsipeptides iso- 
lated from marine cyanobacterium Leptolyngbya 
sp. that were collected from a shipwreck in the 
Red Sea. Both compounds possess the same 
molecular formula (C5;7Hsg;NoO19S2) and the same 
components, viz., B-amino acid residue 2-methyl- 
3-aminobutyric acid (Maba), threonine, N-methyl] 
leucine, 2-aminobutyric acid-derived thiazoline 
carboxylic acid (Aba-thn-ca) residue, 
N-methylphenylalanine-derived 2-methylthiazoline 
carboxylic acid (N-Me-Phe-4-Me-thn-ca), pro- 
line, N-methyl valine, and phenyllactic acid (Pla). 
They differ in the orientation of carbon (7th) in 
threonine and carbon (11th) in N-methyl leucine 
(Thornburg et al. 2011). 

Strong cytotoxic cyclodepsipeptide lagun- 
amide C (Cy,H73N5O,9) was isolated from L. 
majuscula from the western lagoon of Pulau 
Hantu Besar, Singapore. Head to tail-linked 
(-(N-methyl-alanine)-isoleucine-(N-methyl- 
glycine)-(N-methyl-phenylalanine)-alanine- 
(2-hydroxyisoleucic acid (Hila))-5,8- dihydroxy- 
2,6,9-trimethyl-undec-2-enoic acid (Dtuea)-) 
constitute this cyclodepsipeptide (Fig. 35.2). 
Hydroxy acid Hila and a polyketide (fatty acid) 
moiety Dtuea along with methylated and non- 
methylated amino acids made this compound to 
potently inhibit a variety of cancer cells such as 
P388, A549, PC3, HCT8, and SK-OV3 (Tripathi 
et al. 2011). 

Boudreau et al. (2012) have isolated cyclodep- 
sipeptide viequeamide A (Cy H69N;O;9) from 
marine button cyanobacterium Rivularia sp., col- 
lected near the island of Vieques, Puerto Rico. 
The stereochemistry of this cytotoxin was devised 
as Thr, N-Me-Val, Pro, 2-hydroxy-3- 
methylpentanoic acid (Hmpa), N-Me-Val, Val, 
2,2-dimethyl-3-hydroxy-7-octynoic acid (Dhoya) 
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Fig.35.2 Marine cyanobacterial metabolites 


(Fig. 35.2). The presence of polar amino acid 
threonine may be a factor for the enhanced activ- 
ity of this compound towards H460 human lung 
cancer cells. 
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Viequeamide A (Boudreau et al., 2012) 


Active-guided fractionation of — the 
EtOAc:MeOH (1:1) extract of Symploca cf. hyd- 
noides which was collected from Cetti Bay, 
Guam, yielded veraguamide A (C37H5)N,OgBr). 
This cytotoxic cyclic hexadepsipeptide is made 
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up of N-Me-Val, Pro, Hmpa, N-Me-Val, Val, 
8-bromo-3-hydroxy-2-methyloct-7-ynoic — acid 
(Br-Hmoya) (Fig. 35.2). Salvador et al. (2011) 
have suggested that the alkynyl bromide 
(Br-Hmoya) moiety of veraguamide A may be 
responsible for its strong toxicity towards the 
lung cancer cell line H-460. 

Anticancer peptides bisebromoamide and nor- 
bisebromoamide have been isolated by Sasaki 
et al. (2011) from Lyngbya sp., which were col- 
lected at the reef of Bise, Okinawa, Japan. The 
structural elements of —bisebromoamide 
(C5,;H7.”BrN7O,S) are 2-(1-oxo-propyl) pyrro- 
lidine (Opp), N-Me-Phe, Leu, 4-methyl thiazo- 
line (4-Me-Tzn), 4-Me-Pro, N-methyl-3- 
bromotyrosine (N-Me-Br-Tyr), Ala, and pivalic 
acid (tert-butyl containing carboxylic acid). 
Norbisebromoamide (C5,)H7)BrN7OgS) resembles 
bisebromoamide in the chemical composition 
except demethylation in thiazoline residue. Both 
of them are toxic to cervix cancer cells (HeLa 
S3). The key and unusual structural feature and 
elements are the fusion of thiazoline with proline, 
2-Opp, and N-Me-Br-Tyr. Thus, the above cyclic 
peptides with modified amino acids, hydroxy 
acids, and modified fatty acids render potent 
cytotoxicity against human cancer cell lines. 


35.2.1.4 Marine-Derived Fungal 
Compounds 

Amazingly, marine fungi are the fluent producers 
of bioactive compounds with very diverse chem- 
istry including macrocycles; heterocycles; methyl 
guanine with isoprene; derivatives of azole, ben- 
zoyl, benzaldehyde, and tyrosine; halogenated 
polyketides; pentacyclic alkaloids; cytochalasins; 
polyphenols; quinines; indole diketopiperazines; 
peptides; lactones; etc. 

Boot et al. (2007) have isolated linear pen- 
tadecapeptides from marine-derived fungus 
Acremonium sp. Efrapeptin Ea (Cg9Hy4,N 1:0 16) 
is composed of 15 normal/modified amino acids 
such as acylated pipecolic acid (Ac Pip), amino- 
isobutyric acid (Aib), Pip, isovaline (Iva), Aib, 
Leu, B-Ala, Gly, Aib, Aib, Pip, Aib, Ala, and Leu 
with a cap of (C,3H»5N3) bicyclic amine. 
Efrapeptin F (Cg9Hj4,N;gO0 16") 1s constituted with 
Ac-Pip, Aib, Pip, Aib, Aib, Leu, B-Ala, Gly, Aib, 


699 


Aib, Pib, Aib, Ala, Leu, and Iva with cap. 
Likewise, efrapeptin G (Cg3Hj43N;sO016") 1s made 
up of Ac-Pip, Aib, Pip, Iva, Aib, Leu, B-Ala, Gly, 
Aib, Aib, Pip, Aib, Ala, Leu, and Iva with cap. 
These linear peptides were found to have antican- 
cer activity at nanomolar concentration. 

Marine-derived fungus Zygosporium masonii 
was found to produce cytotoxic cyclic pentadep- 
sipeptide zygosporamide (C36H5)N4O,), com- 
prised of L-Phe, L-Leu, D-Leu, L-Phe, and 
O-Leu. Presence of one D-amino acid (leucine) 
has significantly enhanced anticancer activity 
against colon cancer cell line (HT-29) (Oh et al. 
2006). 

Two cytotoxic indole diketopiperazines 
(luteoalbusins A and B) were identified from 
deep sea-derived fungus Acrostalagmus luteoal- 
bus SCSIO F457. Luteoalbusin A (Cj3H5)N,0;S;) 
is composed of 2,3-disubstituted indole, 
3-substituted indole, and thiodiketopiperazine 
(two ketones, two quaternary carbons (sulfated), 
and two nitrogens). Luteoalbusin B 
(C33H2)N4O3S3) additionally possesses one sulfur 
atom. They display significant cytotoxic activity 
against SF-268, MCF-7, NCI-H460, and HepG-2 
cells (Wang et al. 2012). 

Besides the above fungal compounds, ophi- 
obolin K, luteoalbusins A and B, ustusorane E, 
trichodermamide B, vermelhotin, epoxyphoma- 
lin A and B, and aspergiolide A exhibit potent 
anticancerous activity against a broad spectrum 
of human cancer cell lines and murine cancer 
cells. 


35.2.2 Inhibition of Enzymes/ 
Proteins 


Enzymes and proteins are employed specifically 
in various cellular functions including DNA rep- 
lication, cell division, gene expression, metabo- 
lism, signal transduction, transport, immunity, 
pathogenicity, etc. By regulating the activity of 
specific proteins/enzymes, it will be able to con- 
trol the particular cellular function. In pharma- 
cology, various protein/enzyme inhibitors are 
used against specific drug targets to treat cancer, 
microbial infections, and neurological disorder, 
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regulate host immune system, etc. Such drug tar- 
gets, associated problems, and its inhibitors from 
marine microbes’ are discussed below 
(Table 35.2). 


35.2.2.1 Neutrophil Elastase 
Inflammation is an immune response to protect 
tissues from injury/damage and it employs neu- 
trophils (leukocytes). Activation of neutrophils 
releases a variety of cytotoxic products including 
serine proteases like elastase which digest elastin 
and many other proteins of invading microbes. 
Excessive release of elastase due to chronic 
inflammation additionally damages the host tis- 
sues especially the lung tissues that results in 
various disorders, viz., emphysema, inflamma- 
tory chronic lung diseases, chronic obstructive 
pulmonary disease, acute lung injury, acute respi- 
ratory distress syndrome, and cystic fibrosis. 
Hence, neutrophil elastase is a therapeutic target 
and extensive host tissue damage can be prevented 
by administration of elastase inhibitors (Korkmaz 
et al. 2010). Porcine pancreatic elastase is often 
used as model for human leukocyte elastase. 
Secondary metabolites like Symplostatins 5-10 
from marine cyanobacterium Symploca sp. and 
conioimide, cereolactam, and cereoaldomine 
from marine fungus Coniothyrium cereale are 
potent elastase inhibitors (Elsebai et al. 2011, 
2012; Salvador et al. 2013). 


35.2.2.2 Protein Tyrosine 
Phosphatase (PTP) 

Secreted Mycobacterium tuberculosis (MTB) 
PTPA binds to macrophage vacuolar-(H+)- 
ATPase (V-ATPase) located in the membrane. 
V-ATPase is involved in the acidification of the 
lumen and coordinates phagosome-lysosome 
fusion. Inhibition of V-ATPase, in turn, evades 
MTB from the digestion within macrophage 
(Wong et al. 2011). Secreted M. tuberculosis 
PTPB manipulates host signal transduction path- 
ways and poses infection (Grundner et al. 2005). 
Some PTPs are positively signaling the growth 
factor receptors for tumor formation (oncogenic) 
(Ostman et al. 2006). Hence, PTP is found to be 
a therapeutic target, and its inhibition can cure 
tuberculosis and certain cancers. Asperterpenoid 
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A, sydowiols A and C, and aquastatin A were 
found to be inhibiting PTP (Huang et al. 2013; 
Liu et al. 2013; Seo et al. 2009). 


35.2.2.3. p53/MDM2 Interaction 

p53, a tumor-suppressor protein, arrests cell 
cycle at G2,/S phase in response to DNA dam- 
age, oxidative stress, oncogene expression, etc., 
induces apoptosis, and eradicates tumor/cancer 
growth. But p53 is exported from the nucleus to 
cytosol by MDM2 (E3 ubiquitin ligase) enzyme, 
where p53 is conjugated with ubiquitin that, in 
turn, results in proteasomal degradation of p53 
(Chene 2003). Inhibition of p53/MDM2 interac- 
tion with natural product would be an attractive 
strategy to activate p53 and check tumor/cancer 
growth. Hoiamide D from marine cyanobacte- 
rium Symploca sp. was determined to effectively 
inhibit p53/MDM2 interaction (Malloy et al. 
2012a). 


35.2.2.4 Sortase (Srt) 

It is a prokaryotic signal peptidase, cleaving the 
carboxy terminal sorting signal (signal for pro- 
tein transportation) of cell surface proteins and 
trafficking and covalently attaching them to the 
cell wall. These surface proteins are implicated in 
adhesion, colonization, infection, host immune 
evasion, etc. By inhibiting the sortase enzyme, 
the transportation of surface proteins will be 
impaired and pathogenicity of the microbe can be 
muted. The enzyme sortase A (SrtA) from S. 
aureus was inhibited by marine-derived fungal 
metabolites herqueidiketal and chrysoarticulin C 
(Jeon et al. 2013; Julianti et al. 2013). 


35.2.2.5 Histone Deacetylase (HDAC) 

Histone deacetylase is an anticancer drug target. 
Histone acetyl transferase acetylates the core his- 
tone of nucleosome to allow the gene expression 
through transcription factors. HDAC removes 
acetyl groups from the histones and silences the 
gene expression. Inhibition of HDAC leads to 
hyperacetylation of histones; in addition certain 
transcription factors are also acetylated. This, in 
turn, upregulates certain transfactors like tumor- 
suppressor p53 that arrest cell cycle and prevent 
cancer will get activated. Santacruzamate A from 
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Symploca sp. and speradine A from Aspergillus 
tamari displayed HDAC inhibition (Pavlik et al. 
2013; Tsuda et al. 2003). 


35.2.2.6 Sodium and Calcium 
Channels 

They are transmembrane proteins maintaining 
the membrane potentials. They send Na* and Ca** 
ions inside the cell, respectively, to depolarize 
(more positive or less negative inside the cell) the 
membrane to achieve action potential. This is 
required in many excitable animal cells (neurons, 
myocytes, and endocrine cells) for cell-cell com- 
munication and to activate various signal trans- 
duction pathways. Clinically, sodium channel 
blockers are used in various treatments especially 
as analgesic (pain reliever) (Wood et al. 2004), 
and calcium channel blockers are used to cure 
hypertension. Hermitamides A and B, jan- 
thielamide A, and kimbeamide A are shown to 
block voltage-gated sodium channels (De 
Oliveira et al. 2011; Nunnery et al. 2012); inhibi- 
tion of calcium oscillations was observed in cred- 
neramides A and B and ethyl tumonoate A-treated 
neurons (Engene et al. 2011; Malloy et al. 2012b). 


35.2.2.7. Acetylcholine Esterase 
(AchE) 

Acetylcholine (Ach) is a neurotransmitter trans- 
mitted from neuron to target cell across the nerve 
synapse, and it binds with acetylcholine receptor 
in nerve-nerve junction and neuromuscular junc- 
tion of the signal receiver that activates signal 
transduction pathways in signal-receiving neuron 
or muscle cells. AchE located on the synaptic 
cleft hydrolyses acetylcholine and_ thereby 
releases choline from Ach and reduces its con- 
centration at the synaptic cleft. This, in turn, 
releases Ach from Ach receptor, terminates sig- 
nal transmission, and allows the Ach receptor to 
bind with another Ach during transmission. 
When AchE is inhibited, Ach accumulates at the 
nerve synapse and continuously transmits the 
signal (Pohanka 2012). Irreversible inhibitors 
lead to paralysis, convulsion, and _ death. 
Reversible inhibitors for short duration will be 
used for the treatment of various neurological 
diseases. AchE was inhibited by sorbiterrin 
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A and paeciloxanthone (Chen et al. 2012a; Wen 
et al. 2008). 


35.2.2.8 Cytochrome P450 

This monooxygenase family is involved in phase 
I xenobiotic/drug metabolism and it detoxifies or 
reduces the toxicity of the drugs/xenobiotics/ 
lipophilic compounds by oxidation. Inhibition of 
cytochrome P450 leads to oxidative stress that, in 
turn, results in cell death. Monodictyochromes A 
and B, monodictysin B and C, and monodictyx- 
anthone from marine fungus Monodictys putredi- 
nis were found to inhibit cytochrome P450 1A 
(Krick et al. 2007; Pontius et al. 2008b). 


35.2.3 Antimicrobial Compounds 


Marine bacterial and fungal natural products that 
belong to carboline, quinoline, lactones, pep- 
tides, macrolactones, diketopiperazines, fatty 
acids, polyketides, terpenes, alkaloid, xanthones, 
etc., are found to have potent antimicrobial prop- 
erties against human bacterial, viral, and fungal 
pathogens and protozoal parasites. Notably, piti- 
peptolide F (against Mycobacterium tuberculo- 
sis); brevianamide S (against M. bovis BCG); 
trichoderins A, Al, and B (against M. smegmatis, 
M. bovis BCG, and M. tuberculosis H37Rv); 
l-acetyl-beta-carboline (against methicillin- 
resistant Staphylococcus aureus (MRSA); bacil- 
listatins 1 and 2 (against penicillin-resistant 
Streptococcus pneumoniae and _ multidrug- 
resistant $. pneumoniae); diketopiperazines 
(against Vibrio anguillarum); asperterrestide A 
(against influenza virus. strains A/WSN/33 
(HIN1) and A/Hong’ Kong/8/68 (H3N2)); 
isoaspulvinone E, aspulvinone E, and pulvic acid 
(against influenza A HINI1 virus); gageomacro- 
lactins 1-3 (against pathogenic fungi Aspergillus 
niger, Botrytis cinerea, Colletotrichum acutatum, 
Candida albicans, and Rhizoctonia solani); coi- 
bacins A-D (against Leishmania donovani); lagu- 
namide C (against Plasmodium falciparum); 
nodulisporacid A and vermelhotin (against P. fal- 
ciparum strain 94); chaetoxanthones B and C 
(against P. falciparum and Trypanosoma cruzi); 
hypothemycin and 4-O-demethylhypothemycin 
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(against P. falciparum K1); N-(2-hydroxypheny]l)- 
2-phenazinamine (NHP) (against C. albicans); 
daryamides A and B (against C. albicans); 
mayamycin (benz[a]anthracene) (against 
Bacillus subtilis, Brevibacterium epidermidis, 
Dermabacter hominis, Klebsiella pneumoniae, 
Propionibacterium acnes, Pseudomonas aerugi- 
nosa, Staphylococcus aureus, Staphylococcus 
epidermidis, Staphylococcus lentus, 
Xanthomonas campestris, Phytophthora infes- 
tans); heronapyrrole A, heronapyrrole B, and 
heronapyrrole C (pyrroloterpenes) (against S. 
aureus and B. subtilis); salinosporamide A 
(against P. falciparum and P. yoelii); citrinin 
(Figure 35.1) (against methicillin-resistant S. 
aureus, rifampicin-resistant S. aureus, S. aureus, 
Enterococcus faecium, and Cryptococcus neofor- 
mans); etc., were some of the recently explored 
antimicrobial compounds with significant bioac- 
tivity (Asolkar et al. 2006; Balunas et al. 2012; 
Gao et al. 2012, 2013; He et al. 2013a; Lee et al. 
2013; Montaser et al. 2011; Pettit et al. 2009; 
Prudhomme et al. 2008; Pruksakorn et al. 2010; 
Raju et al. 2010; Song et al. 2012; Schneemann 
et al. 2010; Subramani et al. 2013; Tareq et al. 
2013; Tripathi et al. 2011) (Table 35.3). 


35.2.4 Anti-inflammatory 
Compounds 


Foods and nutrients in their isolated or combined 
forms are helpful in inhibiting the inflammatory 
diseases such as allergy or autoimmune diseases 
due to their synergistic effects in human physiol- 
ogy. Apart from dietary materials, the seaweeds 
are also used as gelling agents, stabilizers in food 
preparations, and pharmaceutical materials 
(Mohamed et al. 2012). Antioxidants often pre- 
vent inflammatory processes in the cell 
(Grivennikov and Karin 2010). Brown seaweeds 
seem to be better than red and green ones due to 
their higher dietary and pharmacological valu- 
ables (Kraan 2012). Seaweeds are full of hydro- 
soluble and liposoluble vitamins, viz., thiamine, 
riboflavin, B-carotene, and tocopherols, and also 
long-chain polyunsaturated essential fatty acids 
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including n-3 family (n-3 LC-PUFA) such as 
eicosapentaenoic acid (EPA, 20:5n-3), which 
may act against heart diseases, thrombosis, ath- 
erosclerosis, and carcinogenesis (Johns and 
Nichols 1979; Khotimchenko et al. 2002). The 
major portion of the seaweed extracts contains 
essential sterols such as fucosterols and choles- 
terols from brown algae; green algae are contain- 
ing a wide variety of sterol family than browns 
due to their inhabitant properties. The phenolic 
components, namely, sargahydroquinoic acid, 
sargachromanol E and D, and polyunsaturated 
fatty acids such as palmitic acid, palmitoleic acid, 
elaidic acid, linoleic acid, linolenic acid, arachi- 
donic acid, and cis-5, 8,11,14,17-eicosanoic acid 
that are found in a variety of seaweeds play a sig- 
nificant role in anticancer and anti-inflammatory 
activities (Kong et al. 2010). Berge et al. (2002) 
reported the sulfolipids of Porphyridium cruen- 
tum rich in palmitic acid, arachidonic acid (C20:4 
omega-6), eicosapentaenoic (C20:5 omega-3) 
acids, and 16:1n-9 fatty acid-inhibited DNA 
alpha-polymerase and with its anti-inflammatory 
activity and antiproliferative activities against 
human colon adenocarcinoma DLD1 cells. The 
ubiquitin-proteasome pathway (UPP) is a major 
nonlysosomal system responsible for intracellu- 
lar protein degradation in eukaryotic cells, and 
inhibition of NF-«B is a crucial step in anti- 
inflammatory and cellular growth arrest (Chen 
and Dou 2010). Hwang et al. (2011) showed that 
the anti-inflammatory and antiproliferative activ- 
ity of sulfated polysaccharide from Sargassum 
hemiphyllum could be a possible reduction in 
cytosol/nuclear factor-kappa B (NF-«B) p65 
expression by inhibiting IL-1B, IL-6, TNF-R, 
NO, iNOS, and cyclooxygenase-2 (COX-2). 
Activated NF-«B, a well-known transcription 
factor, leads to the development, differentiation, 
and proliferation of T and B lymphocytes and the 
secretion of cytokines by the classical pathway. 
Pheophytins of Cladophora fascicularis are 
reported to be the promising drugs by inhibiting 
proteasome ChI-L and NF-«B for anti- 
inflammation and _ antiproliferative activity 
(Huang et al. 2007). Denise et al. (2011) studied 
the role of aquamin, a well-known nutrient sup- 
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plement, from red algae Lithothamnion corallioi- 
des in activating NF-KB in RAW 264.7 cell lines. 
Their study revealed that aquamin inhibited the 
phosphorylation and degradation of IkB. Aquamin 
also inhibited COX-2 gene expression and thus 
rendered NF-«B inhibition. 

C-reactive protein (CRP) is one of the acute- 
phase proteins involved in inflammation. The 
circulating CRP is synthesized and secreted pre- 
dominantly by hepatocytes in response to proin- 
flammatory cytokines such as tumor necrosis 
factor (TNF), interleukin-1, and interleukin-6; 
obesity and smoking are the main factors for the 
activation of the above cytokines, and the natu- 
ral dietary substances such as seaweeds are of 
great interest in inhibiting these factors (Pepys 
and Hirschfield 2003). Particularly, Nanri et al. 
(2008) found that Japanese men and women 
were less prone to the sustainability of the life- 
threatening diseases than western people due to 
inhibition of C-reactive protein substances by 
seaweed extracts that are routinely used in their 
food recipe. Marine-derived microbial second- 
ary metabolites open new door for novel drug 
discovery. The cyclomarins are well known for 
their anti-inflammatory effects from marine 
benthic bacteria, especially from Streptomyces 
sp. with novel cyclic heptapeptides containing 
four unusual amino acids. Marine-derived 
cyclomarin A, with its naval amino acid 
2-amino-3,5-dimethylhex-4-enoic acid, acts as 
a potent anti-inflammatory agent and displayed 
almost 92 % reduction in phorbol ester (PMA)- 
induced mouse ear edema experiment at 50 pg/ 
ear; indomethacin showed only 72 % of such 
activity (Renner et al. 1999). Pyrenocine A 
(PA), isolated from marine Penicillium paxilli 
Ma(G)K, suppressed LPS-induced macrophage 
activation on both pre- and posttreatment meth- 
ods. PA inhibited nitrite and PGE2 production. 
It also affected Mac-1 cell migration and co- 
stimulatory molecules on lymphocyte activation 
through MyD88-dependent intracellular signal- 
ing pathway (Toledo et al. 2014). They also 
reported that pyrenocine A inhibited NF-«B- 
mediated signal transduction on macrophages 
stimulated by LPS. 
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Marine cyanobacterial metabolites bearing 
halogenated carboxylic acid moiety depict anti- 
inflammatory properties. In the recent study, 
Choi et al. (2012) reported that honaucins A—C 
isolated from marine cyanobacterium 
Leptolyngbya_ crossbyana inhibit the LPS- 
induced nitric oxide (NO) production from 
murine macrophage RAW264.7 that, in turn, 
downregulates the expression of proinflamma- 
tory cytokines’ especially interleukin-1f. 
Honaucin A constitutes 3-hydroxy-y- 
butyrolactone condensed with 4-chlorocrotonic 
acid, while honaucins B and C_ comprise 
4-chlorocrotonic acid with 3,4-di-O-substituted 
butanoic acid. They inhibit NO production at 
ICs9s of 4.0, 4.5, and 7.8 uM respectively. Thus, 
marine extracts and metabolites possess notice- 
able anti-inflammatory activities. 


35.2.5 Antioxidants 


Extracellular polysaccharides from marine 
microbes were unveiled to have antioxidant prop- 
erties. Marine fungi are also serving as one of the 
sources for antioxidants. Extracellular polysac- 
charides (EPS) designated as YSS (containing 
backbone (1—2)-linked o-mannopyranose, 
(1—6)-linked o-mannopyranose and_ branch 
B-galactofuranose, (1-—6)-linked 
Q-mannopyranose) from marine fungus 
Aspergillus terreus (Wang et al. 2013a), AVP 
(containing (1—6)-linked oa-D-glucopyranose, 
(1—2)-linked o-D-mannopyranose with side 
chain (1—2)-linked o-D-mannopyranose trisac- 
charides) from  coral-associated fungus 
Aspergillus versicolor LCJ-5-4 (Chen et al. 
2012b), PS2-1 (containing glucuronic acid, man- 
nose with glucose, and galactose) from 
Penicillium sp. F23-2 (Sun et al. 2009), and As1-1 
isolated from mangrove endophytic fungus 
Aspergillus sp. Y16 showed antioxidant proper- 
ties by quenching free radicals (Chen et al. 2011). 
Apart from EPS, benzamide derivative (methyl 
4-(3, 4-dihydroxybenzamido) butanoate) from 
marine alga-derived endophytic fungus 
Aspergillus wentii EN-48 was also found to have 
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DPPH radical-scavenging activity (Li et al. 2014). 
EPS, from marine bacterium Edwardsiella tarda, 
ETW1, and ETW2 (mannan) possess antioxi- 
dant, hydroxyl, and DPPH radical-scavenging 
properties (Guo et al. 2010). 

Besides marine microbes, seaweeds are the 
benevolent organisms to yield antioxidants in 
great quantity. Recently, sulfated polysaccha- 
rides, extracted from brown algae Padina tetra- 
stromatica, Sargassum tenerrimum, and_ S. 
swartzii, were reported to have antioxidant prop- 
erties. Fucoidan (sulfated polysaccharide) 1iso- 
lated from seaweed Turbinaria conoides reduces 
oxidative stress in myocardial-injured rat by 
improving cellular antioxidants (Mohankumar 
et al. 2012). Vijayabaskar and Shiyamala (2012) 
have suggested that polyphenolic contents of 
Turbinaria ornata are responsible for its radical- 
scavenging property. Brown seaweed Cystoseira 
trinodis, collected in Mandapam coast, India, is 
identified to produce antioxidants, phlorotannins 
by Sathya et al. (2013). Novel compound from 
brown alga Ecklonia cava, 2,7"-phloroglucinol- 
6,6’-bieckol (PHB), serves as antioxidant (Kang 
et al. 2012). 

Apart from brown algae, red and green algae 
also produce antioxidants. Five nitrogen- 
containing bromophenols, 3-(2,3-dibromo-4,5- 
dihydroxybenzyl])pyrrolidine-2,5-dione; methyl 
4-(2,3-dibromo-4,5-dihydroxybenzylamino)- 
4-oxobutanoate; 4-(2,3-dibromo-4,5-dihydroxy- 
benzylamino)-4-oxobutanoic acid; 3-bromo-5- 
hydroxy-4-methoxybenzamide; and 2-(3-bromo- 
5-hydroxy-4-methoxyphenyl) acetamide, from 
red alga Rhodomela confervoides potently scav- 
enge DPPH radicals (ICs9s, 5.22—23.60 uM) (Li 
et al. 2012). Five — sesquiterpenoids 
(2,5,5-trimethyl-4-(4’-methyl-3'-penteny])-2- 
cyclohexen- 1-ol;4-isopenty1-3,4,5,5-tetramethyl-2- 
cyclohexen- 1 -ol;4-isopenty1-3,4,5,5-tetramethy]l- 
2-cyclohexen- 1 -ol;6-isopenty]-1,5,5,6-tetramethyl-1- 
cyclohexene;and3,4,5,5-tetramethyl-4-(3’-oxopentyl)- 
2-cyclohexen-l-one) from marine green alga 
Ulva fasciata are found to have antioxidant prop- 
erties (Chakraborty and Paulraj 2010). Thus, 
these antioxidants can be used to cope up with 
the oxidative stress after checking for their 
toxicity. 


35.3 Industrial Values 


35.3.1 Antifouling Activity 


Formation of the biofilms by microbes, mac- 
robes, and in combination on the submerged 
man-made surfaces in marine environment is 
termed biofouling and in due course establishes 
large complex biological layer that causes huge 
economic problem worldwide every year. Though 
heavy metals containing inorganic biocides are 
used as antifoulants, they are very toxic and 
directed to bioaccumulation and magnification. 
Hence, natural sources have been searched for 
eco-friendly potent antifoulants. Marine organ- 
isms including bacteria, fungi, and algae are 
explored to produce antifouling compounds. 
Aspergillipeptide C, a cyclic tetrapeptide, iso- 
lated from marine Gorgonian-derived fungus 
Aspergillus sp. SCSGAF 0076, poses strong anti- 
fouling activity against the larval settlement of 
bryozoans Bugula neritina (Bao et al. 2013). 
Shao et al. (201 1a) have isolated strong antifoul- 
ing compounds, 14-membered resorcylic acid 
lactone (cochliomycin A) and analogs zeaenol, 
LL-Z1640-1, and paecilomycin F, from 
Gorgonian-derived fungus Cochliobolus lunatus. 
These antifoulants inhibit the larval settlement of 
barnacle Balanus amphitrite (ECs 8 of 1.2, 5.0, 
5.3, and 17.9 pg mL"|, respectively). Likewise, 
fungus Aspergillus sp., isolated from gorgonian 
Dichotella gemmacea, was identified to produce 
aspergilone A, which is also inhibiting the settle- 
ment of B. amphitrite (ECs) of 7.68 pg mL") 
(Shao et al. 2011b). A  halotolerant fungus, 
Penicillium sp. OUCMDZ-776 produces anti- 
fouling alkaloid, penispirolloid A, which inhibits 
the larval settlement of bryozoans Bugula neri- 
tina (ECs) of 2.40 pg mL!) (He et al. 2012a),. 
a,B-unsaturated — lactones from marine 
Streptomyces sp. were found to inhibit the larval 
settlement of barnacle B. amphitrite. 

Six potent antifouling compounds, chro- 
manols S1-S6, isolated from brown alga 
Sargassum horneri, showed inhibition of various 
biofouling organisms including the larval settle- 
ment of hard fouling organism mussel Mytilus 
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edulis (ECs) of 0.11-3.34 wg mL), spore 
settlement of macroalga Ulva pertusa zoospores 
(EC5, of 0.01—-0.43 pg mL“'), the diatom Navicula 
annexa (ECs) of 0.008-0.19 wg mL”), 
Pseudomonas aeruginosa KNP-3, and 
Alteromonas sp. KNS-8 (MICs of 1.68—36.8 and 
1.02-30.4 pg mL, respectively) (Cho 2013). 
Cyclic diterpenoid dictyol C, isolated from 
marine brown alga Dictyota sp., was found to 
have antifouling activity by inhibiting the adhe- 
sion of marine’ bacterial biofilm of 
Pseudoalteromonas sp. D41 with an ECs, of 
30 uM (Viano et al. 2009). 

Cho et al. (2012) have isolated two 
diketopiperazines, namely, (6S, 3S)-6-benzyl- 
3-methyl-2,5-diketopiperazine and (6S, 35S)-6- 
isobutyl-3-methyl-2,5-diketopiperazine, from 
marine actinomycetes Streptomyces praecox 
291-11 (Fig. 35.1). They inhibit the settlement of 
diatom Navicula annexa and zoospores of green 
alga Ulva pertusa with the therapeutic ratio of 
>15 and hence, they are considered to be the 
promising antifoulants. In another study, Cho 
and Kim (2012) found that the diterpene 
compound lobocompactol is produced by the 
strain Streptomyces cinnabarinus PK209. 
Lobocompactol (Figure 35.1) is found to inhibit 
Ulva pertusa and Navicula annexa and showed 
the therapeutic ratio of 46.2 and 71.6 ug mL", 
respectively and also inhibited two biofilm form- 
ing bacteria KNP-5 and KNP-8. Five diketopi- 
perazines (DKPs), cyclo-(L-Leu-L-Pro), 
cyclo-(L-Phe-L-Pro), cyclo-(L-Val-L-Pro), cyclo- 
(L-Trp-L-Pro), and cyclo-(L-Leu-L-Val), were 
isolated from the deep-sea bacterium Streptomyces 
fungicidicus, showing the potential biofilm inhibi- 
tor against barnacle B. amphitrite. The ECs) value 
of five DKPs against barnacle larvae ranged from 
0.1 to 0.28 mM, and the LCs) ranged from 0.47 to 
0.75 mM (Li et al. 2006). Thus, any of such natu- 
ral antifoulants can be tried to overcome biofoul- 
ing phenomenon to save the environment. 


35.3.2 Biofuel 


In 2010, biofuel was fed to 2.7 % of the world’s 
requirement which reached almost 105 billion 
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liters as a whole. But, global demand for biofuels 
is continuously unabated every year. Though the 
land crops are used for biofuel production, the 
necessity for their production is dependent on the 
agricultural inputs (pesticides, fertilizers, farm- 
able land, and water). An alternative source (with 
abundance and carbohydrate-rich source), which 
can be independent of above said criteria, would 
be beneficial for enhanced biofuel production. By 
concerning these, the algae, particularly sea- 
weeds, would be expected as vital source for 
ethanol market due to their higher growth rate 
(Luning and Pang 2003), easy inhabitation, not 
requiring high fertilizers, low cellulose content, 
and devoid of lignin, a tough wall material usu- 
ally found in land plants (Thomas 2002). Large- 
scale seaweed cultivation would be a better 
alternative way for reducing the environmental 
pollution and global warming made by various 
anthropogenic activities (Kraan 2013). The types 
of biofuel from seaweeds can be categorized into 
biodiesel, bio-oils, and biohydrogens. A wide 
variety of algal genera especially kelps have been 
exploited recently as a rich source of biofuel (Qin 
et al. 2005), Merlin et al. (2013) reported that 
bioethanol production from Laminaria digitata, a 
well-known brown alga, would economically be 
beneficial than biogas (methane) production. 
Biodiesel is a product of the reaction between tri- 
glycerides and mono-alcohol (most commonly 
methanol or ethanol) with the catalysis of alkali, 
acids, or enzymes. Lipid content of an algal spe- 
cies 18 an important criterion for evaluating its 
biodiesel-producing capacity. The lipid profiles 
of many marine algae resemble with that of land 
plants and such algae can be considered as the 
best source for biodiesel production (Xu et al. 
2006b; Ohlrogge and Browse 1995). Cobelas and 
Lechado (1989) have analyzed the lipid profile of 
the various marine algae including cyanobacteria 
and reported that the fatty acid contents are as 
follows:C16:0andC 16:1 inthe Bacillariophyceae; 
C16:0 and C18:1 in the Chlorophyceae; C16:0 
and C18:1 in the Euglenophyceae; C16:0, C16:1, 
and C18:1 in the Chrysophyceae; C16:0 and 
C20:1 in the Cryptophyceae; C16:0 and C18:1 in 
the Eustigmatophyceae; C16:0 and C18:1 in the 
Prasinophyceae; C16:0 in the Dinophyceae; 
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C16:0,C16:1,andC18:1inthe Prymnesiophyceae; 
C16:0 in the Rhodophyceae; C14:0, C16:0, and 
C16:1 in the Xanthophyceae; and C16:0, C16:1, 
and C18:1 in Cyanobacteria. 

In contrast to marine eukaryotic algae, cyano- 
bacterial lipid content has been reduced at the 
laboratory conditions than in the natural environ- 
ment. This made a tough challenge in using cya- 
nophycean species for biofuel production, but by 
employing various genetic manipulation tech- 
niques, it is possible to increase their lipid con- 
tent (Hu et al. 2008; Beer et al. 2009). Wang et al. 
(2013b) studied mutated Synechocystis sp. 
PCC6803 for the production of alkanes in lab 
condition. By suppressing the expression of 
enzymes acyl-acyl carrier protein reductase, 
aldehyde-deformylating oxygenase and overex- 
pression of alkane biosynthetic genes in cyano- 
bacteria significantly improved the alka(e)ne 
production, and the yield was increased up to 
1.3 % (per DCW, dried cell weight), which was 
higher than 8.3-fold, comparatively to the wild- 
type strain (0.14 % per DCW). Recently, relevant 
bioenergy genes were identified in microalgae 
and relevant recombinant technology have been 
developed to engineer the pathway, resulting in 
higher yield of biofuel production (Beer et al. 
2009). 

Ahrens and Sander (2010) recently recorded 
that the cost for algal biomass conversion to com- 
mercial diesel product would be approximately 
50 €/L. Demirbas and Demirbas (2011) have 
reported that the oil production from marine 
algae is 7—-31-fold higher than palm oil from 
respective crop. They also added that 50 % bio- 
mass can be converted to biofuel due to high con- 
tent of lipids in the plant thalli. At mean time, 
algal lipids were also subjected to a bunch of con- 
straints including extraction, slurry-drying, and 
cost-effective biomass production and storage 
conditions (Hu et al. 2008). Fortunately, the aver- 
age lipid content of the algal cells has been raised 
in stress conditions, 1.e., culturing in vitro condi- 
tions. Hu et al. (2008) found that the lipid content 
has increased from 22.7 to 44.6 % of its dry cell 
weight while growing isolated diatoms at labora- 
tory conditions. Hydrocarbons are found at 
higher ratio next to triacylglycerol, up to 5 % 
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DCW (Lee and Loeblich 1971 in algae. Golueke 
et al. (1957) proposed an idea of anaerobic diges- 
tion of polluted biomass. Sialve et al. (2009) used 
these coupled techniques for the production of 
biogas from algal mass. On continuing this, a 
number of studies related to the biogas evaluation 
from algae have been achieved recently. 
El-Mashad (2013) has studied the methane pro- 
duction from anaerobic batch digestion between 
switch grass (Panicum virgatum) and Spirulina 
platensis. They reported that 83 % switch grass 
and 17 % S. platensis was well suited for meth- 
ane (236 mL g"') production on 40 days incuba- 
tion. Bruhn et al. (2011) reported that the 
anaerobic digestion with Ulva lactuca, isolated 
from marine environment, yielded 271 mL meth- 
ane per gram of cattle manure at a cultivating 
facility. They also added that the dried biomass 
produced five- to ninefold higher yield than wet 
biomass during the experiment. 


35.3.3 Nutraceuticals 


Consumption of marine foods provides enor- 
mous health benefits such as neuroprotective 
agents, curing cardiovascular diseases, diabetes, 
etc. (Miyashita and Kim 2011). Marine foods 
contain natural products like polyunsaturated 
fatty acids (PUFAs), polysaccharides, natural 
pigments (NPs), essential minerals, vitamins, 
enzymes, and bioactive peptides which serve 
such benefits (Pangestuti and Kim 201 1a). Soliev 
et al. (2011) have reviewed about marine bacte- 
rial pigments including prodiginines, carotenes, 
violacein, phenazines, quinones, tambjamines, 
melanins, scytonemin, and tryptanthrin, and they 
depicted a variety of biological activities like 
antimicrobial, anticancer, and antifouling activi- 
ties. Several marine algae have been reported for 
acetylcholinesterase (AChE) inhibitory activity; 
thereby, they can be used for treating Alzheimer’s 
disease (Pangestuti and Kim 2011b). Four marine 
algae, namely, Hypnea valentiae, Padina gymnos- 
pora, Ulva reticulata, and Gracilaria edulis, have 
been shown to inhibit AChE with the IC;,) values 
of 2.6, 3.5, 10, and 3 mg mL", respectively 
(Suganthy et al. 2010). The marine actinomycete 
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Streptomyces sp. strain LB173 produces a new 
phenazine natural product, geranylphenazinediol, 
which inhibits AChE with [C59 value of 2.62 uM 
(Ohlendorf et al. 2012). Protein tyrosine phos- 
phatase 1B (PTPIB) is the major cause for type 2 
diabetes mellitus by negative regulation of insu- 
lin signaling pathways. Yong et al. (2008) 
revealed the red algae Rhodomela confervoides 
shows potent inhibition against the PTP1B. Four 
bromophenol derivatives were derived from 
this red alga: 2,2’,3,3'-tetrabromo-4,4’,5,5’- 
tetrahydroxydiphenyl methane; 3-bormo-4,5-bis 
(2,3-dibromo-4,5-dihydroxybenzyl) pyrocatechol; 
bis(2,3-dibromo-4,5-dihydroxybenzyl) ether; 
and =. 2,2’,3-tribromo-3’,4,4’,5-tetrahydroxy-6’- 
ethyloxymethyldiphenylmethane, which inhibit 
PTPIB with ICs) values of 2.4, 1.7, 1.5, and 
0.84 uM, respectively (Yong et al. 2008). 


35.3.4 Marine Metagenomics 


Marine world confers shelter for various life 
forms including autotrophs, heterotrophs, psy- 
chrophiles, mesophiles, thermophiles, hyperther- 
mophiles, acidophiles, alkaliphiles, halophiles, 
barophiles, etc., having diverse metabolic behav- 
ior. Undoubtedly, they are adapted well to survive 
such extreme conditions with the help of poten- 
tial biomolecules which can predominantly be of 
enzymes, proteins, and simple/complex com- 
pounds. Exploration of such products can be 
applied for various purposes that can withstand in 
extreme conditions. Many of such organisms are 
uncultivable in the laboratory mainly because of 
the lack of providing or mimicking their natural 
conditions. But, metagenomics is an important 
tool aiding to explore such potential products 
coded by a metagenome that is obtained from 
marine floral, faunal, and environmental samples. 
Metagenomics is engaged with sequential steps 
of DNA extraction from metagenome, library 
construction, library screening, target gene 
sequencing, and product characterization. DNA 
could be isolated by either direct or indirect 
methods, though direct methods result in higher 
yield. In case of lower yield, multiple displace- 
ment amplification (MDA) technique can be 
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employed to amplify the whole genome to yield 
good amount of DNA. Choice of vector for 
library construction depends on the DNA frag- 
ment size, e.g., 10 kb fragment needs vectors 
used in small-size library construction, viz., high 
copy number plasmid pUC19; likewise, fosmids/ 
cosmids can harbor 20—40 kb inserts, while BACs 
have the capacity for 100-200 kb inserts. Broad- 
host-range vectors can also be employed for 
library construction that allows the expression of 
inserts in different hosts. Apart from E. coli, 
hosts like Bacillus, Pseudomonas, Streptomyces, 
and Rhizobium (having more RNA polymerase o 
factors) and certain archaebacteria are utilized 
for expression. To determine the bioactive clones 
from the constructed library, two strategies could 
be employed, namely, activity-based screening 
and sequence-based screening. The former strat- 
egy enables the finding of genes that code for 
novel or already known class of bioactive com- 
pounds or enzymes. It is all about the host’s abil- 
ity to anabolize such products and _ not 
experiencing the toxicity. The latter strategy can 
be accomplished with the engagement of two 
techniques, i.e., PCR and hybridization. It can be 
performed only if the knowledge of the sequence 
is available, and it is all about the presence of 
conserved domain or consensus sequence in the 
clones (Felczykowska et al. 2012). For the 
sequence-based screening of the clones for the 
bioactive polyketides/short peptides, biosynthetic 
gene clusters of modular polyketide synthases 
(PKSs), non-ribosomal peptide synthetases 
(NRPSs), and hybrid polyketide synthase/non- 
ribosomal peptide synthetase can be employed. 
The sequence of NRPS gene clusters of target 
clone can be annotated into the polypeptide 
sequence with the help of computer packages 
ClustScan and CompGen (Zucko et al. 2010). He 
et al. (2013b) have isolated eleven cyclopeptides 
(Cyclo(L-Thr-L-Leu), Cyclo(L-Val-D-Pro), 
Cyclo(L-Ie-D-Pro), Cyclo(L-Leu-L-Pro), 
Cyclo(L-Val-L-Leu), Cyclo(L-Leu-L-Ile), 
Cyclo(L-Leu-L-Leu), Cyclo(L-Phe-L-Tyr), 
Cyclo(L-Trp-L-Pro), Cyclo(L-Val-L-Trp), and 
Cyclo(L-Ile-L-Trp)) from antibacterial (against 
Bacillus cereus) clone pDC113 of E. coli 
EPI300-TIR, by activity-based screening of 
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metagenomic library of Japanese marine sponge 
Discodermia calyx. Previously, red pigments 
Zn-coproporphyrin If, coproporphyrin — III, 
Zn-protoporphyrin [X, and protoporphyrin IX 
have been isolated from the clones of pDC112 
and pDC114 by functional screening of sponge 
D. calyx metagenomic library (He et al. 2012b). 
Donia et al. (2011) have found anticancer cyano- 
bactin class of the compound minimide, through 
sequence-based screening from the ascidian sam- 
ple Didemnum molle. Antitumor compound 
bryostatin, from marine bacteria Candidatus E. 
sertula, a symbiont of marine bryozoan Bugula, 
apratoxin A from marine cyanobacterium 
Lyngbya_ bouillonii, salinosporamide A from 
marine actinomycete Salinispora pacifica, and 
onnamide from marine sponge Theonella swin- 
hoei have been discovered through sequence- 
based screening of NRPS/PKS gene clusters and 
genome mining (Nikolouli and Mossialos 2012). 
Apart from bioactive compounds, marine 
metagenomics can also be applied for screening 
the oil-degrading enzymes from the microbes at 
the site of oil seepage from ship to control oil pol- 
lution. Enzymes like esterase, lipase, protease, 
DNA polymerase, cellulase, chitinase, amidase, 
alkaline hydroxylase, pigments, proteins, and 
dyes explored from extremophiles via metage- 
nomics can be applied in various industrial and 
research usages (Felczykowska et al. 2012). 


35.4 Molecular Applications 


Among the marine organisms, the algae are found 
to be the ideal expression systems; this is because 
of higher growth rate (lower doubling time within 
24 h), cost-effective biomass production, post- 
transcriptional and posttranslational modifica- 
tions comparable with other eukaryotes, and 
controllable bioreactor growth (Janssen et al. 
2003). Moreover, microalgae possess strong con- 
stitutive homologous promoters in their expres- 
sion systems. For instance, fucoxanthin 
chlorophyll-binding protein (Fcp A), from marine 
alga Phaeodactylum tricornutum, has extensively 
been used for the expression of heterologous pro- 
teins (Walker and Collet 2005). On the other hand, 
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cold-shock proteins (CSPs) from marine organ- 
isms are important proteins for industrial usage 
below 10 °C. An Antarctic psychrophilic bacte- 
rium, Shewanella livingstonesis Ac10, has been 
regarded as a better model organism in character- 
izing the CSP chaperons. These are with peptidyl 
prolyl cis-trans isomerase activity (PPlase) that 
catalyzes the cis-trans isomerization of proline 
imidic peptide bonds which is required for proper 
folding of protein at cold temperatures. Tig is also 
a cold-adapted chaperon isolated from S. living- 
stonesis, possessing PPlase activity that properly 
folds the nascent peptides (Heidelberg et al. 
2002). An engineered fluorescent protein, phyto- 
chrome, IPF1.4, which emits light at infrared 
region, expressed from a marine bacterium 
Deinococcus radiodurans, has been used in the 
visualization of even deeper tissues in medical 
fields. iRFP, another phytochromic protein engi- 
neered in marine bactertum Rhodopseudomonas 
palustris, is also used in infrared imaging (Filonov 
et al. 2011). The marine algae Phaeodactylum tri- 
cornutum, Cyclotella cryptica, Navicula saproph- 
ila, Cylindrotheca fusiformis, and Thalassiosira 
weissflogii have been evolved as suitable expres- 
sion systems for heterologous proteins with a 
number of selectable marker genes including ble 
(zeocin (bleomycin family)-binding protein), nat 
(nourseothricin acetyltransferase), sat-1 (strepto- 
thricin acetyltransferase), and nptII (neomycin 
phosphotransferase II). Zaslavskaia et al. (2001) 
have developed a multiple cloning site vector 
from diatom P. tricornutum by adding ble (bleo- 
mycin antibiotic) selectable marker, and multiple 
cloning site regions are linked with fcpA (fuco- 
xanthin, chlorophyll a/c-binding protein) pro- 
moter. The recombinated P. tricornutum grew as a 
facultative heterotrophic organism, and this engi- 
neered organism can express various genes from 
heterologous species. Reporter genes including 
cat (chloramphenicol acetyl transferase), luc 
(luciferase), uidA (B glucuronidase), and gfp 
(green fluorescent protein) could be engineered 
with diatom-specific promoter (fcp) to make effi- 
cient selection of expression vectors (Montsant 
et al. 2004). Red alga Porphyra yezoensis has 
been genetically engineered with bacterial based 
nitroreductase gene (nfs) to enable the organism 
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to detoxify the trinitrotoluene for bioremediation 
purpose (Bernasconi et al. 2004). 

Certain animal models help the scientific com- 
munities in assessing the lethal or sublethal 
impacts of contaminants or mixture of contami- 
nants for research studies. The utilization of 
marine organisms as biological models would be 
logically based on three categories, namely, filter 
feeders and pelagic and benthic species. Lee 
et al. (2009) have suggested that the presence of 
sema domain clusters (domain of secreted or 
transmembrane protein involved in the axon 
guidance of neural development) in the sea hare, 
Aplysia californica (mollusk), makes this organ- 
ism a powerful model system for studying the 
functioning of the nervous system. The transpar- 
ency of the sea urchin’s eggs enabled them as a 
model for studying the fertilization phenomena, 
and it assists to evaluate the impacts of toxins or 
drugs (http://www.exploratorium.edu/imaging- 
station/research/urchin/story_urchin1.php). 


35.5 Conclusion 


This review summarizes and substantiates the 
broad spectrum of bioactive natural products from 
marine bacteria, fungi, and algae with pharmaco- 
logical and industrial importance. This represents 
their efficient anabolic properties. Though immense 
of bioactive compounds with variety of pharmaco- 
logical and industrial potentials have intensively 
been explored, a very few of them have been 
brought to the human applications and a few syn- 
thetic derivatives (Soblidotin (TZT1027), 
Plinabulin (NPI-2358), Curacin A derivative, etc.) 
of marine-derived natural products were tested for 
clinical trials in various phases. This is because of 
very low yield, lack of stability, difficulty in in vitro 
laboratory synthesis, etc. In case of microbial culti- 
vation, parameters like aeration, growth rate, nutri- 
ents, and contaminants would influence the yield of 
bioactive products. Despite the vast opportunity for 
using marine organisms in human welfare, clear- 
cut knowledge is required in isolating these unique 
bioactive metabolites. Mass cultivation strategies 
using improved bioreactors, synthesizing chemical 
analogs of a product without affecting its function- 
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ality and improving their stability, may encourage 
the launching of the product. The research on this 
field must further be expanded and implemented 
for future use. Marine metagenomics is an impor- 
tant tool to find the potential products mainly from 
uncultivable and extremophilic organisms. These 
marine forms are the golden source for mining eco- 
nomically valuable products with the tool of bio- 
technology to aid human well-being. 
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Abstract 

Arid lands occupy nearly one third of the world’s total land area, and about 
13 % of the world population are living in these inhospitable areas. 
Agriculture production from these areas is minimal due to prevailing 
hostile environment, and most of the land is underutilized. Recent advances 
in modern agriculture and biotechnology have provided scope of utilizing 
those underutilized arid lands for agriculture production. Plant tissue cul- 
ture is one of the applications of biotechnology by which plants can be 
mass propagated using vegetative and sexual tissues without losing their 
genetic fidelity. Somaclonal variation is common among in vitro-derived 
plants; however, it could broaden genetic variability and genetic improve- 
ment. Molecular marker technology can be applied to identify variability 
and to select useful variants. Genomic and transcriptomic data and genetic 
engineering through gene transfer techniques pave the way for genetic 
improvement of different crops and plants. This chapter discusses the 
efforts made to improve jojoba [Simmondsia_ chinensis (Link) 
C.K. Schneid], an industrial oil-yielding species; date palm (Phoenix 
dactylifera L.), one of the oldest domesticated desert trees with socioeco- 
nomic importance; and Acacia, an important fodder and energy species for 
semiarid and arid regions. 
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species came under threat. Arid lands occupy 
nearly one third of the world’s total land area, and 
about 13 % of the world population are living in 
these inhospitable areas. Agriculture production 
from these areas is minimal due to drought, high 
temperature, salinity and scarcity of water 
resources, and, hence, serious problem for food 
security. Recent advances in modern agriculture 
and biotechnology have provided scope of utiliz- 
ing underutilized arid land resources for agricul- 
ture production and have great potential in 
solving many problems pertaining to agriculture, 
industry, environment, and health which has 
direct relevance to sustainable development of 
countries located in arid land. Tissue culture 
technology has the potential to multiply plants 
in vitro using minimum quantity of propagules 
without losing their genetic fidelity and facili- 
tates large-scale production of quality planting 
material and in genetic improvement of species. 
This chapter discusses the efforts made to 
improve jojoba [Simmondsia chinensis (Link) 
C.K. Schneid], an industrial oil-yielding species; 
date palm (Phoenix dactylifera L.), one of the 
oldest domesticated desert trees with socioeco- 
nomic importance; and Acacia, an important fod- 
der and energy species for semiarid and arid 
regions. 


36.2 Jojoba: Simmondsia 
chinensis (Link) C.K. Schneid: 
Simmondsiaceae 


36.2.1 Introduction 


Simmondsia_ chinensis, commonly known as 
jojoba, is native to the Sonoran Desert of south- 
western USA and northern Mexico (Mills et al. 
1997), with a life span of 100—200 years. It is an 
economically important wind-pollinated, ever- 
green, perennial, dioecious shrub. It can tolerate 
extreme temperatures (—5 to 54 °C) and can be 
cultivated on a variety of marginal lands that are 
not suitable for conventional agriculture 
(Bhardwaj et al. 2010). The seeds contain 
50-55 % oil (Chikara and Kumari 1991; 
Bhardwaj et al. 2010), and the oil properties are 


M.P. Reddy 


similar to the oil obtained from the sperm whale 
(Low and Hackett 1981), which is now listed as 
an endangered species. Jojoba oil and its deriva- 
tives have potential in a wide range of applica- 
tions in cosmetics, lubricants, plastic industries, 
pharmaceuticals, and medicine and also as bio- 
fuel (Reddy and Chikara 2010; Kumar et al. 
2012). Recognition of jojoba oil as an alternative 
to sperm whale oil and its adaptability to grow on 
marginal lands has led to surge of interest in agri- 
culture production of jojoba across the world. 
The plant is dioecious and exhibits tremendous 
variability in growth and yield (Chikara and 
Kumari 1991). Generally male plants outnumber 
in seed-raised population which is not desirable 
in commercial plantations, and identification of 
sex in early seeding stages has met with limited 
success (Agrawal et al. 2007). Setting up a plan- 
tation with asexual propagules saves time in 
replanting, as well as crop production and lineage 
(Llorente and Apostolo 2013). Vegetatively prop- 
agation is carried out by rooting of stem cuttings 
or by tissue culture method. Vegetative propaga- 
tion by layering (Bashir et al. 2005), grafting 
(Bashir et al. 2006), and stem cuttings (Bashir 
et al. 2007) has been established with satisfactory 
results using selected genotypes. The major limi- 
tations of these techniques are the availability of 
large quantities of desired planting material and 
the seasonal propagation. 


36.2.2 Micropropagation 


Micropropagation is an alternative method of 
vegetative propagation and offers advantages, a 
fort for commercial production in a limited space 
without losing its genetic nature and with a very 
limited starting material; the technique is also 
important in genetic improvement of the species. 
Micropropagation has reached a commercial 
level in many plant species in recent years 
(Chandra and Mishra 2003). During the past few 
decades, therefore, several attempts have been 
made to propagate jojoba in vitro on various cul- 
ture media supplemented with different concen- 
trations and combinations of plant growth 
regulators, with varying levels of success 


36 Desert Plant Biotechnology: Jojoba, Date Palm, and Acacia Species 


(Chaturvedi and Sharma 1989; Llorente and 
Apostolo 1998; Roussos et al. 1999; Gao and 
Cao 2001; Prakash et al. 2003; Tyagi and Prakash 
2004; Bashir et al. 2008; Singh et al. 2008, 2011; 
Kumar et al. 2009, 2010b). BAP is the most com- 
monly used growth regulator in jojoba tissue cul- 
ture (Llorente and Apostolo 1998; Tyagi and 
Prakash 2004; Bashir et al. 2007). According to 
Hassan (2003) and Kumar et al. (2009), com- 
bined effect of BAP with NAA or IAA was found 
to be very effective for establishment and subse- 
quent multiplication. In most such experiments, 
cytokinins (BAP, kinetin, and zeatin) have been 
used in combination with auxins (NAA, IAA, 
and IBA). Bashir et al. (2008), working with dif- 
ferent genotypes of jojoba, showed that BAP 
alone was better than kinetin alone or BAP + 
kinetin for in vitro shoot initiation, whereas 
Singh et al. (2008) reported BAP in combination 
with adenine proved to be the best for shoot 
induction and subsequent multiplication. Increase 
in KNO, concentration in the medium improved 
shoot multiplication rate and in vitro flowering in 
20 % of male cultures. 

Success of in vitro regeneration depends on 
the control of morphogenesis, which is influenced 
by the kind of tissue or explants, age, origin of 
the initial tissue, composition of media, plant 
growth regulators (PGRs), media additives, cul- 
ture environment, etc. (Giri et al. 2004; Rai et al. 
2010). In jojoba the number of shoots produced 
in vitro depended upon the source of explant 
(Gao and Cao 2001), type of explant (Hassan 
2003), composition of media, nature of growth 
regulators and their concentrations and combina- 
tion, genotype (Llorente and Apostolo 1998; 
Prakash et al. 2003; Tyagi and Prakash 2004), 
and type of vessels and cultural conditions (Mills 
et al. 2004). Shoot explants of male and female 
exhibited either differential morphogenic behav- 
ior (Prakash et al. 2003) or similar response 
(Singh et al. 2008) under the influence of various 
adjuvants. Llorente and Apostolo (2013) reported 
a protocol suitable for micropropagation of dif- 
ferent genotypes of jojoba. According to Singh 
et al. (2011) regeneration is influenced signifi- 
cantly by the size of the leaf attached to the peti- 
ole and the orientation of the explant (1.e., vertical 
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or horizontal). Petioles obtained from pruned 
branches of plants responded better than those 
from unpruned plants. However, the efficiency of 
regeneration of in vitro leaves was not influenced 
by the size of the leaf attached to the petiole, but 
the orientation of the explant significantly influ- 
enced regeneration (Fig. 36.1). Infection-free 
cultures (90-99 %) could be established when 
the explants were collected from the field during 
October to May (Singh et al. 2008), and the 
response of explants was better when the explants 
were collected between August and November 
(Kumar et al. 2009). 

The auxin to cytokinin ratio in the medium is 
critical for morphogenesis (Rout et al. 2000; 
Pati et al. 2006). High cytokinin and low auxin 
concentration in the medium favored shoot 
induction, reverse concentrations promoted root 
induction, and intermediate concentration 
induced callus development (Harris et al. 1989). 
Sucrose (Kumar et al. 2009), adenine sulfate 
(Singh et al. 2008), and triiodobenzoic acid 
(Prakash et al. 2003) had a significant effect on 
shoot multiplication and elongation in jojoba. 
The superiority of BAP in shoot induction may 
be due to its ability to metabolize more rapidly or 
may induce natural hormones like zeatin synthe- 
Sis within the tissues (Rai et al. 2010). 

Rooting and subsequent establishment of 
plantlets is very important to make micropropa- 
gation successful (Pati et al. 2006). Half-strength 
MS medium was adequate for root induction 
(Singh et al. 2008; Kumar et al. 2009). 
Experiments involving rooting with either auxins 
alone or in combination with other hormones 
showed significant differences (Chaturvedi and 
Sharma 1989; Tyagi and Prakash 2004; Bashir 
et al. 2008). According to Singh et al. (2008) IBA 
alone was more effective in root induction as 
compared to IBA+ JAA or IBA+ JAA+ 
NAA. Tyagi and Prakash (2004) observed differ- 
ences in the rooting behavior of male and female 
plants, whereas Singh et al. (2008) reported no 
difference in rooting response of male and female 
genotypes. Clonal differences in rooting and sub- 
sequent acclimatization were also recorded 
(Apostolo et al. 2001; Bashir et al. 2008). Twenty- 
four to 32 % rooting with IAA pulse treatment 
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Fig. 36.1 Micropropagation of jojoba using nodal segments. (a) Multiple shoots (primary culture), (b)subculture( 
15-20 shoots), (c) in vitro rooting, (d) ex vitro rooting, (e) 1-year old plants (Source: Singh et al. 2011) 


and 50-60 % rooting with IAA and IBA combi- 
nations were reported (Mills et al. 1997). 
According to Chaturvedi and Sharma (1989), the 
combination of IBA and NAA was more effective 
in inducing rhizogenesis than either of them 
alone. It is an established fact that although aux- 
ins are essential for root induction, they are not 
required for root growth; their continued pres- 
ence may even inhibit the root growth (Mills 


et al. 1997). Addition of activated charcoal has 
been reported to promote rooting (Thomas 2008). 
Rooting is also promoted by the addition of caf- 
feic acid (Chaturvedi and Sharma 1989) and 
cyclodextrins (Apostolo et al. 2001). 

Hardening of rooted plants prior to transfer in 
the soil increases the survival rate. Various types of 
substrates such as soil vermiculite mixture (Gulati 
and Jaiwal 1996), sterilized sand (Thakur et al. 
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2001), soil (Dewan et al. 1992), and soilrite and soil 
(Agrawal et al. 2007) have been used. Meyghani 
et al. (2005) observed that peat and perlite at a ratio 
of 1:1 or 1: 2 is the most suitable medium for trans- 
planting and subsequent adaptation of jojoba plant- 
lets. Singh et al. (2008) reported the best survival 
rate when sand alone was used as a substrate. The 
survival rate was higher for plantlets which devel- 
oped roots in vitro in response to IBA; however, no 
differences were observed during field establish- 
ment (Bashir et al. 2008). Kumar et al. (2010b) 
observed 75 % field establishment in jojoba plants 
acclimatized in presterilized soil-sand (1:3), 
whereas Singh et al. (2008) reported 99 % field 
establishment when sand alone was used, as com- 
pared to soilrite alone and soilrite + sand. 


36.2.3 Somatic Embryogenesis 


Somatic embryos are believed to be bipolar struc- 
tures that arise from individual cells and often 
have no vascular connection with the maternal 
tissue of the explant (Rai et al. 2010). Embryos 
may develop directly from somatic cells (direct 
embryogenesis) or can be preceded by numerous 
organized non-embryogenic mitotic cycles (indi- 
rect embryogenesis). Somatic embryogenesis has 
a great potential for clonal multiplication and an 
ideal experimental process for studying differen- 
tiation and totipotency in plant cells (Litz and 
Gray 1992). In addition, under controlled envi- 
ronmental conditions, somatic embryos germi- 
nate readily, similar to their seed embryo 
counterparts. Besides being useful for clonal 
multiplication, regeneration through somatic 
embryogenesis may also be useful for genetic 
transformation (Kim and Liu 1999) and to 
develop new products (Benzioni 1995). Reports 
on somatic embryogenesis in jojoba are sparse 
and focused mostly on in vitro wax production 
using zygotic embryos (Wang and Janick 1986). 
Hamama et al. (2001) developed a protocol for 
the induction, maturation, and germination of 
somatic embryos from leaf explants. Direct 
somatic embryogenesis was observed with some 
zygotic embryo explants, whereas leaf-derived 
embryonic calli did not mature on any of the 
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maturation/germination media (Mohammed 


et al. 2008). 


36.2.4 Synthetic Seeds 


Due to differences in male to female ratio and 
long gestation period (Sharma et al. 2008; Kumar 
et al. 2011), jojoba may be a suitable species to 
develop synthetic seed, as the technology has 
advantages besides effective method of propaga- 
tion of known sex, germplasm storage, and con- 
servation (Rai et al. 2009). Only two reports are 
available on the production of synthetic seeds in 
jojoba (Hassan 2003; Kumar et al. 2010b), where 
shoot tips and axillary buds were encapsulated 
using sodium alginate and calcium chloride 
and complete plant regeneration from encapsu- 
lated shoot tips has been achieved. 


36.2.5 Molecular Approaches 
for Identification of Genetic 
Fidelity and Sex 
Determination 


While generating quality planting material using 
micropropagation on a commercial scale, genetic 
stability is the most important to maintain the 
advantageous characters of elite plants (Rahman 
and Rajora 2001). Micropropagated plants gener- 
ated using shoot tips and axillary buds are 
reported to maintain clonal fidelity (Ostry et al. 
1994), whereas Tripathi et al. (2006) reported 
somaclonal variation as a major constraint in 
commercial production of true-to-type plants. 
Somaclonal variations occur in response to the 
in vitro stresses and are manifested in the form of 
DNA methylation, chromosome rearrangements, 
and point mutations (Phillips et al. 1994). 
Therefore, a quality checkup for true-to-type 
planting material at an early stage of develop- 
ment is very essential. Molecular markers serve 
as an important tool to check the genetic unifor- 
mity and the true-to-type nature of micropropa- 
gated plants. 

Among various DNA-based markers, ampli- 
fied fragment length polymorphism (AFLP) and 
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restriction fragment length polymorphism 
(RFLP) are the most reliable markers for screen- 
ing of genetic fidelity in tissue-cultured plants. 
Due to requirement of radioactive labeling and 
expensive enzymes, these techniques have less 
preference in many _ laboratories. Random 
Amplified Polymorphic DNA (RAPD) and inter- 
simple sequence repeats (ISSR) markers, on the 
other hand, require only small amount of DNA 
sample, do not involve radioactive labeling, and 
are simple, fast, and proven to be quite efficient 
in detecting genetic fidelity even in closely 
related organism such as near isogenic lines 
(Reddy et al. 2002). The marker systems have 
been successfully used in demonstrating soma- 
clonal variation in different plant species (Joshi 
and Dhawan 2007). However, such reports are 
very limited in jojoba. Kumar et al. (2011), using 
RAPD and ISSR markers, demonstrated that 
axillary bud multiplication is a safe method for 
production of true-to-type plants in jojoba. Early 
detection of sex (Agarwal et al. 2007, 2011; 
Sharma et al. 2009; Ince et al. 2010; Hossain 
et al. 2001) and genetic variability (Amarger and 
Mercier 1995) using molecular techniques is also 
reported in jojoba. 


36.2.6 Molecular Approaches for Oil 
Synthesis 


The liquid wax from jojoba seed has a carbon 
chain length of C38—C44 and is composed mainly 
of C20:1 fatty acids and C22:1 fatty alcohols. Oil 
has a multitude of important applications in medi- 
cine and cosmetics industry and as a lubricant. The 
high cost and limited production of jojoba waxes 
restrict their use in spite of high demand. The 
advent of genetic engineering has provided novel 
opportunities to increase the availability of plant- 
derived lipids (Kalscheuer et al. 2006). Enzymes 
involved in wax synthesis in developing jojoba 
seeds have been characterized (Metz et al. 2000) 
and the corresponding cDNAs cloned (Metz and 
Lassner 1994). Overexpression of these genes has 
resulted in production of jojoba wax in the seeds of 
transgenic model plants (Lassner et al. 1996; 
Lardizabal et al. 2000). Kalscheuer et al. (2006) 
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demonstrated the possibility of de novo synthesis 
of wax esters in a recombinant microorganism by 
co-expression of a fatty alcohol-producing bifunc- 
tional acyl-coenzyme. 


36.2.7 Conclusions and Future 
Prospects 


During the recent past, significant progress has 
been achieved in micropropagation, marker- 
assisted selection, and sex detection at seedling 
stage. However, there is a need to exploit modern 
tools of biotechnology and molecular biology to 
develop a robust micropropagation system that 
can be applied to different genotypes and an effi- 
cient transformation system for genetic improve- 
ment, disease resistance, yield, etc. In addition, 
in vitro production of useful metabolites has not 
been attempted in this species and needs to be 
addressed in future research efforts. 


36.3 Date Palm (Phoenix 
dactylifera L.) 


36.3.1 Introduction 


Date palm (Phoenix dactylifera L.) is a dioecious 
edible fruit-yielding tree species that grows in 
warm and arid climates, and its global annual 
production is about 5.5 million metric tons. Since 
ancient times this plant has been recognized as 
the “tree of life” because of its integration in 
human settlement and food security in hot and 
barren parts of the world, where only a few plant 
species can flourish. Date palm provides a major 
source of income for local farmers and associated 
industries in arid zones. The economic utility of 
these palms is multifold as staple food, bever- 
ages, ornamentals, and industrial materials 
(Balick and Beck 1990). It is one of the oldest 
domesticated trees with socioeconomic impor- 
tance (Masmoudi-A et al. 2009). The earliest cul- 
tivation of date palm goes back to 3,700 BC, in 
the area between the Euphrates and the Nile 
River (Munier 1973). Expansion of date palm 
agriculture is faced with challenges stemming 
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from propagation and genetic improvement, a 
limitation that deprives the actual potential (Jain 
2012). The heterozygous nature of this dioecious 
species hampers the use of seed which produces 
off-type seedlings and takes 8—10 years or more 
for fruiting (Jain 2012). Two to three percent of 
male palms are sufficient to meet the pollination 
requirements, but in seed-raised population, half 
of the progeny is composed of male trees and not 
possible to identify before flowering (Eke et al. 
2005). The application of tissue culture tech- 
niques for date palm has many advantages for 
propagation of healthy selected female cultivars, 
large-scale multiplication, avoiding seasonal 
effects, production of genetically uniform plants, 
fast exchange of plant material, and economical 
reliability. Looking at the advantages, micro- 
propagation techniques have been developed for 
rapid mass propagation of date palm. Plant regen- 
eration through tissue culture is also important 
for genetic improvement (Parveez et al. 
2000) and conservation of genetic resource 
(Engelmann 2004). 


36.3.2 Micropropagation 


Since 1970 extensive efforts were made to mass 
propagate date palm using tissue culture tech- 
niques (Schroeder 1970; Eeuwens 1978; Tisserat 
1979; Sharma et al. 1986; Bouguedoura et al. 
1990; Bhaskaran and Smith 1992; Sharon and 
Shankar 1998; Al-Khalifah 2000; Fki et al. 2003, 
2011; Al-Khayri 2007; Al-Khateeb 2008a, b; 
Kumar et al. 2010a; Jain 2012). 

To establish aseptic cultures, zygotic embryos 
(Reynolds and Murashige 1979), shoot tips 
(Veramendi and Navarro 1997), lateral buds 
(Bouguedoura et al. 1990), leaves (Fki et al. 
2003), or inflorescences (Bhaskaran and Smith 
1992; Fki et al. 2003) have been used. Among 
different types of disinfecting solutions used, 
mercuric chloride (HgCl,) was found to be the 
most efficient disinfectant (Drira and Benbadis 
1985). Like for many other plant species 
Murashige and Skoog (1962), medium has been 
the most preferred one, and 2,4-D is the most 
popular PGR for callogenesis (Fki et al. 2003). 


Liquid media for plant regeneration has been 
widely used (Fki et al. 2003; Sharma et al. 1986) 
for large-scale regeneration. Micropropagation 
via direct organogenesis has also been attempted 
for rapid clonal propagation of elite genetic mate- 
rial of date palm (Al-Khayri 2007; Khierallah 
and Bader 2007; Jain 2007) and different geno- 
types (Aaouine 2003). The major concern with 
this approach is somaclonal variation that is 
dependent on various factors including genotype, 
explants, and plant growth regulators (Jain 2012). 
Direct regeneration of vegetative buds minimizes 
the risk of somaclonal variation for commercial 
production of date palm cultivars (Jain 2012). 


36.3.3 Somatic Embryogenesis 


Somatic embryogenesis has tremendous poten- 
tial for rapid large-scale plant production. The 
advantages of somatic embryogenesis are cost- 
effectiveness, both shoot meristem development 
and root meristem development taking place in a 
single step, quick and easy to scale up, automa- 
tion, long-term storage (cryopreservation), 
virus-free plants, and facilitates genetic transfor- 
mation (Jain 2007). Somatic embryogenesis has 
the high potential for mass propagation of date 
palm; hence, more emphasis was given to somatic 
embryogenesis (Reynolds and Murashige 1979; 
Bhaskaran and Smith 1992; El Hadrami et al. 
1995; Fki et al. 2003; Al-Khateeb 2008a; 
Othmani et al. 2009). Somatic embryogenesis 
can be achieved either directly from explant 
(Sudhersan et al. 1993) or from embryogenic 
calli (Al-Khayri 2005), though the first method is 
the most preferred one but has not been perfected 
yet (Fki et al. 2011). The duration of the cultiva- 
tion period in liquid medium was found to be 
very important for the balanced germination of 
somatic embryos (Jain 2007). Kumar et al. 
(2010a) reported an optimized protocol for 
somatic embryogenesis (Fig. 36.2) and regenera- 
tion from embryogenic suspension cultures of 
date palm for commercial production (Fki et al. 
2003). Smith and Aynsley (1995) studied field 
performance of tissue culture-derived date palm 
clonally produced by somatic embryogenesis, 
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Fig. 36.2 Micropropagation of date palm through 
somatic embryogenesis. (a) Embryogenic callus, (b) 
development of somatic embryo, (c) germination of 


and the results demonstrated that these plants 
started bearing fruits within 4 years from field 
planting. 


36.3.4 Embryo Rescue 


Removal of a zygotic embryo from the seed and 
planting in a sterile nutrient culture medium is 
known as embryo rescue technique. Embryo cul- 
ture has several potential applications in crop 
improvement programs and to produce hybrids 
(Hodel 1977). Excised embryos cultured in vitro, 
under suitable basal nutrient culture media, usu- 
ally germinate immediately. Embryo rescue was 


somatic embryo, (d) regenerated plantlets, (e) hardened 
plants in nursery (Source: Kumar et al. 2010a) 


successful in reducing the date palm height 
(Sudhersan et al. 2009). 


36.3.5 Regeneration 
from Protoplasts 


Ever since regeneration of fertile plants from iso- 
lated protoplasts was reported in Nicotiana taba- 
cum (Nagata and Takebe 1970), plant regeneration 
through protoplasts has been reported for more 
than 400 plant species (Davey et al. 2005). There 
are very few reports on date palm protoplasts. 
Chabane et al. (2007) and Rizkalla et al. (2007) 
were successful in inducing callus formation 
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from protoplasts. However, plant regeneration 
from protoplast callus is yet to be accomplished. 


36.3.6 Genetic Transformation 


Genetic engineering helps in reducing time in 
developing new and improved cultivars. T-DNA 
delivery with Agrobacterium tumefaciens 
(Horsch et al. 1984) and direct gene transfer with 
particle bombardment. Klein et al. (1988) have 
been used successfully in genetic transformation. 
These developments resulted in production of the 
first fertile transgenic plants (McCabe et al. 1988; 
Fromm et al. 1990; Vasil et al. 1992; Wan and 
Lemaux 1994). Agrobacterium-mediated trans- 
formation has several advantages over particle 
bombardment method, e.g., integration of a well- 
defined DNA sequence, typically low copy num- 
ber, and preferential integration into actively 
transcribed chromosomal regions (Gheysen et al. 
1998), and different approaches have been pur- 
sued to improve the efficiency of Agrobacterium- 
mediated transformation in recalcitrant monocot 
plant species (Cheng et al. 2004; Kumlehn et al. 
2006). Successful infection of date palm embryo- 
genic callus with Agrobacterium, (Saker et al. 
2009) and gene transfer by particle bombardment 
method (Habashi et al. 2008; Saker et al. 2007) 
has been reported. Preliminary investigations of 
physical factors influencing gene delivery using a 
biolistic gun indicated that flight distance of 
microprojectiles and applied pressure were the 
most important parameters (Saker et al. 2007). 


36.3.7 Molecular Approaches 
for Identification of Genetic 
Fidelity and Sex 
Determination 


Molecular markers are a very important resource 
for genetic improvement of all crops. DNA mark- 
ers, especially those based on simple sequence 
repeats and single-nucleotide polymorphism, play 
an increasingly important role in plant variety 
identification, germplasm resource collection, and 
genetic improvement. Molecular marker studies 
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are very limited in date palm. Random Amplified 
Polymorphic DNA (RAPD) technique has been 
developed to characterize date palm genotypes or 
cultivars and to examine their phylogenetic rela- 
tionships (Trifi et al. 2000; Jain et al. 2011). Kumar 
et al. (2010a), using RAPD and ISSR markers, 
reported that the micropropagation protocol devel- 
oped by them for rapid in vitro multiplication is 
suitable for clonal propagation of date palm and 
corroborated the fact that somatic embryogenesis 
can also be used as one of the safest modes for 
production of true-to-type plants (Fig. 36.3). The 
frequency of somaclonal variations was found to 
be age dependent, and RAPD analysis showed 
genetic variations in approximately 4 % of the 
analyzed plants (Saker et al. 2000). 


36.3.8 Genomics 


There have been a very limited number of 
genome-wide studies on date palm. Al-Dous 
et al. (2011) from Weill Cornell Medical College 
in Qatar tried to sequence the entire date palm 
genome using Solexa (illumine) sequencer based 
on a shotgun method and released the sequence 
data subsequently (http://qatar-weill.cornell.edu/ 
research/datepalm Genome/index.html). 
According to their analyses, the genome assem- 
bly has a predicted genome size of ~550 Mbp, 
assembled 58 % of the genome (382 Mb), and 
predicted 25,059 genes (Al-Dous et al. 2011). 
Bourgis et al. (2011) have reported comparative 
transcriptomic study on mesocarps of both oil 
and date palm based on pyrosequencing data 
from the Roche GS FLX Titanium platform. 
Recently Al-Mssallem et al. (2013) report a 
genome assembly for an elite variety (Khalas), 
which is 605.4 Mb in size and covers 490 % of 
the genome (B671 Mb) and 496 % of its genes 
(B41,660 genes). Genomic sequence analysis 
demonstrates that Phoenix dactylifera experi- 
enced a clear genome-wide duplication after 
either ancient whole genome duplications or 
massive segmental duplications. Genetic diversity 
analysis indicates that its stress resistance and 
sugar metabolism-related genes tend to be 
enriched in the chromosomal regions, where the 





Fig. 36.3 PCR amplification products obtained with (a) 
RAPD primer OPE-15 and (b) ISSR primer UBC-835. 
Lane M represents one kb ladder; lane 1 represents mother 


density of single-nucleotide polymorphisms is 
relatively low. Using transcriptomic data, they 
also illustrated the date palm’s unique sugar 
metabolism that underlies fruit development and 
ripening. Besides, full-genome assemblies of 
plastid (Yang et al. 2010) and mitochondrion 
(Fang et al. 2) and transcriptomic profiles for 
fruit development based on pyrosequencing data 
(Yin et al. 2012) are also reported. The genomic 
and transcriptomic data may be useful for further 
genomic studies not only on P. dactylifera but 
also in other Arecaceae species. 


36.3.9 Conclusions and Prospects 
Date palm is an important plant species in Near 


Eastern countries and a main source of income. 
Infestations of red palm weevil (RPW) have been 
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plant; lanes 2—17 represent tissue cultured raised plants at 
the 42nd cycle of transfer (Source: Kumar et al. 2010a) 


reported in over 50 % of the date palm-growing 
countries (Faleiro 2006). Genetic improvement 
of date palm for RPW resistance seems to be the 
utmost priority to address. Developing a robust 
technology that can be applied to all the geno- 
types, optimization of bioreactor technology for 
large-scale production of propagules, and devel- 
opment of haploid plants are also very essential 
to make date palm cultivation very successful. 


36.4 Acacia Species 

36.4.1 Introduction 

Genus Acacia belongs to Leguminosae family 
and encompasses approximately 1,300 tree and 


shrub species of tropical and warm temperate 
regions. Due to its utility in afforestation, and 
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agroforestry, this plant has become a key compo- 
nent of arid regions. The plant has a fast-growing 
tap root that enables it to utilize moisture stored 
in lower soil layers and to remain green for a long 
time in the dry season. The extensive root system 
makes it ideal for dune stabilization and 
preventing soil erosion. Acacia species have the 
natural ability to fix atmospheric nitrogen and 
hence can thrive in low-fertility soils (Galiana 
et al. 2002). They can be used for combating 
desertification and for restoring degraded land 
and for economic products, like gum, tannins, 
wood, and fodder (Badji et al. 1993; Quoirin 
et al. 2000). The fast expansion of Acacia planta- 
tions warrants the need to develop efficient meth- 
ods for mass production of superior quality 
planting stocks (Quoirin et al. 2000). Acacia 
propagated through seeds exhibit poor germina- 
tion, and seed-raised plants show wide genetic 
variability and are not true-to-mother plants. 
Vegetative propagation through stem cutting is 
not successful in Acacia species as rooting of cut- 
tings harvested from mature tree 1s difficult (Beck 
and Dunlop 2001). 


36.4.2 Micropropagation 


In the last three to four decades, in vitro 
approaches have been efficiently utilized in 
large-scale propagation and genetic improve- 
ment of the tree species (Pefia and Séguin 2001). 
Propagation of woody trees through tissue cul- 
ture has many advantages over conventional 
propagation methods like fast multiplication, 
production of disease-free plants, season- 
independent production of plants, quick release 
of improved cultivars, germplasm conservation, 
and easy exchange (Asthana et al. 2011; 
Phulwaria et al. 2012). Many species of Acacia 
have been studied for micropropagation (Dewan 
et al. 1992; Nandawani 1995; Rout et al. 1995; 
Hossain et al. 2001; Xie and Hong 2001a, b; 
Al-Khalifah 2004; Abbas et al. 2010; Samake 
et al. 2011; Rathore et al. 2014). The results 
showed that A. tortilis, A. ehrenbergiana, and A. 
gerrardii responded well, whereas A. seyal and 
A. senegal were difficult to establish in vitro, and 
these differences persisted throughout in vitro 
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and acclimatization stages (Al-Khalifah 2004). 
Mathur and Chandra (1983) cultured stem seg- 
ments of a mature tree of A. nilotica that pro- 
duced a few shoot buds from the nodal portion, 
and roots emerged from the opposite pole. Singh 
et al. (1993) also reported in vitro axillary bud 
induction from explants of a 20—25-year-old 
field-grown tree. However, they were not suc- 
cessful in inducing the roots. Recently, Rathore 
et al. (2014) reported a micropropagation method 
using liquid culture and nodal explants collected 
from 15- to 20-year-old A. nilotica. The multipli- 
cation rate in liquid cultures was ten times higher 
as compared to agar medium cultures. In vitro 
regeneration of A. mangium from shoot culture 
(Ahmad 1991; Galiana et al. 1991; Bhaskaran 
and Subhash 1996) and A. mangium and A. man- 
gium X A. auriculiformis, a hybrid (Monteuuis 
et al. 2013), has been reported, and clonal propa- 
gation of superior plants of A. mangium is cur- 
rently being carried out for large plantations. In 
vitro regeneration of A. mangium through 
somatic embryogenesis and organogenesis (Xie 
and Hong 2001a, b) and protoplast isolation 
(Toshihiro and Sonoko 1999) have also been 
reported. 


36.4.3 Genetic Transformation 


Commercial production of transgenic annual 
crops has become a reality in many parts of the 
world. Trees were considered to be recalcitrant 
for genetic transformation, and many researchers 
wonder whether genetic engineering of perennial 
trees will allow their eventual commercialization. 
Recent advances in plant molecular biology and 
biotechnology made it possible to develop trans- 
genic trees for different value-added purposes 
(Pena and Séguin 2001; Xie and Hong 2002; 
Yang et al. 2008) including woody leguminous 
trees (Igasaki et al. 2000; Xie and Hong 2002; 
Yang et al. 2008). A number of different 
explants have been used in the development 
of Agrobacterium-mediated transformation 
(Archilletti et al. 1995; Franche et al. 1997; Polin 
et al. 2006; Li et al. 2007). More specifically, for 
the transformation of leguminous trees, stems 
from rejuvenated shoots and leaves were used as 
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explants (Xie and Hong 2002; Igasaki et al. 2000; 
Saafi and Borthakur 2002). Xie and Hong (2002) 
reported Agrobacterium-mediated genetic trans- 
formation of A. mangium using rejuvenated 
shoots as explant. High resistance to kanamycin 
(<300 mg L“'), during initial selection of trans- 
formed plants observed, is a major limitation in 
legume genetic transformation (Xie and Hong 
2002). 


36.5 Conclusions and Future 
Prospects 


Genetic engineering of trees entails a major 
investment that can generate considerable envi- 
ronmental and economic benefits over the long 
term. The research undertaken in plant molecular 
biology for trees 1s not comparable to the effort 
deployed for annual crops. Nevertheless, physio- 
logical and developmental processes specific to 
trees are being studied at the molecular level. 
Knowledge of the reproductive development, 
wood formation, and defense mechanisms 1s 
essential for the effective integration of biotech- 
nology into tree improvement. Certainly, elimi- 
nating the requisite tissue culture step for legume 
transformation would be a great boost in progress 
toward development of high-throughput systems. 
Approaches for development of non-tissue cul- 
ture transformation systems such as floral dip 
method (Desfeux et al. 2000; Li et al. 2002) or 
meristem culture (McCabe et al. 1988; Rohini 
and Rao 2000) would be a great boost in progress 
toward development of high-throughput 
systems. 
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Abstract 

The twenty-first century human society is dealing with many difficulties 
besides climate challenge, and most of them are though diverse but highly 
interconnected. Transformation in agriculture and rural development are 
demanding urgent attention in this scenario. Human civilization is basi- 
cally an agrarian one, and rural people who practice agriculture are liter- 
ally the real bread earners. Neglecting agriculture and these endogenous 
people has visibly started to affect our ecosystem, environment, economy, 
and civilization. Sustainable solutions to most of these problems can be 
created only by combining endogenous knowledge reserve and the mod- 
ern biotechnological know-how. In order to successfully carry forward this 
process, we first need to ensure endogenous development. Rural develop- 
ment is inevitably connected to several key endeavors having global 
impacts like food security, protection of environment, and conservation of 
natural resource base including local landraces, germplasm, and biodiver- 
sity. Successful application of rurally relevant biotechnology will be a 
necessity in transforming agriculture besides enhancing and protecting 
rural livelihoods as well as rural integrity. 
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37.1 Introduction 

Technology is instrumental for human develop- 
ment. But efficient distribution system is equally 
important to deliver its benefit to all (UNDP 
2001). Information and communication technol- 
ogy have made this world literally a global vil- 
lage in terms of connectivity. But as evident wise 
use of appropriate technologies or know-how 
available is a great demand to give a dignified life 
to our resource-poor real villages (rural 
livelihoods where 75 % of world’s poor live) over 
the globe. Rural development and poverty reduc- 
tion 1s not anymore a charity as this is an urgent 
need linked to global climate change and sustain- 
ability concerns. In 2008 World Bank called for 
renewed emphasis on agriculture for develop- 
ment to reach the Millennium Development Goal 
of decreasing poverty by 2015. The International 
Assessment of Agricultural Knowledge, Science 
and Technology for Development ([AASTD) 
also stressed on the importance of transformation 
in the use of agricultural knowledge, science, 
technology, and management for social, environ- 
mental, and economic sustainability (UNEP 
2008; The World Bank 2008). The field of Dio- 
technology has accelerated greatly in the twenti- 
eth century following rediscovery of Gregor 
Mendel’s laws of inheritance, discovery of DNA’s 
(deoxyribonucleic acid) structure, and serendipi- 
tous development of recombinant DNA technol- 
ogy and regeneration techniques (Curtis 2008; 
Mirkov 2003). Biotechnological innovations and 
scientific rural development through responsible 
use of rurally relevant biotechnologies are the 
requirement of today. Crop productivity is declin- 
ing, environment is quite degraded, and food 
price is on the rise. The affluent percent of the 
increasing world population on the other hand is 


demanding for diverse foods. Rural-urban inte- 
gration is leading to diversification of income 
sources of very poor rural people whose main 
occupation was agriculture so far. In order to 
ensure food security and restore the environmen- 
tal balance, ensuring endogenous development, 
or improving the condition of human food pro- 
ducers, the crucial but highly neglected human 
resource is of utmost importance especially in 
developing countries but also in some parts of the 
developed world. Rural development is linked to 
agricultural development, food security, poverty 
reduction, world economy, global climate, and 
vice versa. A successful instance in this context is 
Vietnam which once was a food-deficit country 
and is now the second largest rice exporter in 
world. Seventy three percent of Vietnam’s popu- 
lation lives in rural areas, and agriculture is their 
main source of income. They have achieved this 
success largely through the development of their 
smallholding endogenous farmers (IFAD Food 
prices 201 4a). 


37.2. What Is Rural in the Modern 
Trend of Urbanization? 


Human civilization has been an agrarian one and 
hence invariably rural for long. Around 
10,000 years ago, the nomadic Homo sapiens 
gradually shifted to a grounded farming-based 
lifestyle. This is known as the Neolithic revolu- 
tion or the New Stone Age in the human history 
that marks the beginning of a civilization based 
on agriculture still alive today (Gepts 2003). The 
rural and urban differentiation emerged gradu- 
ally over the course of time. While urban society 
flourished with industrial revolution and contin- 
ued to explore diverse professions. Agriculture 
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and related farming practices continued to be the 
profession of rural area. Urban lifestyle grew far 
more comfortable and became a coveted destina- 
tion of human race. And although functionally 
complementing each other, rural areas remained 
neglected and isolated due to lower population 
density and poor connectivity with the urban cen- 
ters of distinct economic development. In the 
trending wave of urbanization today, rural (com- 
prised of villages) is defined as anything that is 
not urban. But this scenario is not static. In the 
developing world, rural and urban areas are 
becoming increasingly interconnected socially 
and economically blurring the line of contrasting 
difference of the past and creating several non- 
farm income options for poor rural people. 
According to the UNFPA’s (United Nations 
Population Fund) 2007 “State of World 
Population Report,” half of the world population 
(3.3 billion) lives in urban areas, and it is esti- 
mated to grow up to 4.9 billion by 2030 (IFAD 
Rural poverty portal 2014b; UNFPA 2007). In 
this wave of urban-centric economic develop- 
ment, chronically deprived rural areas which also 
house half of the human population demand 
equal development. World food production has 
increased considerably in the last 50 years owing 
to long-term public and nonprofit investment in 
basic and agricultural research, but food insecu- 
rity and malnutrition are the main problem of the 
twenty-first century. While on the one hand 
increased income is putting pressure to increase 
food production due to increased consumption of 
diverse diet including more and more animal 
products, ironically, on the other hand, poverty is 
one of the main reasons behind hunger and mal- 
nutrition in many developing countries in Africa 
and Asia and the Pacific as well as in developed 
ones, and majority of the affected people live in 
rural areas (Cohen 2003). This loss of equilib- 
rium is leading to outflow of more and more peo- 
ple from rural to urban areas in the hope of better 
life resulting in an overload on the urban ecosys- 
tem. However the reality is the increase in the 
number of poor people living in city slums hav- 
ing very little facilities for a decent living. The 
first wave of urbanization noted in Europe and 
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North America from 1750 to 1950 fueled by 
industrialization increased the percent of urban 
population from 10 to 52. This process was grad- 
ual (spread over two centuries). But the second 
wave of urbanization that is now flowing over the 
developing world is rapid and ill prepared. The 
urban population is projected to increase from 18 
to 56 % in a short span of 80 years (1950-2030). 
In order to stop this futile vicious cycle, rural 
development should be seen as an urgent require- 
ment and opportunity especially by the develop- 
ing countries, most of which are basically rural 
(UNFPA 2007). 


37.3 Rural Poverty and Food 
Insecurity 


The world poverty map points out that around 1.2 
billion (one fifth of world’s population) are the 
subject of this daily injustice (Fig. 37.1). Among 
them around 800 million (70 %) live in rural 
areas. While rural poverty is commonplace in 
developing regions of the world, it is present even 
in the affluent countries like North America and 
Europe. Thus Eastern and Southern Europe, 
Central America, and parts of North America 
suffer from rural poverty (IFAD Rural poverty 
portal 2014b). Rural poverty is the result of lack 
of assets (land, livestock, and money), limited 
economic opportunities, poor education, discrim- 
ination (based on gender, age, race, and ethnic- 
ity), demographic factors, violent conflicts due to 
regional sociopolitical situation, and environ- 
mental degradation (Cohen 2003). 

Eradicate Extreme Poverty and Hunger 1s top- 
ping among the eight United Nations (UN) 
Millennium Development Goals with urgent 
attention. The UN is working with governments, 
civil societies, and other partners aiming for 
halving the poverty level of 1990 (from 28.3 % of 
all people in low- and middle-income economies 
to 14.2 %) by 2015. MDG 2013 report states that 


about 700 million fewer people lived in conditions 
of extreme poverty in 2010 than in 1990; ....how- 
ever one in eight people still go to bed hungry 
despite major progress. (UN 2013) 
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Fig.37.1 World poverty 
map (Source: Rural 
poverty portal, 
International Fund for 
Agricultural Development 
2014b) 


Poverty map 

Gni per capita 

Atlas method 
(Current Uss) 
Source: World Bank 2009 


37.4 Biotechnology Is Not 
a Magic Wand 


The 1992 Convention on Biological Diversity 
(CBD) defined biotechnology as “any technolog- 
ical application that uses biological systems, liv- 
ing organisms or derivative thereof, to make or 
modify products or processes for specific uses” 
(The CBD 2001). Whatever be the present defini- 
tion of this technology, biotechnology has always 
been an integral part of human development 
probably from prehistoric time. Selective breed- 
ing to raise better crop and livestock seen in the 
history of agriculture reflects the glorious back- 
ground of biotechnology. Expansion of biotech- 
nology in the twentieth century revolutionized 
farming business and will continue in doing the 
same in the twenty-first century. Food supply has 
outpaced the dreadful population growth of the 
twentieth century. Since 1950, international 
research is interested in useful research suitable 
in different agroecological regions around the 
world. Genetically improved high-yielding rice 
and maize varieties raised respectively by two 
CGIAR (Consultative Group on International 
Agricultural Research) institutes [International 
Rice Research Institute (IRRI), Philippines, and 
International Maize and Wheat Improvement 
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Centre (CIMMYT), Mexico] and the necessary 
inputs like fertilizers, insecticides, and irrigation 
resulted in the so-called Green Revolution start- 
ing in the late 1960s worldwide but prominently 
visible in developing countries like Mexico and 
India. Since the mid-1990s, transgenic technol- 
ogy is offering genetically modified (GM) soy- 
bean, cotton, and canola with important traits like 
insect resistance and _ herbicide tolerance 
(Chrispeels and Sadava 2003; Pardey and Wright 
2003). GM crops however are undergoing oppo- 
sition from various sectors of the society. If there 
is any scientific concern to express in this matter 
in addition to biosafety and loss of biodiversity, it 
is that the concept of GM crops is based on the 
same principle of purchased input and monocul- 
ture (cultivation of selected varieties only) as was 
practiced in Green Revolution is a subject of 
unsustainability in the long term. There is loom- 
ing concern of eventual loss of crop biodiversity 
by replacing numerous local cultivars with 
selected genetically uniform crops. Secondly this 
uniformity leads to the danger of sudden uncon- 
trollable disease outbreaks which is unusual in 
conventional sustainable agriculture. In addition 
since it is a high-input technology and controlled 
by multinational biotech and seed companies, 
future possibilities of rendering monopoly and 
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increasing poverty of very poor small and 
medium farmers by decreasing their importance 
further are a large concern. However in the con- 
text of this chapter, it is important to remember 
that this GM debate is mostly irrelevant in rural 
biotechnology or development of rural liveli- 
hoods of developing countries. Most importantly, 
the know-how of biotechnology is not enough at 
this crossroad as biotechnology can’t behave as a 
magic wand itself to solve our each and every 
problem. In order to utilize biotechnology 
effectively, implementing its sustainable and 
humane application is our own responsibility 
(IFAD 2002; Chrispeels and Sadava 2003). The 
twentieth century biotechnology has relied heav- 
ily upon purchased input strategy which is high 
input. However, intensive and high-input agricul- 
tural practices assumed so far aiming for infinite 
food production are now causing global concerns 
by deforestation (unplanned destruction of for- 
ests to create more arable land), soil degradation 
(intensive farming of the same land for years), 
water pollution (extensive use of pesticides and 
herbicides), and loss of crop biodiversity (using 
genetically uniform crops) (Chrispeels and 
Sadava 2003). Developing countries which eco- 
nomically cannot afford high-input agriculture 
for long should stop competing with the devel- 
oped world. Low-input farming by combining 
indigenous or conventional technologies with 
scientific knowledge is highly suitable here and 
could be quite profitable if managed properly in 
these regions especially. This is termed as regen- 
erative or sustainable agriculture. This approach 
takes care of preserving and restoring the natural 
resource base of local environment (IFAD 2002). 


37.5 Promoting Sustainable 
Agriculture by Using Modern 
and Indigenous Know-How 
and Supporting 

the Resource-Poor 


Smallholders 


Faced by striking climate changes caused by 
mostly anthropogenic factors, practicing sustain- 
ability is the only solution the world is hoping for 
(IPCC 2007). It is equally clear that rural endoge- 
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nous farmers or smallholders have a key role in 
building a sustainable livelihood. As a true pro- 
ducer these farmers are also the protector of local 
biodiversity however being ignorant most of the 
time of this great fact and unappreciated as well. 
Poverty is the chronic burden that leaves them 
stuck, literally unrewarded and deprived of the due 
recognition (IFAD 2002). There are about 525 
million smallholder farms worldwide: 388 million 
in Asia, 44 million in Europe and Russia, 33 mil- 
lion in Africa, and five million in America. These 
smallholder farms face varied challenges like lim- 
ited access to land, water, market, credit, organiza- 
tion, education, or guidance to cope up with risks 
and that reflect in the quality and quantity of their 
products (IFC 2013). The 2011 Rural Poverty 
Report insists to support rural areas to shift from 
subsistence farming to farming for business eftec- 
tively to put an end to poverty and hunger. The 
report has put an equal stress on preserving the 
natural resource base of the rural areas through 
intensification of sustainable agriculture (IFAD 
2011). Combining the knowledge of modern bio- 
technology with indigenous knowledge and prac- 
tices of farmers to protect the natural resource base 
of a region like soil, water, and local biodiversity is 
the key concept of sustainable agriculture. FAO’s 
(Food and Agriculture Organization of the United 
Nations) “Save and Grow” guide for policymakers 
toward sustainable intensification of smallholder 
crop protection is a concise guide for everybody 
interested to understand this concept (FAO Save 
and Grow 2012a, b, c, d, e). A global transforma- 
tion is highly sought after to make agriculture low 
input, observant, natural, locally appropriate, eco- 
nomic, profitable for the farmers, and environment 
caring. Smallholders will be instrumental in this 
effort for their collective experience and knowl- 
edge of local conditions. However, the majority of 
the smallholders are resource poor. Sufficient sup- 
port in the form of fund, policy, know-how, and 
guidance will have to be provided to make them 
bring the desired transformation in farming (IFC 
2013). In order to recognize the importance of 
endogenous small farmers in reducing poverty and 
ensuring food security and sustainable develop- 
ment, the United Nations has declared 2014 as the 
International Year of Family Farming UYFF). 
With over 40 [TY FF National Committees, the mis- 
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sion is to improve the situation of family farmers 
through communication, awareness activities, pol- 
icy advocacy, and research (IFAD IYFF 2014c). 


37.6 Some Sustainable Farming 
Techniques Practiced 
Successfully Around 
the World 

37.6.1 Conservation Tillage 

Soil is the base of agriculture. Conventional agri- 

culture relies hugely upon tillage. This has 

adverse effects like loss of rich topsoil, soil 
organic matter and nutrient, and soil-water con- 
tent. Intensive and extensive agriculture amplifies 
these losses raising serious concern. Reduced to 
zero-till farming is gradually spreading. This 
helps in conserving the soil, increasing drought 
tolerance through water retention and carbon 
sequestration (reducing greenhouse gas emis- 
sions), and increasing profit by saving in labor 

and energy. Weed and pest management is a 

problem in this practice. Applying integrated pest 

and weed management techniques is to be accom- 
panied and optimized to aid this approach (FAO 

Soil 2012c; Shaver 2003; The World Bank 2008). 





Water stress indicator ‘ a. 
in major basins ‘ 
Overexploited ; 
(more than 1.0) a 
Heavily exploited 
(0.8 to 1.0) 
Moderately exploited 
{0.5 to 0.8) 
Slightly exploited P 
(0 to 0.5) 
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37.6.2 Biological Nitrogen Fixation 


This is a form of green manuring to restore soil 
fertility. Nitrogen-fixing legumes (preferably inocu- 
lated with nitrogen-fixing Rhizobium) are utilized 
by intercropping with cereals to increase soil fertil- 
ity. Leguminous trees and shrubs like Sesbania ses- 
ban, Tephrosia vogelii, Crotalaria ochroleuca, and 
Faidherbia albida which accumulate nitrogen in 
their leaves and roots are used in eastern and south- 
ern Africa to complement nitrogen-deficient maize 
farming. This is a successful application of agro- 
forestry. A fernlike floating plant Azolla pinnata 1s 
inoculated with nitrogen-fixing bacteria (its symbi- 
otic partner) in rice fields as green manuring in 
Asian wetland rice cultivation (FAO Soil 2012c; 
Shaver 2003; The World Bank 2008). 


37.6.3 Rainwater Harvesting 
and Deficit Irrigation 


Water is a limited but indispensable natural 
resource. Intensification of agriculture by high- 
input strategies in the twentieth century has led to 
depletion of surface water in many areas 
(Fig. 37.2). With increasing demand for water in 
industry, livestock, aquaculture, and domestic 


Fig. 37.2 Global water stress map noted in river basins (Source: World Development Report 2008; The World Bank 


2008) 
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use, efficient management of water in agriculture 
is the key to sustainability in the twenty-first cen- 
tury especially in semiarid and arid zones. Using 
improved crop varieties and practicing good soil 
health management, supplemental irrigation by 
harvesting rain water, allowing controlled water 
stress (deficit) during vegetative stage, and apply- 
ing water during critical growth stages like before 
flowering have multiple benefits like saving water 
and preventing soil salinization by excessive irri- 
gation but little reduction in crop productivity. 
Agroforestry, conservation tillage, and integrated 
natural resource management approaches also 
have notable contribution in water management 
(FAO Water 2012d; UNEP 2008; The World 
Bank 2008). 


37.6.4 Integrated Pest 


Management 


Integrated pest management (IPM) practices of 
balancing on-farm pest-predator populations to 
achieve natural pest control without excessive 
use of pesticides 1s invaluable in maintaining 
agroecology as well as for sustainable agriculture 
as it is evident that chemical pesticides cannot be 
a sustainable solution for pest control besides 
their huge adverse environmental impacts. 

Pesticides derived from natural sources such 
as animals, plants, microbes (bacteria, fungus, 
virus, or protozoan), and certain minerals are 
known as biopesticides. Biopesticides are less 
toxic, more specific, and a good companion of 
IPM. Bt proteins derived from strains of the bac- 
terium Bacillus thuringiensis thus kill specifi- 
cally one or a few related species of insect larvae 
(The World Bank 2008). Bentley (1983) has dis- 
cussed the various aspects that are relevant to 
agriculture in tropics with reference to biological 
control and future farming practices and con- 
cluded that an integrated approach between pure 
and applied biologists will improve the quality of 
life. Likewise the role of nectaries in agriculture 
in relation to biological control of crop pests has 
been discussed by Chuabal et al. (1998), and they 
conclude that nectaries play a vital role in agri- 
culture. For more details the reader may refer the 
original paper. 
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37.6.5 Integrated Crop-Livestock 


Management 


This is the best example of sustainable agriculture 
in an efficient and economical way (Fig. 37.3). 
Thus crop residues serve as feed for livestock, 
while animal power is used in agriculture and 
other farm purposes; animal waste is used as bio- 
fertilizer of soil, in producing biogas and energy 
as well as biopesticide (Rota and Sperandini 2010). 


37.6.6 Fishery and Aquaculture 


Fishery and aquaculture can similarly be linked to 
integrated farm management for effective mainte- 
nance and management of water and other natural 
resources. Fish culture in rice fields (either in the 
same plot or adjacent ones) as practiced in China 
(world’s largest fish producer and exporter) and 
many other Asian countries 1s a successful exam- 
ple of such integration (FAO 201 2a, b, c, d, e). 


37.6.7 Algal Biotechnology 


Besides biofuel production, rapidly growing 
algae can produce oil, alcohol, protein, and bio- 
mass and have high potential of serving as a plant 
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Fig. 37.3 The cyclic nature of resource management in 
integrated crop-livestock farming system (Source: http:// 
www.ifad.org/Irkm/factsheet/integratedcrop.pdf, Rota and 
Sperandini 2010) 
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factory for functional foods, enzymes, drugs, and 
allied novel products. Algae can also be used as 
green fertilizer and bioinsecticide and in biore- 
mediation. Highly potential algal biotechnology, 
however, still needs to overcome technological 
difficulties and establish itself as cost-effective. 


37.6.8 Biofuel 


With the ever-increasing oil prices, first- 
generation liquid biofuels like ethanol and bio- 
diesel offer a lucrative supply of renewable 
energy from plants with environmental benefits. 
Currently ethanol is produced from maize and 
sugarcane, while biodiesel is produced from soy- 
bean, sunflower, palm oil, Jatropha, and other 
agricultural feedstocks. Thus substantial scien- 
tific and technological progress achieved during 
the last decade (2001—2011) promises successful 
management and domestication of Jatropha as a 
future industrial oilseed crop. As for example, 
genomic breeding programmes will be highly 
promising to develop perennial crops specially 
like oil palm (Elaeis guineensis), jatropha 
(Jatropha curcas), macaw palm (Acrocomia acu- 
leate) and pongamia (Pongamia pinnata) dedi- 
cated for biodiesel production (Alves et al. 2015; 
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8% 
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Brazil 
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46% 


L. Bandopadhyay and S.R. Sikdar 


Bahadur et al. 2013; Carels et al. 2013). The sec- 
ond generations of biofuels are produced from 
nonedible lignocellulosic biomass mostly derived 
from crop and forest residues. Converting the 
harmful algal blooms into biodiesel and other 
biofuels like bioethanol, biohydrogen, and biogas 
is another possibility. Production cost is a con- 
cern however so far. Many countries are produc- 
ing and encouraging biofuel production 
(Fig. 37.4). But its economic viability (produc- 
tion cost and effect on food and feed prices) is 
still under assessment (The World Bank 2008). 


37.6.9 Sericulture 


Rearing silkworm on mulberry plants for produc- 
tion of raw silk 1s a low-input/high-output option 
for enhancing rural livelihoods economically as 
well as culturally. For example, Indian silk, 
famous for its diverse pattern and designs, is an 
ancient village craft practiced in many parts of 
the country. In China fish cultivation is integrated 
with mulberry cultivation and silkworm rearing. 
As a perennial plant mulberry also helps in 
agroforestry-based soil fertility enhancement and 
conservation (Bao-tong and Hua-zhu 1984; 
Central Silk Board 2014). 


Biodiesel (6.5 billion liters) 


Other 
12% 


United States 
13% 





European 
Union 
75% 


Fig. 37.4 Percentage of global production of biofuels ethanol and biodiesel in 2006 (Source: World Development 


Report 2008; The World Bank 2008) 
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37.6.10 Apiculture 


Apiculture or beekeeping for production of honey 
and other bee products such as beeswax, royal 
jelly, pollen, propolis, or bee venom or queens is 
another option for diversifying income option for 
smallholders as part of farming. Colony manage- 
ment besides dealing with deadly honeybee pests 
and diseases, production and processing of honey, 
handling beeswax, and pollen trapping are several 
aspects of apiculture. Suryanaryana has reviewed 
the role of nectar in bee biology and beekeeping 
in detail and commented that modern beekeeping 
is still in infancy in several countries including 
India. Technical assistance through apiculture 
training for ensuring quality control of products 
to marketing is required at the national level for 
successful and truly profitable implementation of 
this method through the smallholders (Gegner 
2003; MAAREC and USDA 2004; Suryanarayana 
1998). 


37.6.11 Mushroom Cultivation 


As a quality food, mushroom is a good source of 
protein, vitamins, and minerals and has high mar- 
ket demand. In addition it has also many medici- 
nal benefits including antibacterial, antiviral, and 
antiprotozoal activities and cytotoxicity against 
cancerous tumor cells. Edible mushroom cultiva- 
tion using commercially viable strains and fol- 
lowing proper culture techniques is a great option 
for generating food and income security for 
smallholders and rural people. Again, national 
mushroom cultivation programs supporting the 
technical and marketing part of the business are a 
necessity for the benefit of the resource-poor 
smallholders (Chakravarty 2011; Marshall and 
Nair 2009). 


37.6.12 Medicinal Plants 


Medicinal plants are wealth of every rural neigh- 
borhood and are greatly used by people in devel- 
oping countries. Thus around 6,000—7,000 Indian 
plants have been documented to have usage in 


parallel systems of medicine like Ayurveda, 
Siddha, Unani, and Homeopathy in India. The 
increasing percentage of world population is 
trending toward use of plant-based medicine. 
Cultivation of medicinal plants is not only profit- 
able but contributes in conservation of local and 
global biodiversity. Bahadur et al. have discussed 
in depth various aspects of medicinal plant diver- 
sity, conservation of rare/threatened plants, and 
biotechnological applications including micro- 
progation, transgenics, antibodies, vaccines from 
transgenics apart from antioxidants, nutraceuti- 
cals, and dermaceuticals. In addition they have 
provided about 60 websites where information 
on medicinal plants can be accessed. The World 
Health Organization (WHO) has also brought out 
a series of monographic volumes on selected 
medicinal plants that aims to provide scientific 
information on their safety, efficacy, and quality 
control of plants used worldwide. These provide 
comprehensive scientific references for drug reg- 
ulatory authorities, physicians, traditional health 
practitioners, | pharmacists, | manufacturers, 
research scientists and the general public. Each 
monograph has a standard format with informa- 
tion on pharmacopoeial quality assurance, 
medicinal uses, pharmacology, safety issues, and 
dosage forms. Encouragement and support at the 
national level will be required eventually to 
establish the practice of cultivating, document- 
ing, and preserving the local medicinal germ- 
plasm (Bahadur et al. 2007; Kuipers 1997; 
NMPB 2014; WHO 1999, 2004, 2007, 2009). 
As evident most of the abovementioned farm- 
ing practices have been proved to be sustainable 
and profitable. However, in order to inculcate 
these practices among the resource-poor endog- 
enous farmers globally, much effort (in terms of 
training, guidance, and funding) and follow-up 
procedures (to ensure profitability and continua- 
tion of the practices) are still required at the 
national level. There are several local factors 
associated as well depending upon agroecology, 
culture, and economy of the region of interest 
which must be taken care of in order to be locally 
relevant and effective while applying these sus- 
tainable practices aiming for endogenous devel- 
opment. These efforts and investments if 
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successfully executed will on the way keep build- 
ing the desired human resource, the core group in 
protecting our global environment and ensuring 
our every dire need sustainably utilizing the ever- 
increasing benefits of modern biotechnology and 
allied technologies. 


37.7 Modern Biotechnology Can 
Help in Increasing Genetic 
Diversity to Help a Local 
Farmer and Ensuring Global 
Food Security 


Revolution in biotechnology has enhanced the 
toolbox of modern breeder. The much-debated 
transgenic technology has a great promise in 
genetic modification. Genetically modified 
insect-resistant and herbicide-resistant crops are 
presently covering a notable acreage of global 
cultivation. Tissue culture-based techniques like 
somatic hybridization, haploid breeding, embryo 
rescue, and culture have aided in bringing widely 
additional variations from intra/interspecific to 
intergeneric that could be possible by conven- 
tional breeding. These methods are highly poten- 
tial and free of transgenic controversies, biosafety 
concerns, and purchased (high) input strategies, 
as they are based on the same principle of genetic 
improvement relying upon a whole genome as 
practiced by conventional breeding instead of 
selected genes only as practiced by transgenic 
technology. Mutagenesis (induced by chemicals 
or radiation), somaclonal mutagenesis, and 
Targeting Induced Local Lesions in Genomes 
(TILLING: molecular biology-based technique) 
are the techniques for raising variations by muta- 
tion and are equally widely potential. DNA- 
marker-assisted selection technologies based on 
molecular biology techniques like PCR (poly- 
merase chain reaction), nucleic acid hybridiza- 
tion, and sequencing have greatly complemented 
selection part in the wide crossing programs. 
Tissue culture-based micropropagation and 
somatic embryogenesis have been instrumental in 
vegetative propagation of elite or disease-free 
plant varieties. Biotechnology is also aiding in 
diagnosis of diseases, plant nutrition, and plant 
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protection combined with IPM and other farm 
management techniques (FAO 2010). 

Nature believes in unity in_ diversity. 
Purchased input agriculture ignores that fact. 
The monoculture farming has already caused 
notable loss in local biodiversity. Irresponsible 
use of pesticides has added to this by increasing 
threat from once manageable crop pests. Crop 
productivity intensification has a limit. The fact 
that high-input/high-output way of agriculture 
is not sustainable is already visible. Endogenous 
development is also an inseparable part of this. 
Future sustainable agricultural practices require 
identification and conservation of rural landra- 
ces and germplasms by maintaining village seed 
banks. Documenting rural biodiversity knowl- 
edge, raising improved varieties with agronomic 
traits of interest of local farmers (by breeding 
programs using the gene bank of wild crop rela- 
tives), and on-farm research to solve the real 
challenges faced by farmers using the power of 
biotechnological innovations are required. In 
addition encouraging and supporting mainte- 
nance of kitchen garden (for cultivation and 
preservation of local landraces and medicinal 
plants especially) by smallholders will be a real 
wholesome approach while aiming for both 
rural development and conservation of rural 
natural resources. CGIAR institutes including 
ICRISAT (International Crops Research 
Institute for the Semi-Arid Tropics), ICARDA 
(International Centre for Agricultural Research 
in the Dry Areas), and IRRI are working visibly 
in this mission. These institutes are also involved 
in genetic improvement in livestock and fish 
(CGIAR 2012; FAO 2010). However, more and 
more national awareness and involvement are 
highly desired. Creating options for rural public 
works is not the ultimate solution. In order to 
build a sustainable society endogenous develop- 
ment, preservation of regional natural resources 
and rural integrity are equally important. We 
need diverse schemes and projects to realize 
these goals at the national level throughout 
the globe. 

Domestication of crops and livestock for the 
last 10,000 years has inevitably caused unavoid- 
able loss of this precious biodiversity. 
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Intensification of agriculture under increasing 
pressure of population and consumer demand 
will continue to do the same as part of civiliza- 
tion. From 2011 to 2020, besides the Strategic 
Plan for Biodiversity, the “Second Global Plan of 
Action for Plant Genetic Resources for Food and 
Agriculture” will thus be supervising and moni- 
toring the conservation of world’s heritage of 
plant genetic resources across the globe. This 1s a 
supporting component of IPTGRFA (The 
International Treaty on Plant Genetic Resources 
for Food and Agriculture) (FAO 2011). Access to 
genetic diversity in crops from other regions of 
the world is also a necessity to find global solu- 
tion for local problems. ITPGRFA and CBD are 
working to realize that goal through establish- 
ment of rules and regulations at the international 
level (FAO Crops 2012b). 


37.8 Conclusion 


Half of the world’s population is rural, while 
75 % of the world’s poor live in rural areas where 
agriculture is the main source of income. In the 
current trend of urbanization however, rural peo- 
ple are moving toward diverse sources of income, 
and agriculture is becoming the less preferable 
choice due to its risks and challenges beyond the 
ability of poor rural farmers especially in devel- 
oping countries to cope up with. Intensive agri- 
culture has hugely affected global environment 
partly out of need and mostly out of ignorance. 
Transforming the way of agriculture is a neces- 
sity. Combining the benefits of conventional and 
modern technologies and supporting the develop- 
ment of endogenous farmers are a priority of 
today. Practicing sustainable agriculture and 
enhancing rural livelihood are intimately con- 
nected with overcoming the current global chal- 
lenges including food security, poverty reduction, 
and climate changes and eventually in ensuring 
the sustainability of human civilization. 

In this context we need to remember, however, 
that rural development is not an overnight event. 
The follow-up procedures are most important 
to keep the systems consistent and error-free 
everywhere. The helpless status of Br cotton 
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cultivation in Gujarat, India (the Navbharat 151 
case), where science has got bewildered by 
business, politics, and ethical issues is a great 
reminder of this concern or need (Gupta and 
Chandak 2005). 
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